Stay updated with the latest features and improvements to Prologue AI.
Inpaint Tool Now Available with Image-1.5 Edit
Precisely edit specific areas of your images with our new Inpaint tool. Select the exact regions you want to modify using an intuitive mask editor, then let AI seamlessly regenerate those areas based on your prompt.
- Upload an image to the Image → Image - Edit playground
- Click on the uploaded image to view it in fullscreen
- Click the 'Select Inpaint Area' button to open the mask editor
- Use the brush tool to paint areas you want to edit, or use the eraser to remove parts of the mask
- Use the rectangle tool for quick rectangular selections
- Adjust brush size and opacity for precise control
- Once your mask is ready, enter a prompt describing what you want in the masked area
- Generate to see AI seamlessly fill the selected regions with your desired content


Prologue Intent Now Available
Prologue Intent bridges the gap between human vision and AI execution. It translates your audio, video, and visual references into precise scripts and high-fidelity prompts that capture your true creative intent.
- Translate raw audio, video, and visual references into precise scripts
- Generate high-fidelity prompts that capture your true creative intent
- Turn a mood board and a voice memo into AI-ready creative direction.
- Align image references and sound cues into a single narrative intent.
- Reduce prompt iteration by establishing intent upfront.
- Set creative direction once, then generate consistently.


Full Platform Redesign
We've completed a comprehensive redesign of Prologue AI, bringing you a cleaner interface, improved workflows, and better performance across the platform. We value your feedback and would love to hear your thoughts on the new design.
- Cleaner, more intuitive design
- Improved navigation and user experience
- Better visual hierarchy and organization
- Faster load times
- Smoother interactions
- Optimized workflows
- Streamlined creation process
- More efficient tool access
- Enhanced productivity features
GPT-Image-1.5 Edit Available
GPT-Image-1.5 Edit is now available for high-fidelity image editing
- High-fidelity edits with strong prompt adherence
- Choose quality (low / medium / high)
- Choose image size (auto, 1024×1024, 1536×1024, 1024×1536)
- Generate up to 4 images per request


Kling 2.6 Pro Available
Top-tier image-to-video generation with cinematic visuals, fluid motion, and native audio generation. Kling 2.6 Pro delivers professional-grade cinematic outputs with precise motion control. Supports English voice output.
- Top-tier image-to-video with cinematic visuals and fluid motion
- Native audio generation supporting Chinese and English voice output
- Professional-grade cinematic outputs with precise motion control
- 5s or 10s duration options with flexible audio on/off pricing
Seedream 4.5 Models Released
Major expansion of video-to-video utilities, image-to-video capabilities, and video-to-audio tools with professional-grade models.
- Ultra-realistic image generation
- Unified architecture for generation and editing
- Fast 2K image generation in seconds
- 4K support for production workflows

- Multi-reference editing with up to 10 style/identity images
- Confined edits that preserve scene structure
- Character consistency and face cloning
- Production-ready 2K and 4K outputs


- High-quality character animations with motion control
- Product showcase videos with smooth transitions
- Cinematic sequences from still images
- Social media content with optional background music
- Animate headshots with natural motion
- Bring product images to life
- Add environmental effects (trees swaying, water)
- Social clips (portrait → TikTok-style video)
- Add cinematic moves to static art
- Short ad creatives from stills
- Extend short clips with matching motion and context
- Create natural transitions at the tail of a shot
- Finish cuts for social or ad content
- Convert landscape videos to portrait for social media (TikTok, Instagram Reels)
- Transform portrait videos to landscape for YouTube or widescreen displays
- Adapt existing content to different aspect ratios without manual cropping
- Maintain visual quality while changing video format for different platforms
- Sequential shot generation maintaining motion style and camera language
- Cinematic style preservation across new scenes
- Multi-shot narratives with consistent visual language
- Character continuity across generated shots with up to 4 tracked references
- Production extensions with matching cinematic feel
- Social media content: TikTok, Instagram Reels, YouTube Shorts with animated subtitles
- Accessibility: Add captions for hearing-impaired viewers
- Multi-language content: Transcribe videos in 10+ languages automatically
- Professional videos: Add branded subtitles with custom fonts and colors
- Educational content: Create clear, readable subtitles for tutorials
- Marketing videos: Enhance engagement with karaoke-style word highlighting
- Craft background music for video
- Add creative audio layers that follow visuals
- Test different moods/genres on the same clip
- Add realistic foley (footsteps, doors, clinks)
- Create ASMR-like soundscapes
- Replace or enhance audio tracks without recording
O1 Models Available
Create or modify images and videos while maintaining consistency and continuity with our new O1 models.
- Advanced video editing with natural language instructions
- Character replacement while preserving motion
- Scene environment transformations
- Multi-reference editing with up to 4 elements/images
- Transform images into consistent, high-quality video scenes
- Stable character identity and object details
- Multi-reference support for complex scenes
FLUX.2 Models Available
Next-generation Flux models with advanced prompt understanding and professional-grade image generation and editing capabilities.
- High-quality image generation with advanced prompt understanding
- Support for @{field} syntax for referencing uploaded images
- Professional-grade visuals up to 2K

- Enhanced image editing with multi-reference support
- Up to 8 images via API (9MP total limit)
- Color matching with hex codes or image references
- Natural language editing instructions


Nano-Banana Pro Model Launch
Google's state-of-the-art Nano-Banana Pro model delivers exceptional prompt adherence and sophisticated photo editing capabilities.
- Ultra-realistic image generation with exceptional prompt adherence
- Sophisticated photo editing using conversational text prompts
- Multi-image editing support with up to 10 reference images
- 4K output support for production workflows
- Advanced composition control

- Conversational text prompts for image editing
- Multi-image editing support
- Exceptional prompt adherence
- 4K output support
- Advanced composition control


SAM-3 Object Detection and Segmentation
Advanced 2-layer object detection system using Gemini 3 Pro for semantic understanding and SAM-3 for precise pixel-level segmentation.
- Semantic scene understanding with Gemini 3 Pro
- Precise image segmentation with SAM-3
- Video segmentation with tracking support
- Multi-concept detection and masking
- Enhanced content editing capabilities
- Great for masks/segmentation/rotoscopy
- Object-centric analysis and shot planning
Veo 3.1 with Cinematic Control and Sound Integration
Google's Veo 3.1 model now supports integrated sound generation and advanced cinematic controls for professional video production.
- Text-to-video with native audio generation
- Cinematic control parameters for precise motion
- Enhanced video quality up to 1080p
- Animate still images with cinematic motion
- Reference-to-video mode for style consistency
- First-last-frame-to-video for controlled sequences
- Enhanced video quality up to 1080p
- Reference-to-video mode for style consistency
- Multi-reference image support
- Controlled video sequences
- First-last-frame-to-video for controlled sequences
- Precise motion control between frames
- Cinematic transitions
Sora 2 Pro Models Available
OpenAI's Sora 2 Pro models now available for high-fidelity text-to-video and image-to-video generation.
- High-quality video generation from text
- 720p and 1080p output support
- Enhanced visual fidelity
- Cinematic quality outputs
- Cinematic motion from still images
- 720p and 1080p output support
- Enhanced visual fidelity
- Professional-grade video generation
- Video remix capabilities
- Transform weather/lighting in videos
- Style transfers and seasonal changes
- Object transformations and mood shifts
Seedream 4 and Seedance Models Integration
ByteDance's latest Seedream 4.0 (T2I/I2I) and Seedance (T2V/I2V) models bring unified generation and editing workflows.
- High-quality video generation from text
- Expressive motion and cinematic quality
- 1080p output support
20+ New AI Models Added
Major expansion of our model library with 20+ new models across image generation, video creation, and audio processing.
- Creative image generation
- Style-focused outputs

- High-quality image generation
- Professional outputs

- Fast image generation
- Creative workflows

- Style transfer
- Image transformation


- Master-quality video generation
- Professional outputs
- Fast video generation
- Creative video outputs
- Master-quality animation
- Professional motion
- Fast image animation
- Creative motion
LipSync Pro Released
Professional lip-sync correction tool for fixing audio-video synchronization issues in video content.
- Fix lip-sync issues in dubbed content
- Animate avatars to match voiceovers
- Match podcast or narration audio to video