Stay updated with the latest features and improvements to Prologue AI.
Kling 2.6 Pro Available
Top-tier image-to-video generation with cinematic visuals, fluid motion, and native audio generation. Kling 2.6 Pro delivers professional-grade cinematic outputs with precise motion control. Supports English voice output.
- Top-tier image-to-video with cinematic visuals and fluid motion
- Native audio generation supporting Chinese and English voice output
- Professional-grade cinematic outputs with precise motion control
- 5s or 10s duration options with flexible audio on/off pricing
Seedream 4.5 Models Released
Major expansion of video-to-video utilities, image-to-video capabilities, and video-to-audio tools with professional-grade models.
- Ultra-realistic image generation
- Unified architecture for generation and editing
- Fast 2K image generation in seconds
- 4K support for production workflows

- Multi-reference editing with up to 10 style/identity images
- Confined edits that preserve scene structure
- Character consistency and face cloning
- Production-ready 2K and 4K outputs

- High-quality character animations with motion control
- Product showcase videos with smooth transitions
- Cinematic sequences from still images
- Social media content with optional background music
- Animate headshots with natural motion
- Bring product images to life
- Add environmental effects (trees swaying, water)
- Social clips (portrait → TikTok-style video)
- Add cinematic moves to static art
- Short ad creatives from stills
- Extend short clips with matching motion and context
- Create natural transitions at the tail of a shot
- Finish cuts for social or ad content
- Convert landscape videos to portrait for social media (TikTok, Instagram Reels)
- Transform portrait videos to landscape for YouTube or widescreen displays
- Adapt existing content to different aspect ratios without manual cropping
- Maintain visual quality while changing video format for different platforms
- Sequential shot generation maintaining motion style and camera language
- Cinematic style preservation across new scenes
- Multi-shot narratives with consistent visual language
- Character continuity across generated shots with up to 4 tracked references
- Production extensions with matching cinematic feel
- Character replacement while preserving original movements and camera angles
- Scene environment transformations through prompts
- Style transfer with motion preservation
- Multi-reference editing combining up to 4 elements/images
- Natural language control for direct edits without masking
- Social media content: TikTok, Instagram Reels, YouTube Shorts with animated subtitles
- Accessibility: Add captions for hearing-impaired viewers
- Multi-language content: Transcribe videos in 10+ languages automatically
- Professional videos: Add branded subtitles with custom fonts and colors
- Educational content: Create clear, readable subtitles for tutorials
- Marketing videos: Enhance engagement with karaoke-style word highlighting
- Craft background music for video
- Add creative audio layers that follow visuals
- Test different moods/genres on the same clip
- Add realistic foley (footsteps, doors, clinks)
- Create ASMR-like soundscapes
- Replace or enhance audio tracks without recording
O1 Models Available
Create or modify images and videos while maintaining consistency and continuity with our new O1 models.
- Advanced video editing with natural language instructions
- Character replacement while preserving motion
- Scene environment transformations
- Multi-reference editing with up to 4 elements/images
- Transform images into consistent, high-quality video scenes
- Stable character identity and object details
- Multi-reference support for complex scenes
FLUX.2 Models Available
Next-generation Flux models with advanced prompt understanding and professional-grade image generation and editing capabilities.
- High-quality image generation with advanced prompt understanding
- Support for @{field} syntax for referencing uploaded images
- Professional-grade visuals up to 2K

- Enhanced image editing with multi-reference support
- Up to 8 images via API (9MP total limit)
- Color matching with hex codes or image references
- Natural language editing instructions

Nano-Banana Pro Model Launch
Google's state-of-the-art Nano-Banana Pro model delivers exceptional prompt adherence and sophisticated photo editing capabilities.
- Ultra-realistic image generation with exceptional prompt adherence
- Sophisticated photo editing using conversational text prompts
- Multi-image editing support with up to 10 reference images
- 4K output support for production workflows
- Advanced composition control

- Conversational text prompts for image editing
- Multi-image editing support
- Exceptional prompt adherence
- 4K output support
- Advanced composition control

SAM-3 Object Detection and Segmentation
Advanced 2-layer object detection system using Gemini 3 Pro for semantic understanding and SAM-3 for precise pixel-level segmentation.
- Semantic scene understanding with Gemini 3 Pro
- Precise image segmentation with SAM-3
- Video segmentation with tracking support
- Multi-concept detection and masking
- Enhanced content editing capabilities
- Great for masks/segmentation/rotoscopy
- Object-centric analysis and shot planning
Veo 3.1 with Cinematic Control and Sound Integration
Google's Veo 3.1 model now supports integrated sound generation and advanced cinematic controls for professional video production.
- Text-to-video with native audio generation
- Cinematic control parameters for precise motion
- Enhanced video quality up to 1080p
- Animate still images with cinematic motion
- Reference-to-video mode for style consistency
- First-last-frame-to-video for controlled sequences
- Enhanced video quality up to 1080p
- Reference-to-video mode for style consistency
- Multi-reference image support
- Controlled video sequences
- First-last-frame-to-video for controlled sequences
- Precise motion control between frames
- Cinematic transitions
Sora 2 Pro Models Available
OpenAI's Sora 2 Pro models now available for high-fidelity text-to-video and image-to-video generation.
- High-quality video generation from text
- 720p and 1080p output support
- Enhanced visual fidelity
- Cinematic quality outputs
- Cinematic motion from still images
- 720p and 1080p output support
- Enhanced visual fidelity
- Professional-grade video generation
- Video remix capabilities
- Transform weather/lighting in videos
- Style transfers and seasonal changes
- Object transformations and mood shifts
Seedream 4 and Seedance Models Integration
ByteDance's latest Seedream 4.0 (T2I/I2I) and Seedance (T2V/I2V) models bring unified generation and editing workflows.
- Ultra-realistic image generation
- Unified architecture for generation and editing
- Fast 2K image generation in seconds
- 4K support for production workflows

- Expressive video motion from still images
- Animate headshots with natural motion
- Bring product images to life
- Add environmental effects
- High-quality video generation from text
- Expressive motion and cinematic quality
- 1080p output support
20+ New AI Models Added
Major expansion of our model library with 20+ new models across image generation, video creation, and audio processing.
- State-of-the-art image generation
- Exceptional prompt adherence

- Ultra-realistic image generation
- Fast 2K image generation

- Creative image generation
- Style-focused outputs

- High-quality image generation
- Professional outputs

- Fast image generation
- Creative workflows

- Sophisticated photo editing
- Multi-image editing support

- Style transfer
- Image transformation

- High-fidelity video generation
- Professional video creation
- Cinematic video generation
- Integrated sound generation
- Master-quality video generation
- Professional outputs
- Fast video generation
- Creative video outputs
- Cinematic motion from images
- High-quality animation
- Image animation
- Cinematic control
- Master-quality animation
- Professional motion
- Fast image animation
- Creative motion
LipSync Pro Released
Professional lip-sync correction tool for fixing audio-video synchronization issues in video content.
- Fix lip-sync issues in dubbed content
- Animate avatars to match voiceovers
- Match podcast or narration audio to video