Prologue AI Logo
O1 and GPT-Image & Flux2 are live ! Create or modify images and videos while maintaining consistency, and continuity.

Release Notes

Stay updated with the latest features and improvements to Prologue AI.

GPT-Image-1.5 Edit Available

GPT-Image-1.5 Edit is now available for high-fidelity image editing

GPT-Image-1.5 EditImage → Image - Edit
  • High-fidelity edits with strong prompt adherence
  • Choose quality (low / medium / high)
  • Choose image size (auto, 1024×1024, 1536×1024, 1024×1536)
  • Generate up to 4 images per request
GPT-Image-1.5 Edit before
GPT-Image-1.5 Edit after
BeforeAfter

Kling 2.6 Pro Available

Top-tier image-to-video generation with cinematic visuals, fluid motion, and native audio generation. Kling 2.6 Pro delivers professional-grade cinematic outputs with precise motion control. Supports English voice output.

Kling 2.6 ProImage → Video
  • Top-tier image-to-video with cinematic visuals and fluid motion
  • Native audio generation supporting Chinese and English voice output
  • Professional-grade cinematic outputs with precise motion control
  • 5s or 10s duration options with flexible audio on/off pricing

Seedream 4.5 Models Released

Major expansion of video-to-video utilities, image-to-video capabilities, and video-to-audio tools with professional-grade models.

Seedream 4.5Text → Image
  • Ultra-realistic image generation
  • Unified architecture for generation and editing
  • Fast 2K image generation in seconds
  • 4K support for production workflows
Seedream 4.5 preview
Seedream 4.5 EditingImage → Image - Edit
  • Multi-reference editing with up to 10 style/identity images
  • Confined edits that preserve scene structure
  • Character consistency and face cloning
  • Production-ready 2K and 4K outputs
Seedream 4.5 Editing before
Seedream 4.5 Editing after
BeforeAfter
WAN 25Image → Video
  • High-quality character animations with motion control
  • Product showcase videos with smooth transitions
  • Cinematic sequences from still images
  • Social media content with optional background music
Seedance I2V ProImage → Video
  • Animate headshots with natural motion
  • Bring product images to life
  • Add environmental effects (trees swaying, water)
MiniMax I2V-ProImage → Video
  • Social clips (portrait → TikTok-style video)
  • Add cinematic moves to static art
  • Short ad creatives from stills
ExtensionProVideo to Video Utilities
  • Extend short clips with matching motion and context
  • Create natural transitions at the tail of a shot
  • Finish cuts for social or ad content
ReframeVideo to Video Utilities
  • Convert landscape videos to portrait for social media (TikTok, Instagram Reels)
  • Transform portrait videos to landscape for YouTube or widescreen displays
  • Adapt existing content to different aspect ratios without manual cropping
  • Maintain visual quality while changing video format for different platforms
O1 Next ShotVideo to Video Utilities
  • Sequential shot generation maintaining motion style and camera language
  • Cinematic style preservation across new scenes
  • Multi-shot narratives with consistent visual language
  • Character continuity across generated shots with up to 4 tracked references
  • Production extensions with matching cinematic feel
Auto SubtitleVideo to Video Utilities
  • Social media content: TikTok, Instagram Reels, YouTube Shorts with animated subtitles
  • Accessibility: Add captions for hearing-impaired viewers
  • Multi-language content: Transcribe videos in 10+ languages automatically
  • Professional videos: Add branded subtitles with custom fonts and colors
  • Educational content: Create clear, readable subtitles for tutorials
  • Marketing videos: Enhance engagement with karaoke-style word highlighting
SyncWaveVideo to Audio Tools
  • Craft background music for video
  • Add creative audio layers that follow visuals
  • Test different moods/genres on the same clip
FoleySFXVideo to Audio Tools
  • Add realistic foley (footsteps, doors, clinks)
  • Create ASMR-like soundscapes
  • Replace or enhance audio tracks without recording

O1 Models Available

Create or modify images and videos while maintaining consistency and continuity with our new O1 models.

Kling O1 EditVideo to Video Utilities
  • Advanced video editing with natural language instructions
  • Character replacement while preserving motion
  • Scene environment transformations
  • Multi-reference editing with up to 4 elements/images
O1 Reference-to-VideoImage → Video
  • Transform images into consistent, high-quality video scenes
  • Stable character identity and object details
  • Multi-reference support for complex scenes

FLUX.2 Models Available

Next-generation Flux models with advanced prompt understanding and professional-grade image generation and editing capabilities.

Flux2 FlexText → Image
  • High-quality image generation with advanced prompt understanding
  • Support for @{field} syntax for referencing uploaded images
  • Professional-grade visuals up to 2K
Flux2 Flex preview
FluxFlex-2-EditingImage → Image - Edit
  • Enhanced image editing with multi-reference support
  • Up to 8 images via API (9MP total limit)
  • Color matching with hex codes or image references
  • Natural language editing instructions
FluxFlex-2-Editing before
FluxFlex-2-Editing after
BeforeAfter

Nano-Banana Pro Model Launch

Google's state-of-the-art Nano-Banana Pro model delivers exceptional prompt adherence and sophisticated photo editing capabilities.

Nano-Banana ProText → Image
  • Ultra-realistic image generation with exceptional prompt adherence
  • Sophisticated photo editing using conversational text prompts
  • Multi-image editing support with up to 10 reference images
  • 4K output support for production workflows
  • Advanced composition control
Nano-Banana Pro preview
Nano-Banana Pro EditImage → Image - Edit
  • Conversational text prompts for image editing
  • Multi-image editing support
  • Exceptional prompt adherence
  • 4K output support
  • Advanced composition control
Nano-Banana Pro Edit before
Nano-Banana Pro Edit after
BeforeAfter

SAM-3 Object Detection and Segmentation

Advanced 2-layer object detection system using Gemini 3 Pro for semantic understanding and SAM-3 for precise pixel-level segmentation.

MaskGen Videos (SA2VA)Video to Video Utilities
  • Semantic scene understanding with Gemini 3 Pro
  • Precise image segmentation with SAM-3
  • Video segmentation with tracking support
  • Multi-concept detection and masking
  • Enhanced content editing capabilities
  • Great for masks/segmentation/rotoscopy
  • Object-centric analysis and shot planning

Veo 3.1 with Cinematic Control and Sound Integration

Google's Veo 3.1 model now supports integrated sound generation and advanced cinematic controls for professional video production.

Veo 3.1Text→Videos+Animations
  • Text-to-video with native audio generation
  • Cinematic control parameters for precise motion
  • Enhanced video quality up to 1080p
Veo 3.1 Image-to-VideoImage → Video
  • Animate still images with cinematic motion
  • Reference-to-video mode for style consistency
  • First-last-frame-to-video for controlled sequences
  • Enhanced video quality up to 1080p
Veo 3.1 Reference-to-VideoImage → Video
  • Reference-to-video mode for style consistency
  • Multi-reference image support
  • Controlled video sequences
Veo 3.1 First-Last-Frame-to-VideoImage → Video
  • First-last-frame-to-video for controlled sequences
  • Precise motion control between frames
  • Cinematic transitions

Sora 2 Pro Models Available

OpenAI's Sora 2 Pro models now available for high-fidelity text-to-video and image-to-video generation.

Sora 2 T2V ProText→Videos+Animations
  • High-quality video generation from text
  • 720p and 1080p output support
  • Enhanced visual fidelity
  • Cinematic quality outputs
Sora 2 I2V ProImage → Video
  • Cinematic motion from still images
  • 720p and 1080p output support
  • Enhanced visual fidelity
  • Professional-grade video generation
Sora 2 RemixVideo to Video Utilities
  • Video remix capabilities
  • Transform weather/lighting in videos
  • Style transfers and seasonal changes
  • Object transformations and mood shifts

Seedream 4 and Seedance Models Integration

ByteDance's latest Seedream 4.0 (T2I/I2I) and Seedance (T2V/I2V) models bring unified generation and editing workflows.

Seedance T2V ProText→Videos+Animations
  • High-quality video generation from text
  • Expressive motion and cinematic quality
  • 1080p output support

20+ New AI Models Added

Major expansion of our model library with 20+ new models across image generation, video creation, and audio processing.

RecraftText → Image
  • Creative image generation
  • Style-focused outputs
Recraft preview
HiDreamText → Image
  • High-quality image generation
  • Professional outputs
HiDream preview
FluxDevText → Image
  • Fast image generation
  • Creative workflows
FluxDev preview
Recraft Image-to-ImageImage → Image - Edit
  • Style transfer
  • Image transformation
Recraft Image-to-Image before
Recraft Image-to-Image after
BeforeAfter
KlingGen MasterText→Videos+Animations
  • Master-quality video generation
  • Professional outputs
PixVerseText→Videos+Animations
  • Fast video generation
  • Creative video outputs
Kling Master I2VImage → Video
  • Master-quality animation
  • Professional motion
PixVerse I2VImage → Video
  • Fast image animation
  • Creative motion

LipSync Pro Released

Professional lip-sync correction tool for fixing audio-video synchronization issues in video content.

LipSync ProVideo to Audio Tools
  • Fix lip-sync issues in dubbed content
  • Animate avatars to match voiceovers
  • Match podcast or narration audio to video