GPT Image 1.5 Edit is now live!
🎨 Image Generation

OmniGen V2

Edit images, swap outfits, personalize content, or create multi-person scenes with up to 3 input images

Example Output

Prompt

"Make the dress blue"

Generated Result

Generated

Try OmniGen V2

Fill in the parameters below and click "Generate" to try this model

Edit/generation prompt. Be specific (e.g., 'Add bird from image 1 to desk in image 2')

Input images for editing (up to 3 images)

The size of the generated image. Width and height must be between 1024 and 4096

Inference steps (20-50). Higher = better quality

Text guidance (1-8). How closely to follow text prompt

Image guidance (1-3). Editing: 1.3-2.0, Generation: 2.0-3.0

Negative prompt to avoid unwanted elements

CFG range start value (0-1)

CFG range end value (0-1)

Diffusion scheduler

Number of images to generate (1-4)

Output format

Your inputs will be saved and ready after sign in

More Image Generation Models

LongCat Image

LongCat Image

Generate photorealistic images with multilingual text rendering support.

Bria Image 3.2

Bria Image 3.2

Generate images from text descriptions

Stable Diffusion v1.5

Stable Diffusion v1.5

Generate 1-8 images with LoRA support, custom sizes, and prompt expansion

FLUX.1 SRPO Text-to-Image

FLUX.1 SRPO Text-to-Image

Generate beautiful, high-quality images from text for personal or commercial use.

Hunyuan Image v2.1 Text-to-Image

Hunyuan Image v2.1 Text-to-Image

Generate expressive, high-quality images from text descriptions.

Midjourney Text to Image

Midjourney Text to Image

Turn text into artistic images. Get 4 unique variations per prompt with exceptional quality.

FLUX Dev

FLUX Dev

Turn text descriptions into detailed images with this 12B parameter model

Runway Gen-4 Image

Runway Gen-4 Image

Create exact images you need using up to 3 reference images for guidance

Bagel Text to Image

Bagel Text to Image

Create 1024x1024 images from text with optional quality boost for better results.

About OmniGen V2

OmniGen V2 is a cutting-edge, unified multimodal AI model designed to revolutionize image editing and generation workflows. Leveraging state-of-the-art diffusion technology, OmniGen V2 enables users to perform advanced image manipulations, virtual try-ons, and multi-person compositions with unprecedented flexibility and control. Capable of working with up to three input images, this model excels at both simple edits and complex composite generation, making it an ideal tool for creatives and professionals seeking powerful, intuitive image editing solutions. At its core, OmniGen V2 combines separate text and image guidance, allowing users to specify detailed instructions for image edits or creations. For example, you can prompt the model to "Add bird from image 1 to desk in image 2" or "Change the color of the dress to blue." The dual-guidance system ensures that results are closely aligned with both the visual input and the textual prompt, delivering highly personalized and accurate outcomes. The model supports various image aspect ratios and resolutions, from square to portrait and landscape formats, with customizable image sizes ranging from 1024 to 4096 pixels. OmniGen V2 offers advanced control over the generation process through adjustable inference steps (20-50), guidance scales for both text and images, and configurable classifier-free guidance (CFG) range values. These features empower users to fine-tune quality, fidelity, and style according to their specific needs. The inclusion of two powerful diffusion schedulers—Euler and DPMSolver—further enhances creative control, enabling users to experiment with different generation dynamics and visual effects. Safety and quality are integral to OmniGen V2. The model incorporates an optional content safety checker and advanced negative prompt settings to minimize unwanted elements, such as deformities or artifacts. Users can also select their preferred output format (JPEG or PNG) and generate up to four images per request, supporting efficient batch creation and comparison. With random seed control for reproducibility and a robust synchronization mode, OmniGen V2 ensures consistent, reliable performance for a wide range of applications. Ideal use cases for OmniGen V2 include professional photo editing, content creation, e-commerce virtual try-on solutions, creative artwork generation, marketing collateral design, and collaborative multi-person image synthesis. Whether you are a designer, marketer, content creator, or developer, OmniGen V2 streamlines complex image tasks and opens new avenues for visual storytelling and personalization. Its pay-as-you-go credit system ensures scalable, flexible access to advanced AI-powered image editing capabilities without upfront commitments. Experience the next generation of multimodal image editing and unlock new creative possibilities with OmniGen V2.

✨ Key Features

Unified multimodal model supports image editing, generation, personalization, and virtual try-on capabilities.

Accepts up to 3 input images for advanced compositing and multi-person generation.

Separate text and image guidance allows precise control over edits and creative outputs.

Flexible image size options from 1024 to 4096 pixels, with multiple aspect ratios including square, portrait, and landscape.

Adjustable inference steps, guidance scales, and CFG range for fine-tuned quality and style.

Choice of Euler and DPMSolver diffusion schedulers for diverse generation dynamics.

Built-in safety checker and negative prompt settings to help avoid unwanted artifacts or inappropriate content.

💡 Use Cases

Personalizing fashion or product images with virtual try-on and color changes.

Editing and enhancing portraits by combining elements from multiple images.

Creating marketing visuals and social media graphics with custom edits and enhancements.

Generating creative artwork or illustrations from detailed prompts and image references.

Building e-commerce catalogs with automated product swaps and background modifications.

Composing multi-person group photos or scenes for creative projects.

Developing content for advertising campaigns with unique visual concepts.

🎯

Best For

Professional designers, marketers, e-commerce teams, content creators, and developers seeking advanced AI-powered image editing and generation.

👍 Pros

  • Highly flexible with support for multiple input images and detailed prompts.
  • Advanced control over output quality and style through adjustable parameters.
  • Wide range of supported image sizes and aspect ratios for various applications.
  • Dual guidance system ensures outputs match both text and visual intent.
  • Efficient batch generation with up to four images per request.
  • Optional safety features reduce the risk of inappropriate or unwanted content.

⚠️ Considerations

  • Requires careful prompt and parameter tuning for optimal results.
  • Supports a maximum of three input images per generation.
  • Generation times may vary based on complexity and image size.
  • Some advanced features may require a learning curve for new users.

📚 How to Use OmniGen V2

1

Prepare your input images (up to three) and upload them to the platform.

2

Enter a detailed prompt describing the desired edit or image generation.

3

Select your preferred image size and aspect ratio from the available options.

4

Adjust advanced settings such as inference steps, guidance scales, and scheduler as needed.

5

Set a negative prompt if you want to avoid specific unwanted elements in the output.

6

Click generate and review the resulting images; download or further edit as desired.

Frequently Asked Questions

🏷️ Related Keywords

AI image editor multimodal AI virtual try-on image generation photo editing creative AI tools composite images diffusion model content creation e-commerce imagery