OmniGen V1

Edit images, personalize content, try on clothes, or generate multiple people in one scene

Example Output

Prompt

"Neon words "Omni Gen" are flashing in the prosperous future city, 8K, hyper realistic"

Try OmniGen V1

Fill in the parameters below and click "Generate" to try this model

Text prompt for generation. Use <img><|image_1|></img> to reference input images

Input images for editing/generation. Reference as <img><|image_1|></img>

The size of the generated image. Width and height must be between 1024 and 4096

Inference steps (1-50). More steps generally improve quality but take longer

Text guidance scale (0-20). Higher values follow the prompt more closely

Image guidance scale (0-20). Higher values follow the input images more closely

Number of images to generate (1-4)

Output format
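The limits in the parameter descriptions above can be checked before a request is submitted. The following is a minimal sketch, not the service's actual API: the class name, field names, and default values are hypothetical and chosen only for illustration. The numeric ranges come from the descriptions above, and the two output formats (JPEG and PNG) are the ones this page lists for the model.

```python
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class OmniGenRequest:
    """Hypothetical request shape; field names and defaults are illustrative."""

    prompt: str
    input_image_urls: List[str] = field(default_factory=list)
    width: int = 1024
    height: int = 1024
    num_inference_steps: int = 50
    text_guidance_scale: float = 3.0   # arbitrary default, not from the page
    image_guidance_scale: float = 1.5  # arbitrary default, not from the page
    num_images: int = 1
    output_format: str = "jpeg"

    def validate(self) -> List[str]:
        """Return a list of human-readable problems (empty if there are none)."""
        errors = []
        if not self.prompt.strip():
            errors.append("prompt must not be empty")
        # (field name, lower bound, upper bound) as stated in the form descriptions
        ranges = [
            ("width", 1024, 4096),
            ("height", 1024, 4096),
            ("num_inference_steps", 1, 50),
            ("text_guidance_scale", 0, 20),
            ("image_guidance_scale", 0, 20),
            ("num_images", 1, 4),
        ]
        for name, low, high in ranges:
            value = getattr(self, name)
            if not low <= value <= high:
                errors.append(f"{name} must be between {low} and {high}, got {value}")
        if self.output_format not in ("jpeg", "png"):
            errors.append(
                f"output_format must be 'jpeg' or 'png', got {self.output_format!r}"
            )
        return errors


def build_payload(request: OmniGenRequest) -> dict:
    """Validate the request and return it as a plain dict, or raise ValueError."""
    errors = request.validate()
    if errors:
        raise ValueError("; ".join(errors))
    return asdict(request)
```

Collecting every problem in a list, rather than raising on the first one, lets a form or script report all out-of-range values at once.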

More Image Generation Models

Wan v2.6 Image to Image

Wan 2.6 image-to-image model with multi-image composition. Combine elements from up to 3 reference images with natural language instructions. Supports Chinese and English

Google Imagen 4

Generate images with superior clarity and text rendering

Leonardo Phoenix 1.0

Create photorealistic images up to 5MP with exceptional prompt accuracy

Bagel Text to Image

Create 1024x1024 images from text with optional quality boost for better results.

Luma Photon

Create high-quality images from text in 7 aspect ratios

OmniGen V2

Edit images, swap outfits, personalize content, or create multi-person scenes with up to 3 input images

Hidream I1 Full

Generate high-quality images in seconds with this 17B parameter model

Stable Diffusion v1.5

Generate 1-8 images with LoRA support, custom sizes, and prompt expansion

Gemini 2.5 Flash Image (nano banana)

Generate high-quality images quickly with Google's fast image model.

About OmniGen V1

OmniGen V1 is a unified multimodal image generation model for image editing, personalization, and virtual try-on. It accepts text and image inputs together, so you can generate new images from a text prompt or modify existing ones. Two guidance scales, one for text and one for images, control how closely the output follows the prompt and the input images. Prompts reference input images with a dedicated syntax (<img><|image_1|></img>), which allows multimodal compositions and editing instructions that point at specific inputs.

The model supports multiple image sizes and aspect ratios, from high-definition square formats to portrait and landscape orientations. You can set the number of inference steps (up to 50), adjust the text and image guidance scales, and generate up to four images per request. Output formats are JPEG and PNG. An integrated content safety checker helps keep generated images appropriate.

OmniGen V1 suits graphic designers, marketers, e-commerce businesses, and content creators who need personalized product imagery, virtual try-on experiences, creative photo editing, or multi-person image generation. Its personalization features overlay, blend, or transform input images according to detailed text instructions, which is useful for rapid prototyping, campaign visuals, and social media content.

To use the model, provide a text prompt, optionally upload one or more input images, select an image size and output format, and adjust the advanced settings for quality and guidance. The same controls apply whether you are editing photos, generating hyper-realistic visuals, or building assets for virtual experiences.

✨ Key Features

Unified multimodal image generation that blends text and image inputs for highly customizable outputs.

Dual guidance scales for fine-tuning how closely results follow text prompts and input images.

Supports a wide range of image sizes and aspect ratios, from square HD to custom portrait and landscape formats.

Advanced image editing capabilities, including personalization, virtual try-on, and multi-person generation.

Generates up to four images per request, enabling rapid comparison and selection of creative options.

Offers both JPEG and PNG output formats for seamless integration into various workflows.

Built-in safety checker helps ensure generated content is appropriate and responsible.

💡 Use Cases

Creating personalized marketing visuals from product images and custom prompts.

Enabling virtual try-on experiences for fashion and retail e-commerce.

Editing and enhancing photographs with advanced AI-driven modifications.

Generating multi-person scenes for advertising or creative projects.

Rapidly prototyping concepts for campaigns, social media, or branding.

Producing hyper-realistic digital art and illustrations from descriptive text.

Blending multiple input images into unique, cohesive compositions.

🎯 Best For

Creative professionals, designers, marketers, e-commerce teams, and content creators seeking advanced AI-powered image generation and editing.

👍 Pros

  • Highly flexible multimodal input system for complex creative scenarios.
  • Fine-grained control over output quality and content adherence via dual guidance scales.
  • Supports a wide variety of image sizes and formats for diverse needs.
  • User-friendly interface with customizable parameters for both beginners and experts.
  • Fast generation times enable efficient experimentation and iteration.
  • Integrated safety checker for responsible content creation.

⚠️ Considerations

  • Requires understanding of prompt syntax for referencing input images.
  • Maximum of four images per generation may limit batch processing needs.
  • High-quality results depend on careful tuning of guidance and inference settings.
  • Content safety checker may occasionally filter desired creative outputs.
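Because results depend on the guidance and inference settings, one way to tune them is to enumerate a small grid of combinations and compare the outputs side by side. The sketch below only builds the list of settings to try; it does not call any service. The function name and dictionary keys are hypothetical, and the ranges (1-50 steps, 0-20 for both guidance scales) are the ones stated on this page.

```python
from itertools import product
from typing import Dict, Iterable, List


def settings_grid(
    steps: Iterable[int],
    text_scales: Iterable[float],
    image_scales: Iterable[float],
) -> List[Dict[str, float]]:
    """Return every combination of the given settings, rejecting out-of-range values."""
    grid = []
    for n_steps, text_scale, image_scale in product(steps, text_scales, image_scales):
        if not 1 <= n_steps <= 50:
            raise ValueError(f"inference steps must be in 1-50, got {n_steps}")
        if not 0 <= text_scale <= 20:
            raise ValueError(f"text guidance must be in 0-20, got {text_scale}")
        if not 0 <= image_scale <= 20:
            raise ValueError(f"image guidance must be in 0-20, got {image_scale}")
        grid.append(
            {
                "num_inference_steps": n_steps,
                "text_guidance_scale": text_scale,
                "image_guidance_scale": image_scale,
            }
        )
    return grid


if __name__ == "__main__":
    # Two values per setting gives 2 * 2 * 2 = 8 combinations to compare.
    for settings in settings_grid([20, 50], [2.0, 4.0], [1.0, 2.0]):
        print(settings)
```

Keeping the grid small matters here, since each combination is a separate generation request.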

📚 How to Use OmniGen V1

1. Enter a detailed text prompt describing your desired image, using <img><|image_1|></img> syntax to reference any uploaded images.
2. Upload up to ten input images if you wish to personalize or edit existing visuals.
3. Select your preferred image size and aspect ratio from the available options or choose a custom dimension.
4. Adjust advanced parameters such as inference steps, text guidance scale, and image guidance scale to balance quality and prompt adherence.
5. Choose the number of images to generate (up to four) and select your desired output format (JPEG or PNG).
6. Submit your request and wait for the model to process and return your generated images.
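The image-reference syntax in the first step is easy to mistype, so it can help to generate the placeholders and check them against the number of uploaded images. This is a minimal sketch with hypothetical helper names. The page only shows the placeholder for the first image; numbering further images as image_2, image_3, and so on is an assumption here.

```python
import re
from typing import List

# Matches placeholders of the form <img><|image_N|></img> and captures N.
IMAGE_TAG_PATTERN = re.compile(r"<img><\|image_(\d+)\|></img>")


def image_tag(index: int) -> str:
    """Build the placeholder for the index-th input image (1-based)."""
    if index < 1:
        raise ValueError("image indices start at 1")
    return f"<img><|image_{index}|></img>"


def referenced_indices(prompt: str) -> List[int]:
    """Return the sorted, de-duplicated image indices a prompt refers to."""
    return sorted({int(match) for match in IMAGE_TAG_PATTERN.findall(prompt)})


def missing_images(prompt: str, num_uploaded: int) -> List[int]:
    """Return indices referenced in the prompt that have no uploaded image."""
    return [i for i in referenced_indices(prompt) if i > num_uploaded]


if __name__ == "__main__":
    prompt = f"The person in {image_tag(1)} wearing the jacket from {image_tag(2)}"
    print(prompt)
    print(missing_images(prompt, num_uploaded=1))
```

Running the check before submitting catches a prompt that refers to an image that was never uploaded.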
