GPT Image 1.5
GPT Image 1.5 builds on GPT Image 1 with improved visual fidelity, stronger prompt alignment, and more stable results across complex scenes and styles.
Input
Prompt
Upload up to 16 images (Optional)



upload
Aspect Ratio
Output Quality
Result
View History| Model & Modality | Credits / Gen | Our Price (USD) | Official Price (USD) | DISCOUNT |
|---|---|---|---|---|
gpt image 1.5, t2i, i2i, medium imageOpenAI | 4 per image | $0.0179 | $0.034 | - 47% |
gpt image 1.5, t2i, i2i, high imageOpenAI | 22 per image | $0.0982 | $0.133 | - 26% |
High-quality image generation and precise image editing with strong prompt control.

Prompt:
Create an infographic image of [LANDMARK], combining a real photograph of the landmark with blueprint-style technical annotations and diagrams overlaid on the image. Include the title “[LANDMARK]” in a hand-drawn box in the corner. Add white chalk-style sketches showing key.
GPT Image 1.5 delivers reliable image generation and fine-grained editing with strong prompt understanding, built for real-world applications.
Generate detailed and visually consistent images from text prompts, suitable for product visuals, creative concepts, and design drafts.
Edit specific areas of an image using text instructions while preserving original composition, lighting, and visual consistency.
Accurately follows complex prompts, enabling controlled outputs for styles, objects, layouts, and visual details.
Supports both generating images from text and transforming existing images, allowing flexible creative and editing workflows.
Maintains visual coherence across edits and iterations, making it suitable for multi-step image workflows.
Designed for easy API integration, offering stable performance and predictable results for developer-focused products.
Designed for scenarios that require precise control, detailed editing, and consistent visual results.
GPT Image 1.5 is well-suited for applications where users upload images and request specific changes using natural language. Typical examples include changing clothing, adjusting appearance details, modifying backgrounds, or adding and removing objects. The model follows detailed instructions while preserving facial features, lighting, and composition, making it ideal for consumer-facing photo editing tools and UGC platforms that rely on text-driven interactions.

For workflows that require multiple rounds of edits on the same image, GPT Image 1.5 maintains strong visual consistency across iterations. Developers can apply sequential instructions—such as refining style, adjusting elements, or correcting details—without restarting from scratch. This makes it suitable for professional image workflows, design tools, and applications where controlled, step-by-step refinement is essential.

GPT Image 1.5 excels in generating images that contain readable and accurate text elements, such as labels, UI components, signage, or product packaging. This capability is especially valuable for generating UI mockups, onboarding screens, marketing banners, and product visuals where text clarity and layout accuracy matter. Compared to general-purpose image models, it provides more reliable text placement and legibility.

GPT Image 1.5 improves on GPT Image 1 with higher resolution, faster generation, finer editing precision, and more reliable multi-step iterations for production-ready applications.
| Feature | GPT Image 1 | GPT Image 1.5 |
|---|---|---|
Max Resolution | 512×512 | 1024×1024 |
Generation Speed | ~6–8 sec per 512×512 | ~3–4 sec per 1024×1024 |
Prompt Compliance | Medium | High – accurately follows complex prompts |
Editing Granularity | Basic global edits | Fine-grained local edits with instruction support |
Multi-Step Iteration | Low consistency | High – preserves style, lighting, composition across multiple edits |
Text Rendering in Images | Often distorted | Clear and readable, even in complex scenes |
Supported Input Types | Text only | Text + Image for image-to-image edits |
API Cost Efficiency | Moderate | Lower cost per image with faster response |
Output Style Stability | Medium | High – consistent style across outputs |