GPT Image 2 is OpenAI's flagship image generation model, released in 2025 as the successor to DALL-E 3 and the first image model in the OpenAI lineup to operate natively at 1536px with multi-modal conditioning. It pulls ahead of peer models on prompt-faithfulness (rendering exactly the subject, action, and composition you describe rather than a generic interpretation), native typography (legible text in images, finally), and image-to-image transformation where it preserves subject identity instead of regenerating a similar-looking new face. Polyfaced runs GPT Image 2 via official OpenAI infrastructure through Kie, giving you both modalities, both resolutions, and the full feature set.
How to use
- 1
Pick text-to-image or image-to-image
Open the Image Generator panel and choose your mode. Text-to-image renders a fresh image from your prompt; image-to-image takes a reference image (paste a URL) plus an instruction prompt and transforms it while keeping the original subject identity. Image-to-image is the right tool when you want a known character or product in a new scene, lighting, or style — text-to-image is the right tool when starting from a blank canvas.
- 2
Write a clear, specific prompt
GPT Image 2 rewards specific instructions. Describe the subject, the composition (camera angle, framing), the lighting (warm sunset, harsh studio, soft window), and the style (photorealistic, illustrated, isometric, line art). Example: "A vintage leather messenger bag on a wooden desk beside a brass desk lamp. Top-down camera angle, warm tungsten lighting from camera right, shallow depth of field, editorial photography style, 35mm lens."
- 3
Set ratio and resolution
Choose the aspect ratio that matches your end use — square 1:1 for social posts and avatars, 3:2 or 16:9 for landscape (banners, headers), 2:3 or 9:16 for portrait and mobile. Resolution toggles between 1024px (faster, cheaper) and 1536px (production-grade detail, takes longer). 1536px is recommended whenever you need to print or embed at high pixel density.
- 4
Generate and preview
Press Create Image. Polyfaced freezes the credit cost upfront, dispatches the job to OpenAI, and returns the result inline — usually in under 15 seconds for 1024px, 20-30 seconds for 1536px. If the result misses your intent, iterate the prompt; each retry is independent (no charge for the previous failed-to-match attempt, only for the new run).
- 5
Download or share
The generated image lands in the right panel with a Download button. The asset is permanently stored on Cloudflare R2 with a permanent share URL — copy it, link it from a tweet, drop it into Figma or Photoshop, embed it in a Notion doc. Hit Generate Another to iterate the prompt or pull from the dashboard history page anytime.
What people are creating
Marketing and ad creatives
Generate hero images for landing pages, social ads, and email campaigns in minutes instead of days. GPT Image 2 handles product hero shots, lifestyle scenes, and abstract brand visuals at production quality — no shoot, no stock, no lengthy art direction loops.
Concept art and storyboards
Bring early-stage ideas into a visual form fast. Illustrators, game designers, and filmmakers use GPT Image 2 to sketch out scenes and characters before committing to a fully crafted pass — a single afternoon can yield 30-50 concept frames covering the visual exploration space.
Product mockups and previews
Generate packaging mockups, t-shirt prints, mug designs, and product-in-environment shots without setting up a 3D scene or hiring a photographer. Combine image-to-image (preserving the product) with style and scene prompts for fast variation runs.
Editorial and blog illustrations
Generate custom illustrations for articles, newsletters, and reports — moving away from generic stock photography. GPT Image 2 handles illustrated styles (flat vector, isometric, hand-drawn, retro print) cleanly, making it a strong replacement for editorial illustration commissions on tight timelines.
Model specifications
| Provider | OpenAI (via Kie infrastructure) |
|---|---|
| Modalities | Text-to-image, Image-to-image |
| Max resolution | 1536px (1536×1536 square, scaled for ratios) |
| Aspect ratios | 1:1, 3:2, 2:3, 16:9, 9:16 |
| Credit cost | 1024px = 2 credits · 1536px = 4 credits |
| Typical generation time | 8-15s @ 1024px, 20-30s @ 1536px |
| Prompt length cap | 4000 characters |
| Refund policy | Failed runs auto-refund credits to balance (FIFO ledger) |
FAQ
How does GPT Image 2 compare to Midjourney v7, Imagen 4, and Flux 1.1?
GPT Image 2 sits in the top tier across most prompt-faithfulness and typography benchmarks — meaningfully ahead of Midjourney on rendering exactly what you ask for, slightly ahead of Imagen 4 on multi-subject scenes, and ahead of Flux on legible in-image text. Each model has stylistic fingerprints: Midjourney leans painterly and aesthetic; Imagen leans photoreal; Flux leans naturalistic. GPT Image 2 leans editorial and prompt-faithful — ideal when you need exactly the composition you describe rather than the most beautiful interpretation.
Can I generate consistent characters across multiple images?
Yes — use image-to-image mode with the same starting reference image to keep subject identity stable across a series. For example, generate one hero portrait you like, then re-use it as the reference for all subsequent "same character in different scene" prompts. Pure text-to-image will not reliably preserve identity across runs (the same prompt twice yields different faces); image-to-image is the right tool for any "character consistency" workflow.
Why does GPT Image 2 handle text in images better than other models?
OpenAI specifically trained the model on a curated set of in-image typography, so it learned to render legible letters and short phrases rather than the squiggly fake-text other generative models produce. Use this for posters, packaging, ad creatives, sign-in-image scenes, etc. For very long copy (more than ~10 words), reliability drops — at that point, generate the visual without text and composite the copy in your design tool.
Are the outputs watermarked?
Free plan outputs carry a small Polyfaced badge in the bottom corner — fine for personal preview but not for commercial use. Pro plan members and Credit Pack buyers get fully clean, watermark-free outputs with a full commercial license — use them in client work, paid ads, products, or sale-distributed content. OpenAI also requires a metadata-level provenance signal (C2PA), which Polyfaced preserves in the original download.
Where do my images go after generation?
Every successful generation is stored on Cloudflare R2 with a permanent share URL — both the 1024px and the 1536px version when available. You can download the original quality file, copy the share URL, or revisit anytime from your dashboard history page. Images remain stored indefinitely while your account is active. If you cancel and don't reactivate within 12 months, we email you a final export link before purging.
Can I use GPT Image 2 outputs commercially?
Pro plan members and Credit Pack buyers receive a full commercial license — use the images in marketing, products, client work, paid social, or sale-distributed content. Free plan outputs are watermarked and personal-use only. OpenAI's underlying provider terms also apply (no extremist content, no impersonation of real public figures, no copyright-circumventing reproductions). Content policy violations on the provider side may trigger a failed generation and an automatic refund.
Ready to generate your first GPT Image 2 still? Sign in with Google in one click to claim your 5 free credits. The free grant covers two 1024px text-to-image runs plus an extra short test — enough to evaluate whether GPT Image 2 fits your specific creative workflow before you commit to a Pro plan or Credit Pack. Cancel anytime, no annual lock, no hidden fees.
