Generate Image is a full AI image studio with 20+ models in a single tool. Create from text prompts, edit existing images with natural language, compose new shots from multiple references, and upscale to 4K — all without switching between different apps or APIs.
Models range from fast drafts to photorealistic renders, with specialists for typography, editorial photography, product shots, and artistic styles. The default model handles most tasks well; call list_models when you need something specific.
Typography guidance: All models approximate named fonts (e.g. "Outfit Black") rather than rendering them exactly. For designs where legible or stylised text is the focus, use ideogram-v3 or gpt-image-2 — they produce the cleanest results. The default model is better suited to conceptual or illustrative output where precise text is not required. For pixel-exact font matching, generate the image without text and add typography in a design tool afterward.
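The typography guidance above amounts to a small decision rule, sketched here in Python. The model names come from the guidance itself; the `TypographyPlan` type, the `plan_typography` helper, and its flags are hypothetical names used only for illustration and are not part of the tool.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TypographyPlan:
    # None means "leave the model unset and use the default model".
    model: Optional[str]
    # False means "generate without text and add it in a design tool".
    text_in_prompt: bool


def plan_typography(text_is_focus: bool, needs_exact_font: bool) -> TypographyPlan:
    """Hypothetical helper that encodes the typography guidance."""
    if needs_exact_font:
        # No model renders a named font exactly, so keep text out of the image.
        return TypographyPlan(model=None, text_in_prompt=False)
    if text_is_focus:
        # ideogram-v3 and gpt-image-2 are the two models named for clean text;
        # this sketch arbitrarily picks the first.
        return TypographyPlan(model="ideogram-v3", text_in_prompt=True)
    # Conceptual or illustrative output: the default model is sufficient.
    return TypographyPlan(model=None, text_in_prompt=True)
```

For a poster where the headline is the focus, `plan_typography(True, False)` selects ideogram-v3; for a brand asset that must match a licensed font exactly, `plan_typography(True, True)` leaves text out of the prompt.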
What you can do
- text_to_image — generate images from text prompts with full control over size, style, aspect ratio, and model
- edit_image — modify an existing image using natural language: change backgrounds, add objects, remove elements, apply style transfers
- image_to_image — compose a new image from up to 4 reference photos, combining scenes, personas, products, and outfits
- check_image — poll for a pending result when a model is still processing
- upscale_image — increase resolution up to 10x from inside the same workflow
- list_models — browse all available models with pricing, capabilities, and supported parameters
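Since check_image exists to poll for a pending result, a client will typically wrap it in a small retry loop. The sketch below shows one way to do that. It assumes a caller-supplied `check` function that invokes check_image and returns a dictionary; the `job_id` argument, the `"status"` key, and the `"pending"` value are assumptions for illustration, so consult the tool's actual response format before relying on them.

```python
import time
from typing import Any, Callable, Dict


def wait_for_image(
    check: Callable[[str], Dict[str, Any]],
    job_id: str,
    interval_s: float = 2.0,
    max_attempts: int = 30,
) -> Dict[str, Any]:
    """Poll `check` until the result is no longer pending.

    `check` is a hypothetical wrapper around the check_image tool; the
    "status" / "pending" response shape is assumed, not documented.
    """
    for _ in range(max_attempts):
        result = check(job_id)
        if result.get("status") != "pending":
            return result
        # Wait before asking again so the service is not polled in a tight loop.
        time.sleep(interval_s)
    raise TimeoutError(f"image {job_id!r} still pending after {max_attempts} checks")
```

Passing the check function in as an argument keeps the loop independent of any particular client library and makes it easy to test with a stub.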
Who it's for
Marketers producing campaign assets, product teams visualizing concepts, content creators building visual content, and developers adding image generation to AI workflows.
How to use it
- Use text_to_image with a descriptive prompt — the default model works well without setting a model parameter
- Use edit_image to modify an existing photo with a natural language instruction
- Use image_to_image to combine reference images from your library (scenes, personas, products, outfits) into one new composition
- Use list_models to find a specialist model for typography, photorealism, or artistic styles, then set it as the model parameter on your next call
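As a rough illustration of the first step, the sketch below builds the arguments for a text_to_image call and leaves the model unset unless one is chosen. The argument names `prompt`, `model`, and `aspect_ratio` are assumptions based on the controls described above; the real parameter names and accepted values are whatever list_models and the tool schema report.

```python
from typing import Any, Dict, Optional


def text_to_image_args(
    prompt: str,
    model: Optional[str] = None,
    aspect_ratio: Optional[str] = None,
) -> Dict[str, Any]:
    """Build a hypothetical argument dictionary for text_to_image."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    args: Dict[str, Any] = {"prompt": prompt}
    # Omit optional keys entirely so the tool applies its own defaults,
    # including the default model.
    if model is not None:
        args["model"] = model
    if aspect_ratio is not None:
        args["aspect_ratio"] = aspect_ratio
    return args
```

Calling `text_to_image_args("a lighthouse at dusk, soft film grain")` yields a dictionary with only the prompt, which matches the advice to start with the default model and add a specialist model only when needed.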
Getting started
Generate Image works immediately with no configuration: the default model handles text-to-image, editing, and composition out of the box.