Image Generation
Text-to-image generation integrated directly into your pipelines.
Overview
PipeImgGen generates images from text prompts using state-of-the-art models. Generated images can be stored locally or in cloud storage with public or signed URLs. Image generation supports both text-to-image and image-to-image workflows.
Supported Models
Via Pipelex Gateway:
- GPT-Image-1.5 — OpenAI's latest image generation model
- GPT-Image-1 — OpenAI image generation
- GPT-Image-1-mini — Smaller, faster variant for quick generations
- FLUX-2-pro — Black Forest Labs' high-quality generation model
- Nano Banana / Nano Banana Pro / Nano Banana 2 — Google Gemini-based image generation
Via direct provider SDKs:
- OpenAI — Direct OpenAI API for GPT Image models
- Google Gemini — Native Google image generation
- fal — FLUX and other models via the fal platform
- Hugging Face Inference — Open-source models like qwen-image
- BlackboxAI — Via completions-based image generation
- Azure REST — Azure-hosted image generation
- OpenRouter — Multi-provider image generation access
Cloud Storage Integration
Generated images can be automatically uploaded to AWS S3 or Google Cloud Storage with configurable URL signing. See Cloud Storage for details.
Usage in Pipelines
Use PipeImgGen in your .mthds files to generate images as part of a pipeline. The operator accepts a text prompt (or an ImgGenPrompt concept) and outputs an Image concept. Model presets let you configure quality levels and default models in 2_img_gen_deck.toml.
See PipeImgGen reference.