#What it does
The imagegen skill generates and edits images using the OpenAI Image API. It classifies requests into taxonomy categories (product shots, UI mockups, concept art, background edits, etc.), augments prompts into structured specs, and runs the bundled CLI for deterministic, reproducible image generation.
#How to use
$imagegen Generate or edit images for this task and return the final prompt plus selected outputs.
#Decision tree
- User provides an input image or says "edit/retouch/inpaint/mask" -- edit
- User needs many different prompts/assets -- generate-batch
- Otherwise -- generate
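The decision tree can be sketched as a small routing function. This is an illustration only: `choose_mode`, its parameters, the keyword regex, and the "more than one prompt" threshold for batch are assumptions made for the example, not part of the skill's bundled code.

```python
import re

# Keywords from the decision tree; matching them with a regex is an assumption.
EDIT_WORDS = re.compile(r"\b(edit|retouch|inpaint|mask)\b", re.IGNORECASE)


def choose_mode(request: str, has_input_image: bool = False, prompt_count: int = 1) -> str:
    """Return 'edit', 'generate-batch', or 'generate' for a request (hypothetical helper)."""
    # An input image or an edit keyword takes priority over everything else.
    if has_input_image or EDIT_WORDS.search(request):
        return "edit"
    # "Many different prompts/assets" is approximated here as more than one prompt.
    if prompt_count > 1:
        return "generate-batch"
    return "generate"
```

The branches are checked in the same order as the list above, so an edit request that also involves several assets still resolves to edit.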
#Workflow
- Decide intent -- Generate vs edit vs batch
- Collect inputs -- Prompt(s), exact text, constraints, input images/masks
- Classify -- Map to a taxonomy slug (see categories below)
- Augment prompt -- Reformat into a structured spec without inventing new requirements
- Run CLI -- Execute scripts/image_gen.py with sensible defaults
- Inspect and validate -- Check subject, style, composition, text accuracy
- Iterate -- Single targeted change, re-run, re-check
- Save -- Return final outputs with the prompt and flags used
#Taxonomy categories
Generate:
- photorealistic-natural -- Candid/editorial lifestyle scenes
- product-mockup -- Product/packaging shots, catalog imagery
- ui-mockup -- App/web interface mockups
- infographic-diagram -- Diagrams with structured layout and text
- logo-brand -- Logo/mark exploration
- illustration-story -- Comics, narrative scenes
- stylized-concept -- Style-driven concept art, 3D renders
- historical-scene -- Period-accurate scenes
Edit:
- text-localization -- Translate/replace in-image text
- identity-preserve -- Try-on, person-in-scene
- precise-object-edit -- Remove/replace a specific element
- lighting-weather -- Time-of-day/atmosphere changes
- background-extraction -- Transparent background / clean cutout
- style-transfer -- Apply reference style
- compositing -- Multi-image merge
- sketch-to-render -- Line art to photoreal
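One way to keep classification honest is to validate a slug against the two lists before using it. The sketch below copies the slugs from the categories above; the set names and the `mode_for_slug` helper are hypothetical and do not come from the skill itself.

```python
# Slugs copied from the taxonomy lists; the container names are illustrative.
GENERATE_SLUGS = {
    "photorealistic-natural", "product-mockup", "ui-mockup", "infographic-diagram",
    "logo-brand", "illustration-story", "stylized-concept", "historical-scene",
}
EDIT_SLUGS = {
    "text-localization", "identity-preserve", "precise-object-edit", "lighting-weather",
    "background-extraction", "style-transfer", "compositing", "sketch-to-render",
}


def mode_for_slug(slug: str) -> str:
    """Return 'generate' or 'edit' for a known slug; reject anything else."""
    if slug in GENERATE_SLUGS:
        return "generate"
    if slug in EDIT_SLUGS:
        return "edit"
    # Failing loudly avoids silently inventing a category that is not in the taxonomy.
    raise ValueError(f"unknown taxonomy slug: {slug!r}")
```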
#Prompt template
Use case: <taxonomy slug>
Asset type: <where the asset will be used>
Primary request: <user's main prompt>
Style/medium: <photo/illustration/3D/etc>
Composition/framing: <wide/close/top-down>
Lighting/mood: <lighting + mood>
Text (verbatim): "<exact text>"
Constraints: <must keep/must avoid>
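The template can be filled programmatically. The following is a minimal sketch, assuming a hypothetical `PromptSpec` dataclass and `render_prompt` helper (neither is part of the skill); omitting fields the user left empty is also an assumption, chosen so the spec does not invent requirements.

```python
from dataclasses import dataclass


@dataclass
class PromptSpec:
    """Hypothetical container mirroring the template fields."""
    use_case: str
    asset_type: str
    primary_request: str
    style_medium: str = ""
    composition: str = ""
    lighting_mood: str = ""
    text_verbatim: str = ""
    constraints: str = ""


def render_prompt(spec: PromptSpec) -> str:
    """Render the spec using the template labels, skipping empty fields."""
    fields = [
        ("Use case", spec.use_case),
        ("Asset type", spec.asset_type),
        ("Primary request", spec.primary_request),
        ("Style/medium", spec.style_medium),
        ("Composition/framing", spec.composition),
        ("Lighting/mood", spec.lighting_mood),
        # Exact text is wrapped in quotes so it is reproduced verbatim.
        ("Text (verbatim)", f'"{spec.text_verbatim}"' if spec.text_verbatim else ""),
        ("Constraints", spec.constraints),
    ]
    return "\n".join(f"{label}: {value}" for label, value in fields if value)
```

For example, `render_prompt(PromptSpec("logo-brand", "website header", "minimal fox logo", text_verbatim="Foxly"))` yields four lines: use case, asset type, primary request, and the quoted text.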
#Environment
- OPENAI_API_KEY must be set for live API calls
- Python packages: openai, pillow
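A quick preflight check for these requirements might look like the sketch below. `missing_requirements` is a hypothetical helper, not something the skill ships; it relies only on the standard library and on the fact that the pillow package is imported as `PIL`.

```python
import importlib.util
import os


def missing_requirements(env=None):
    """Return a list of human-readable problems; an empty list means ready to run."""
    env = os.environ if env is None else env
    problems = []
    if not env.get("OPENAI_API_KEY"):
        problems.append("OPENAI_API_KEY is not set")
    # (import name, pip package name): pillow installs the module named PIL.
    for module, package in (("openai", "openai"), ("PIL", "pillow")):
        if importlib.util.find_spec(module) is None:
            problems.append(f"Python package not installed: {package}")
    return problems
```

Calling `missing_requirements()` before a live API call surfaces configuration problems up front; passing a dict as `env` makes the check easy to exercise in isolation.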
This skill is from the OpenAI Skills Catalog.