john-carroll-sw/generating-images-with-gpt-image-1

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.env.sample		.env.sample
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
aoai_responses_gpt_41.py		aoai_responses_gpt_41.py
aoai_responses_structured.py		aoai_responses_structured.py
aoai_responses_structured.ts		aoai_responses_structured.ts
get_item_image.py		get_item_image.py
get_item_image.ts		get_item_image.ts
package-lock.json		package-lock.json
package.json		package.json

Repository files navigation

OpenAI Image Generation & Extraction

Demo

This repository demonstrates how to use OpenAI's Response API (with GPT-4o, and gpt-image-1 via Azure OpenAI or OpenAI) to:

Generate images from a text prompt
Generate images from an existing image and a prompt (image-to-image)
Extract the main product image URL from an e-commerce product page

It provides both Python and TypeScript implementations, returning structured outputs for easy integration.

Features

Generate images from a prompt using GPT-4o, or gpt-image-1
Generate images from an image and a prompt (image-to-image)
Extracts the main product image URL from a given e-commerce product page
Uses OpenAI tool-calling to orchestrate the extraction and generation process
Returns a structured response with the image URL, source URL, and extraction/generation method
Supports both Python and TypeScript (Node.js) environments

How it Works

For generation: The script sends a prompt (and optionally an image) to the OpenAI/Azure OpenAI API to generate a new image.
The result is returned in a structured format, including the extraction or generation method used.

Flow Diagram

flowchart TD
 A[User provides product URL or prompt (+ optional image)] --> B(OpenAI Response API: tool calling)
 B --> C{Tool: extract_image_url or generate_image}
 C --> D[Fetch product page HTML or send prompt/image to API]
 D --> E{Extraction or Generation strategies}
 E -->|og:image| F[Extract from Open Graph meta tag]
 E -->|product__media| G[Extract from product__media class]
 E -->|first_img| H[Extract first <img> tag]
 E -->|prompt| L[Generate from prompt]
 E -->|image+prompt| M[Generate from image and prompt]
 F & G & H & L & M --> I[Return structured output]
 I --> J[OpenAI returns structured JSON]
 J --> K[User receives image_url, source_url, found_method]

Example Output

{
 "image_url": "http://minecraftshop.com/cdn/shop/files/MINE-PLU1_R_MF_1200x1200.jpg?v=1729532536",
 "source_url": "https://minecraftshop.com/collections/plush/products/minecraft-goat-8-plush",
 "found_method": "og:image"
}

Requirements

Python 3.8+ (for Python version)
Node.js 18+ (for TypeScript version)
Access to Azure OpenAI or OpenAI API (with appropriate keys and endpoint)

Setup

1. Clone the repository

git clone <repo-url>
cd get-item-image

2. Environment Variables

Copy .env.sample to .env and fill in your API keys and endpoints:

cp .env.sample .env

Edit .env and set:

AZURE_OPENAI_V1_API_ENDPOINT
AZURE_OPENAI_API_KEY
AZURE_OPENAI_API_MODEL

3. Install Dependencies

Python/.py

pip install -r requirements.txt
# or manually:
pip install openai python-dotenv beautifulsoup4 requests

TypeScript/Node.js

npm install

Usage

Python

python get_item_image.py "https://minecraftshop.com/collections/plush/products/minecraft-goat-8-plush"
# or for image generation:
python get_item_image.py --generate --prompt "A cat in a spacesuit"
python get_item_image.py --generate --prompt "A cat in a spacesuit" --image path/to/input.jpg

TypeScript

npx ts-node get_item_image.ts "https://minecraftshop.com/collections/plush/products/minecraft-goat-8-plush"
# or for image generation:
npx ts-node get_item_image.ts --generate --prompt "A cat in a spacesuit"
npx ts-node get_item_image.ts --generate --prompt "A cat in a spacesuit" --image path/to/input.jpg

File Overview

get_item_image.py — Python implementation
get_item_image.ts — TypeScript/Node.js implementation
.env.sample — Example environment variable file
demo.gif — Demo animation

License

MIT

About

Generate images from a prompt or from an image and a prompt using OpenAI's Response API (GPT-4.1, GPT-4o, and gpt-image-1) via Azure OpenAI or OpenAI. Includes both Python and TypeScript implementations for easy integration.

Releases

No releases published

Packages

No packages published

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

john-carroll-sw/generating-images-with-gpt-image-1

Folders and files

Latest commit

History

Repository files navigation

OpenAI Image Generation & Extraction

Features

How it Works

Flow Diagram

Example Output

Requirements

Setup

1. Clone the repository

2. Environment Variables

3. Install Dependencies

Python/.py

TypeScript/Node.js

Usage

Python

TypeScript

File Overview

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Languages

License

john-carroll-sw/generating-images-with-gpt-image-1

Folders and files

Latest commit

History

Repository files navigation

OpenAI Image Generation & Extraction

Features

How it Works

Flow Diagram

Example Output

Requirements

Setup

1. Clone the repository

2. Environment Variables

3. Install Dependencies

Python/.py

TypeScript/Node.js

Usage

Python

TypeScript

File Overview

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages