OpenAI’s New GPT Image Model API in 5 Minutes 📸
AI Summary
OpenAI GPT Image 1 Model Release
- Launch Date: Recently released through OpenAI’s API.
- Popularity: In the first week of image generation in ChatGPT, more than 130 million users created over 700 million images.
Access and Features
- Developer Access: Available for all developer tiers; ID validation required.
- Integration Examples: Companies like Adobe, Airtable, Figma, and Gamma are using the model in their products.
- Moderation Parameters: Options for auto mode (standard filtering) and low mode (less restrictive filtering).
Pricing Structure
- Costs: $10 per million image input tokens, $40 per million image output tokens.
- Image Costs: Approximately 2, 7, or 19 cents per generated image for low, medium, and high-quality square images, respectively.
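
The pricing arithmetic can be sketched as follows. This is a rough estimator built only from the figures in this summary; the function and constant names are made up for illustration, text input tokens are not counted, and the per-image prices are the approximate values quoted above, not exact billing amounts.

```python
# Rates quoted in this summary, in USD per million tokens.
IMAGE_INPUT_USD_PER_MILLION = 10
IMAGE_OUTPUT_USD_PER_MILLION = 40

# Approximate per-image prices for square images, in cents (from this summary).
APPROX_CENTS_PER_SQUARE_IMAGE = {"low": 2, "medium": 7, "high": 19}


def token_cost_usd(image_input_tokens: int, output_tokens: int) -> float:
    """Cost of one request from its token counts (text input tokens excluded)."""
    total_micro_usd = (
        image_input_tokens * IMAGE_INPUT_USD_PER_MILLION
        + output_tokens * IMAGE_OUTPUT_USD_PER_MILLION
    )
    return total_micro_usd / 1_000_000


def batch_cost_cents(quality: str, count: int) -> int:
    """Rough cost, in cents, of `count` square images at the given quality."""
    return APPROX_CENTS_PER_SQUARE_IMAGE[quality] * count
```

For example, `batch_cost_cents("medium", 100)` gives 700 cents, or about $7 for a hundred medium-quality square images. Working in whole cents avoids floating-point rounding in the batch estimate.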
API Access
- Playground URL: OpenAI Playground
- Usage: Users can try different examples, but API costs apply.
Requesting Images
- Setup: Use the OpenAI SDK, passing the image model name (gpt-image-1) and a prompt.
- Inpainting Feature: Allows users to edit parts of an image using masks, requiring the mask to match the input image’s format and size.
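
A minimal sketch of both request types with the OpenAI Python SDK, under some assumptions: the image and mask are both PNG files, the API key is in the `OPENAI_API_KEY` environment variable, and the helper names (`png_size`, `check_mask`, `generate_image`, `edit_image`) are invented for this example. The mask check mirrors the "same format and size" requirement by reading the PNG header locally before anything is uploaded.

```python
import base64
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_size(data: bytes) -> tuple:
    """Read (width, height) from a PNG's IHDR chunk without decoding the image."""
    if data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG file")
    return struct.unpack(">II", data[16:24])


def check_mask(image: bytes, mask: bytes) -> None:
    """Raise ValueError unless both inputs are PNGs with identical dimensions."""
    if png_size(image) != png_size(mask):
        raise ValueError("mask and image must have the same dimensions")


def generate_image(prompt: str, out_path: str, moderation: str = "auto") -> None:
    """Generate one image; moderation is 'auto' (standard) or 'low' (less strict)."""
    if moderation not in ("auto", "low"):
        raise ValueError("moderation must be 'auto' or 'low'")
    from openai import OpenAI  # lazy import so the helpers above work offline

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    result = client.images.generate(
        model="gpt-image-1", prompt=prompt, moderation=moderation
    )
    with open(out_path, "wb") as f:
        f.write(base64.b64decode(result.data[0].b64_json))


def edit_image(prompt: str, image_path: str, mask_path: str, out_path: str) -> None:
    """Inpaint the masked region of an image according to the prompt."""
    from openai import OpenAI

    with open(image_path, "rb") as img, open(mask_path, "rb") as msk:
        check_mask(img.read(), msk.read())
        img.seek(0)  # rewind so the SDK uploads the files from the start
        msk.seek(0)
        result = OpenAI().images.edit(
            model="gpt-image-1", image=img, mask=msk, prompt=prompt
        )
    with open(out_path, "wb") as f:
        f.write(base64.b64decode(result.data[0].b64_json))
```

Calling `generate_image("a watercolor fox", "fox.png")` would write one image to disk and incur API costs. Validating the mask locally is a design choice that fails fast on a mismatched mask instead of paying for a rejected request.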
Output Specifications
- Aspect Ratios: Available options include square, portrait, and landscape.
- Image Formats: Outputs can be PNG (the default), JPEG, or WebP. Transparent backgrounds are available for PNG and WebP, and a compression level can be set for JPEG and WebP.
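
The output options above map to request parameters roughly as sketched here. The helper `output_options` and the `SIZES` table are hypothetical names for this example; the parameter names (`size`, `output_format`, `background`, `output_compression`) and pixel sizes reflect my understanding of the Images API and are worth checking against the current reference, including the assumption that PNG is an accepted format alongside JPEG and WebP.

```python
# Pixel sizes assumed for the three aspect ratios.
SIZES = {"square": "1024x1024", "portrait": "1024x1536", "landscape": "1536x1024"}


def output_options(aspect="square", fmt="png", transparent=False, compression=None):
    """Build keyword arguments for an image request, rejecting invalid combinations."""
    if aspect not in SIZES:
        raise ValueError(f"unknown aspect ratio: {aspect}")
    if fmt not in ("png", "jpeg", "webp"):
        raise ValueError(f"unknown output format: {fmt}")
    if transparent and fmt == "jpeg":
        raise ValueError("JPEG has no alpha channel; use png or webp")
    if compression is not None:
        if fmt == "png":
            raise ValueError("compression applies only to jpeg and webp")
        if not 0 <= compression <= 100:
            raise ValueError("compression must be between 0 and 100")

    options = {"size": SIZES[aspect], "output_format": fmt}
    if transparent:
        options["background"] = "transparent"
    if compression is not None:
        options["output_compression"] = compression
    return options
```

The resulting dictionary can be unpacked into a generation call, for example `client.images.generate(model="gpt-image-1", prompt=prompt, **output_options("landscape", "webp", compression=80))`.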
Limitations
- Processing Time: Complex prompts may take up to 2 minutes.
- Text Generation Issues: Challenges with text clarity and placement, along with maintaining visual consistency for recurring elements.
Summary
- The OpenAI GPT Image 1 model is now available, bringing powerful image generation capabilities to developers and various platforms.