Google’s NEW OpenAI killer 💥 The CHEAPEST Reasoning AI Model for Developers💥
AI Summary
Overview of Gemini 2.5 Flash
- Developed by Google DeepMind, aimed to challenge OpenAI’s models.
- Introduces a customizable thinking mode for reasoning.
Key Features
- Thinking Mode: Enables/disables reasoning; adjustable budget for reasoning in tokens.
- Performance Benchmarks:
- Closer to OpenAI’s O4 mini in capability.
- Scores: 12.1 (Gemini 2.5 Flash) vs. 14.3 (O4 mini).
- 78% on AM benchmarks.
- 51.1% on polyglot tasks.
- Cost Efficiency:
- 15 cents per million input tokens, 60 cents for output without reasoning, $3.5 with reasoning.
- O4 mini costs significantly more (e.g., $4.4 for reasoning).
Performance Observations
- Latency: Gemini 2.5 Flash takes longer to process tasks compared to O4 mini (54 seconds vs. 6 seconds).
- Model Versatility: Supports long context inputs (up to 1 million tokens) and various types of queries.
Accessing and Using the Model
- Available via Google AI Studio:
- Open a new chat in AI Studio.
- Select Gemini 2.5 Flash from the model options.
- Enable/disable thinking mode and code execution.
- Use Cases:
- Can solve computational problems and generate code (e.g., Fibonacci series).
- Capable of creating visual aids like charts after processing input data.
- Demonstrated multilingual capabilities, translating text effectively.