Google’s NEW OpenAI killer 💥 The CHEAPEST Reasoning AI Model for Developers💥



AI Summary

Overview of Gemini 2.5 Flash

  • Developed by Google DeepMind, aimed to challenge OpenAI’s models.
  • Introduces a customizable thinking mode for reasoning.

Key Features

  • Thinking Mode: Enables/disables reasoning; adjustable budget for reasoning in tokens.
  • Performance Benchmarks:
    • Closer to OpenAI’s O4 mini in capability.
    • Scores: 12.1 (Gemini 2.5 Flash) vs. 14.3 (O4 mini).
    • 78% on AM benchmarks.
    • 51.1% on polyglot tasks.
  • Cost Efficiency:
    • 15 cents per million input tokens, 60 cents for output without reasoning, $3.5 with reasoning.
    • O4 mini costs significantly more (e.g., $4.4 for reasoning).

Performance Observations

  • Latency: Gemini 2.5 Flash takes longer to process tasks compared to O4 mini (54 seconds vs. 6 seconds).
  • Model Versatility: Supports long context inputs (up to 1 million tokens) and various types of queries.

Accessing and Using the Model

  • Available via Google AI Studio:
    1. Open a new chat in AI Studio.
    2. Select Gemini 2.5 Flash from the model options.
    3. Enable/disable thinking mode and code execution.
  • Use Cases:
    • Can solve computational problems and generate code (e.g., Fibonacci series).
    • Capable of creating visual aids like charts after processing input data.
    • Demonstrated multilingual capabilities, translating text effectively.