Google has unveiled the Gemini 3.1 Flash-Lite, the fastest and most affordable model in the Gemini 3 series, specifically designed for developers.
Scientists confirm: This is the most effective way to get your cat’s attention, according to new research
Elderly Couple Refuses Reserved Seats—Viral Train Standoff Sparks Fiery Debate on Courtesy
Just two weeks after the release of its most advanced model, the Gemini 3.1 Pro, Google is now taking a different approach by introducing the Gemini 3.1 Flash-Lite on Tuesday, March 3, 2026. This new model is not focused on peak performance but rather on processing high-volume tasks at minimal cost. It is available immediately for developer preview through the Gemini API in Google AI Studio and Vertex AI.
The “Workhorse” Model of the Gemini 3 Series
With Flash-Lite, Google aims at a specific market segment, namely the repetitive and massive tasks that companies need to handle daily. Large-scale translation, content moderation, data extraction, image sorting, and routing requests to more powerful models are some of the use cases where speed and cost per request are more critical than deep reasoning capabilities.
In this video, Flash-Lite analyzes and sorts images:
Why You Should Never Reheat These Foods in the Microwave – The Hidden Dangers Experts Warn About
I tried the top 5 guard dogs—here’s what makes these breeds the ultimate protectors
In its blog post, Google describes the model as “designed for high-volume, large-scale developer workloads”. The pricing strategy reflects this orientation: Flash-Lite is positioned below GPT-5 mini and Claude 4.5 Haiku in terms of cost, while offering a significantly faster generation speed compared to its direct competitors, according to benchmarks released by Google. The Mountain View company has not published any benchmarks focused on agents, which suggests that this model is not intended for orchestrating complex tasks or managing fleets of AI agents.
A Reasoning Slider to Adjust the Cost-Intelligence Ratio
Gemini 3.1 Flash-Lite includes configurable levels of reasoning accessible directly from AI Studio or Vertex AI. According to the API documentation, developers can adjust the model’s reasoning level based on the task, opting for a high setting for cases requiring step-by-step reasoning or a low setting for simple, high-throughput processes.
This feature is crucial for managing high-frequency workloads. 3.1 Flash-Lite can handle large-scale tasks like translating substantial volumes and moderating content, where cost is a critical factor. It can also manage more complex workloads that require deeper reasoning.
The economic benefit is clear: the less the model reasons, the fewer tokens it generates, and the lower the cost. For industrial uses where volumes are counted in millions of requests, this lever is far from trivial.
Note: Flash-Lite is unlikely to be available in the Gemini consumer app. The tool is intended for developers and businesses.
A Quick Overview of the Gemini 3 Series
- Gemini 3 Pro (November 2025): the flagship model, focused on advanced reasoning and multimodal understanding. Available in the Gemini app for subscribers.
- Gemini 3 Flash (December 2025): a faster version, three times quicker than the Gemini 2.5 Pro according to Google. The default model in the Gemini app.
- Gemini 3.1 Pro (February 2026): an update of the Gemini 3 Pro, with enhanced reasoning and improved code generation capabilities. Priced the same as its predecessor.
- Gemini 3.1 Flash-Lite (March 2026): the lightest and least expensive of the family. Designed for mass processing at low cost, reserved for developers through the API.
Similar Posts
- Google Chooses Gemini 3 Flash as Default Model: What You Need to Know!
- Google Unveils Gemini 3.1 Pro: New AI Model Dominates Benchmarks Again!
- Google’s Gemini 3 AI Takes on ChatGPT: Revolutionary Model Unveiled!
- Gemini Update: Google Supercharges API with 2.5 Flash, Pro, and New Multimodal Features
- Google Strikes Back at OpenAI: Launches Interactions API and Enhanced Deep Research

Jordan Park writes in-depth reviews and editorial opinion pieces for Touch Reviews. With a background in UI/UX design, Jordan offers a unique perspective on device usability and user experience across smartphones, tablets, and mobile software.