Google launches Gemini 3.1 Flash Lite AI model with faster speed and lower cost: Check details

HIGHLIGHTS

Google has introduced Gemini 3.1 Flash Lite, which is said to be the fastest and most cost-efficient Gemini 3 series model.

The company says that 3.1 Flash Lite is designed for high-volume developer workloads at scale.

The new model is rolling out in preview to developers.

Google has introduced Gemini 3.1 Flash Lite, which is said to be the fastest and most cost-efficient Gemini 3 series model. The company says that 3.1 Flash Lite is designed for high-volume developer workloads at scale and offers high quality for its price and model tier. The new model is rolling out in preview to developers through the Gemini API in Google AI Studio and for enterprises via Vertex AI.

One of the biggest highlights of Gemini 3.1 Flash Lite is its cost-efficiency. It costs $0.25 per one million input tokens and $1.50 per one million output tokens. ‘3.1 Flash Lite delivers enhanced performance at a fraction of the cost of larger models,’ the tech giant explains. ‘It outperforms 2.5 Flash with a 2.5X faster Time to First Answer Token and 45 per cent increase in output speed, according to the Artificial Analysis benchmark while maintaining similar or better quality. ‘

Also read: OpenAI introduces GPT 5.3 Instant for ChatGPT: Check new upgrades and availability details

Also, Gemini 3.1 Flash Lite achieved an Elo score of 1432 on the Arena.ai Leaderboard and outperformed other models of similar tier across reasoning and multimodal understanding benchmarks, as per Google.

Another useful feature is the thinking levels in AI Studio and Vertex AI. This allows developers to control how much reasoning power the model uses for each task.

Also read: After Apple iPhone 17e launch, iPhone 16e now available with over Rs 11,000 discount on this platform

‘Early-access developers on AI Studio and Vertex AI, and companies like Latitude, Cartwheel and Whering are already using 3.1 Flash Lite to solve complex problems at scale. Early testers highlighted 3.1 Flash Lite’s efficiency and reasoning capabilities, saying it can handle complex inputs with the precision of a larger-tier model, plus follow instructions and maintain adherence,’ Google said.

Also read: Apple iPhone 18 Pro Max, iPhone 18 Pro leaks: When will they launch and how much they may cost

You May Also Like

OpenAI introduces GPT 5.3 Instant for ChatGPT: Check new upgrades and availability details

Updated on 04-Mar-2026

OpenAI scientist says Pentagon deal not worth it amid growing backlash

Updated on 03-Mar-2026

Happy Holi WhatsApp status video: Download and share short videos on Instagram, Facebook and more

Updated on 03-Mar-2026

Holi 2026 wishes: Best Holi wishes, quotes, status, images to share on WhatsApp, Instagram and FB

Updated on 03-Mar-2026

Want to make Instagram-ready Holi images? These 5 prompts will make it easier for you

Updated on 03-Mar-2026

Ayushi Jain

Ayushi works as Chief Copy Editor at Digit, covering everything from breaking tech news to in-depth smartphone reviews. Prior to Digit, she was part of the editorial team at IANS.

Ayushi Jain

04-Mar-2026

Google launches Gemini 3.1 Flash Lite AI model with faster speed and lower cost: Check details

Google has introduced Gemini 3.1 Flash Lite, which is said to be the fastest and most cost-efficient Gemini 3 series model.

The company says that 3.1 Flash Lite is designed for high-volume developer workloads at scale.

The new model is rolling out in preview to developers.

Latest Article