• About Us
  • Privacy Policy
  • Disclaimers
  • Terms and Conditions
  • Contact Us
  • DMCA Policy
Tech Chilli
  • AI
  • AI India
  • Robotics
  • Fintech
  • Crypto
  • Courses
  • How-To
  • Gaming
  • Contact Us
No Result
View All Result
  • AI
  • AI India
  • Robotics
  • Fintech
  • Crypto
  • Courses
  • How-To
  • Gaming
  • Contact Us
No Result
View All Result
Tech Chilli
No Result
View All Result

Home » News » Google Unveils Gemini 1.5 Flash-8B: Faster, Cheaper AI for Developers at Half the Cost

Google Unveils Gemini 1.5 Flash-8B: Faster, Cheaper AI for Developers at Half the Cost

Google has launched Gemini 1.5 Flash-8B, a faster and more affordable version of its popular AI model. Designed for developers, it offers a 50% cost reduction while delivering faster processing speeds and enhanced performance. Ideal for low-power devices like sensors and smartphones, Gemini 1.5 Flash-8B aims to make AI more accessible and efficient.

Kumud Sahni Pruthi by Kumud Sahni Pruthi
Sunday, 6 October 2024, 0:04 AM
in News
Gemini 1.5 Flash 8B

Gemini 1.5 Flash 8B

Google LLC is releasing a quicker and more compact version of its well-liked Gemini 1.5 Flash artificial intelligence model.

At half the price, it is substantially cheaper and goes by the name Gemini 1.5 Flash-8B. Google’s Gemini 1.5 Flash big language model is a lightweight variant geared for speed and efficiency that may be used on low-power devices like sensors and smartphones.

A few weeks after the company’s May announcement at Google I/O 2024, Gemini 1.5 Flash was made available to a select group of paying clients. A few weeks later, the Gemini mobile app made the device available for free, but with certain usage limitations.

At the end of June, it became generally available and offered high-speed processing, a competitive price, and a context window with a million tokens. When it was first released, Google claimed that its input size was 60% larger and 40% faster on average than OpenAI’s GPT-3.5 Turbo.

Also Read: Google Develops AI with Human-Like Reasoning, Rivals OpenAI’s O1 Model

The initial version, which powers the Eats AI assistant in Uber Technologies Inc.’s UberEats food delivery service, was created to offer a very low token input fee, making it price-competitive for developers. Customers like Uber Technologies Inc. embraced the original version.

Google is releasing the Gemini 1.5 Flash-8B, which is 50% less expensive and has double the rate limits of the 1.5 Flash, making it one of the lightest LLMs on the market. According to the business, it also provides reduced latency on little prompts.

Gemini 1.5 Flash-8B is available to developers via Google AI Studio and the Gemini API at no cost.

Gemini API Senior Product Manager Logan Kilpatrick stated in a blog post that the business has improved 1.5 Flash “considerably,” listened to developer feedback, and is “testing the limits” of what can be achieved with such lightweight LLMs.

He clarified that last month, the business had announced the release of an experimental Gemini 1.5 Flash-8B version. Since then, it has undergone more refinement, and it is currently widely accessible for usage in production.

Also Read: Google Strengthens India’s Healthcare with Free AI-Powered Screenings for Cancer and TB

Kilpatrick claims that the 8-B version can nearly match the 1.5 Flash model’s performance on some important benchmarks and that it performs particularly well on jobs like chat, transcription, and extended context language translation.

Kilpatrick continued, “Developer feedback and our testing of what is possible with these models continue to inform our release of best-in-class small models.” “We believe this model has the greatest potential for tasks ranging from lengthy context summarization tasks to high volume multimodal use cases.”

Kilpatrick continued, “Of all the Gemini models released to date, the Gemini 1.5 Flash-8B offers the lowest cost per intelligence.”

Also Read: How to Use Google Gemini Live on Android App Users?

The pricing is in line with comparable models from Anthropic PBC and OpenAI. Regarding OpenAI, the most affordable model remains GPT-4o small, with an input cost of $0.15 per million; however, this decreases by 50% when using batched queries and reusable prompt prefixes. The Claude 3 Haiku model from Anthropic is the most economical at $0.25/M, while cached tokens are just $0.03/M.

Kilpatrick further stated that the corporation is attempting to increase the use of 1.5 Flash-8B for straightforward, high-volume activities by increasing its rate limits. As a result, 4,000 queries can now be sent by developers every minute, according to him.

Also Read: Google AI Skills House to Empower 10 Million Indians with Free AI Courses, Launched at Google India 2024

Previous Post

Meta Launches Movie Gen AI: Create Realistic Videos & Sound, Competing with Top AI Platforms

Next Post

Jhana AI Raises $1.6M, Bringing AI-Powered Efficiency to Legal Research and Document Review

Kumud Sahni Pruthi

Kumud Sahni Pruthi

A postgraduate in Science with an inclination towards education and technology. She always looks for ways to help people improve their lives by putting complex things into simple words through her writing.

Next Post
JhanaAI

Jhana AI Raises $1.6M, Bringing AI-Powered Efficiency to Legal Research and Document Review

  • Trending
  • Comments
  • Latest
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2026: Maximize APY with Secure and Trusted Crypto Tools

January 4, 2026

What are 10 Largest AI Data Centers in the World?

December 15, 2025
Best NFT discord servers

[Updated] Top 13 NFT Discord Servers (Groups) to Join In 2025 with Channel Name

April 22, 2025
AI Courses on edx

Best edX AI Courses and Certifications in 2024 (FREE and Paid)

August 27, 2024
Perplexity Campus Strategist Program 2024

Perplexity Campus Strategist Program 2024: How to Apply and Key Benefits

Gaurav Chaudhary Net Worth

Gaurav Chaudhary Net Worth – Technical Guruji, Indian YouTuber

Best AI Development Platforms and Tools in 2026

All About Canva Tools & Features

How to Use Canva AI Tools and Features to Enhance Your Posts and Designs?

Best AI Model for Every Task: Image, Video, PPT and More

June 17, 2026
Agentic-AI

What is Agentic AI? Check How it Works with Real-Life Agentic AI Automation Examples

June 14, 2026
Free Online Vocal Remover AI Tools

13 Best Free Online Vocal Remover AI Tools in 2026

January 4, 2026
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2026: Maximize APY with Secure and Trusted Crypto Tools

January 4, 2026

Recent News

Best AI Model for Every Task: Image, Video, PPT and More

June 17, 2026
Agentic-AI

What is Agentic AI? Check How it Works with Real-Life Agentic AI Automation Examples

June 14, 2026
Free Online Vocal Remover AI Tools

13 Best Free Online Vocal Remover AI Tools in 2026

January 4, 2026
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2026: Maximize APY with Secure and Trusted Crypto Tools

January 4, 2026

Trending in AI

  • Perplexity CEO Net Worth
  • Grammarly AI Detection
  • What is LangChain
  • Canva AI Tool
  • Koupon AI
Tech Chilli

Tech Chilli is a beacon of knowledge, a relentless purveyor of the latest information, news, and groundbreaking research in the realm of cutting-edge technology.

We are dedicated to curating and delivering the most relevant, accurate, and up-to-the-minute information on the technologies that are shaping our world.
Contact us – su*****@********li.com

Follow Us

Browse by Category

  • AI
  • AI India
  • AI Tools
  • Courses
  • Crypto
  • Featured
  • FinTech
  • Gaming
  • How-To
  • News
  • Puzzles
  • Robotics

Top Searches

  • Scott Wu Net Worth
  • Mira Murati Net Worth
  • Online Games for Couples
  • Amazon Q vs Microsoft Copilot
  • DarkGPT

Recent News

Best AI Model for Every Task: Image, Video, PPT and More

June 17, 2026
Agentic-AI

What is Agentic AI? Check How it Works with Real-Life Agentic AI Automation Examples

June 14, 2026
Free Online Vocal Remover AI Tools

13 Best Free Online Vocal Remover AI Tools in 2026

January 4, 2026
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2026: Maximize APY with Secure and Trusted Crypto Tools

January 4, 2026
  • About Us
  • Privacy Policy
  • Disclaimers
  • Terms and Conditions
  • Contact Us
  • DMCA Policy

© 2025 Tech Chilli

No Result
View All Result
  • AI
  • AI India
  • Robotics
  • Fintech
  • Crypto
  • Courses
  • How-To
  • Gaming
  • Contact Us

© 2025 Tech Chilli

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.