• About Us
  • Privacy Policy
  • Disclaimers
  • Terms and Conditions
  • Contact Us
  • DMCA Policy
Tech Chilli
  • AI
  • AI India
  • Robotics
  • Fintech
  • Crypto
  • Courses
  • How-To
  • Gaming
  • Contact Us
No Result
View All Result
  • AI
  • AI India
  • Robotics
  • Fintech
  • Crypto
  • Courses
  • How-To
  • Gaming
  • Contact Us
No Result
View All Result
Tech Chilli
No Result
View All Result

Home » News » Google DeepMind Reveals a Visual Processing Architecture to Lower Processing Expenses

Google DeepMind Reveals a Visual Processing Architecture to Lower Processing Expenses

Google DeepMind researchers have developed the Mixture of Nested Experts (MoNE) framework, which significantly lowers processing costs for photos and videos without compromising accuracy. MoNE divides processing power across visual tokens dynamically, allowing larger models to handle more significant tokens and smaller stacked models to handle less significant ones.

Kumud Sahni Pruthi by Kumud Sahni Pruthi
Thursday, 1 August 2024, 23:08 PM
in News

The Mixture of Nested Experts (MoNE), a unique framework developed by a group of Google DeepMind researchers, dramatically lowers processing costs for photos and videos without compromising accuracy. This novel method divides up the processing power across various visual tokens according to priority in a dynamic manner.

Go here to read the entire paper.

When compared to baseline models, MoNE achieves an inference time compute reduction of over two orders of magnitude while retaining equal performance on common image and video datasets. MoNE achieved 64.4% accuracy on the Something-Something-v2 video dataset, utilizing just 162.8 GFLOPs as opposed to the baseline’s 376.3 GFLOPs.

Also Read: Gemma 2 by Google: Revolutionizing AI with 9B and 27B Parameter Models

The idea of nested models—in which smaller sub-models are enclosed within bigger ones—is the foundation of the framework. MoNE dynamically assigns visual tokens to these layered experts of different sizes via a router network. This enables larger, more computationally expensive models to handle more significant or informative tokens, while smaller stacked models handle-less significant ones.

Using the Kinetics-400 and Something-Something-v2 datasets for video classification, as well as the ImageNet-21k dataset for image classification, the team assessed MoNE. They discovered that MoNE continuously performed better than alternative methods such as a Mixture of Depths and baseline models, particularly at smaller computational budgets. 

MoNE’s ability to use a single trained model to adapt to various inference-time compute budgets is one of its main advantages. Because of its adaptability, the framework can accommodate different computing restrictions without the need for retraining.

Also Read: Google DeepMind’s PaliGemma: A Small But Mighty Open-Source Vision-Language Model

MoNE successfully recognized significant regions in pictures and videos, as demonstrated by visualizations, and routed tokens from these regions to more complex nested models. This illustrates how the framework might concentrate processing power on the most illuminating portions of visual inputs. 

The researchers point out that although MoNE was initially created for encoder architectures, it is still difficult to extend to autoregressive decoding in big language models. They also draw attention to the possible societal benefits of MoNE, such as the reduction of energy consumption and carbon emissions during model inference and the democratization of access to AI through the wider use of trained models without the need for significant computational resources.

Also Read: Google Cloud Partners with Mistral AI to Boost Vertex AI with Codestral Code Generation

Methods like MoNE can drastically lower computational costs as deep learning models get bigger and more complicated. Additionally, in contexts with limited resources, preserving performance is probably going to become more and more crucial for real-world AI systems. 

Google DeepMind is always creating cutting-edge AI frameworks and models. They released the MatFormer Framework last month, which improves on-device capabilities by enabling users to mix and match AI models within a single framework to optimize performance for particular tasks, and they also introduced Foundational Large Autorater Models (FLAMe) for a variety of quality assessment tasks.
Also Read: What is Google Deepmind’s SynthID? How Does it Work?

Previous Post

What are GitHub Models and How are they useful for AI Engineers?

Next Post

Arjun Pillai Net Worth: Co-founder and CEO of DocketAI – AI Sales Engineer Company

Kumud Sahni Pruthi

Kumud Sahni Pruthi

A postgraduate in Science with an inclination towards education and technology. She always looks for ways to help people improve their lives by putting complex things into simple words through her writing.

Next Post
Arjun Pillai Net Worth

Arjun Pillai Net Worth: Co-founder and CEO of DocketAI - AI Sales Engineer Company

  • Trending
  • Comments
  • Latest
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2026: Maximize APY with Secure and Trusted Crypto Tools

January 4, 2026

What are 10 Largest AI Data Centers in the World?

December 15, 2025
Best NFT discord servers

[Updated] Top 13 NFT Discord Servers (Groups) to Join In 2025 with Channel Name

April 22, 2025
AI Courses on edx

Best edX AI Courses and Certifications in 2024 (FREE and Paid)

August 27, 2024
Perplexity Campus Strategist Program 2024

Perplexity Campus Strategist Program 2024: How to Apply and Key Benefits

Gaurav Chaudhary Net Worth

Gaurav Chaudhary Net Worth – Technical Guruji, Indian YouTuber

Best AI Development Platforms and Tools in 2026

All About Canva Tools & Features

How to Use Canva AI Tools and Features to Enhance Your Posts and Designs?

Best AI Model for Every Task: Image, Video, PPT and More

June 17, 2026
Agentic-AI

What is Agentic AI? Check How it Works with Real-Life Agentic AI Automation Examples

June 14, 2026
Free Online Vocal Remover AI Tools

13 Best Free Online Vocal Remover AI Tools in 2026

January 4, 2026
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2026: Maximize APY with Secure and Trusted Crypto Tools

January 4, 2026

Recent News

Best AI Model for Every Task: Image, Video, PPT and More

June 17, 2026
Agentic-AI

What is Agentic AI? Check How it Works with Real-Life Agentic AI Automation Examples

June 14, 2026
Free Online Vocal Remover AI Tools

13 Best Free Online Vocal Remover AI Tools in 2026

January 4, 2026
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2026: Maximize APY with Secure and Trusted Crypto Tools

January 4, 2026

Trending in AI

  • Perplexity CEO Net Worth
  • Grammarly AI Detection
  • What is LangChain
  • Canva AI Tool
  • Koupon AI
Tech Chilli

Tech Chilli is a beacon of knowledge, a relentless purveyor of the latest information, news, and groundbreaking research in the realm of cutting-edge technology.

We are dedicated to curating and delivering the most relevant, accurate, and up-to-the-minute information on the technologies that are shaping our world.
Contact us – su*****@********li.com

Follow Us

Browse by Category

  • AI
  • AI India
  • AI Tools
  • Courses
  • Crypto
  • Featured
  • FinTech
  • Gaming
  • How-To
  • News
  • Puzzles
  • Robotics

Top Searches

  • Scott Wu Net Worth
  • Mira Murati Net Worth
  • Online Games for Couples
  • Amazon Q vs Microsoft Copilot
  • DarkGPT

Recent News

Best AI Model for Every Task: Image, Video, PPT and More

June 17, 2026
Agentic-AI

What is Agentic AI? Check How it Works with Real-Life Agentic AI Automation Examples

June 14, 2026
Free Online Vocal Remover AI Tools

13 Best Free Online Vocal Remover AI Tools in 2026

January 4, 2026
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2026: Maximize APY with Secure and Trusted Crypto Tools

January 4, 2026
  • About Us
  • Privacy Policy
  • Disclaimers
  • Terms and Conditions
  • Contact Us
  • DMCA Policy

© 2025 Tech Chilli

No Result
View All Result
  • AI
  • AI India
  • Robotics
  • Fintech
  • Crypto
  • Courses
  • How-To
  • Gaming
  • Contact Us

© 2025 Tech Chilli

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.