Tech Chilli


Google DeepMind Reveals a Visual Processing Architecture to Lower Processing Expenses

Google DeepMind researchers have developed the Mixture of Nested Experts (MoNE) framework, which significantly lowers processing costs for images and videos without compromising accuracy. MoNE allocates compute across visual tokens dynamically, so that larger models handle the more important tokens and smaller nested models handle the less important ones.

by Kumud Sahni Pruthi
Thursday, 1 August 2024, 23:08 PM
in News

The Mixture of Nested Experts (MoNE), a new framework developed by a group of Google DeepMind researchers, substantially lowers processing costs for images and videos without compromising accuracy. The method dynamically allocates compute across visual tokens according to their importance.

Go here to read the entire paper.

Compared to baseline models, MoNE reduces inference-time compute by more than two-fold while retaining equal performance on common image and video datasets. On the Something-Something-v2 video dataset, MoNE achieved 64.4% accuracy using just 162.8 GFLOPs, as opposed to the baseline's 376.3 GFLOPs.
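As a quick check on the figures quoted above, the two GFLOPs values can be compared directly. This is only arithmetic on the numbers reported in the article; the variable names are illustrative.

```python
# GFLOPs figures as quoted in the article for Something-Something-v2.
baseline_gflops = 376.3  # baseline model
mone_gflops = 162.8      # MoNE at 64.4% accuracy

# How many times less compute MoNE uses than the baseline.
reduction_factor = baseline_gflops / mone_gflops

# Share of the baseline compute that is saved.
fraction_saved = 1.0 - mone_gflops / baseline_gflops

print(f"reduction factor: {reduction_factor:.2f}x")
print(f"compute saved: {fraction_saved:.1%}")
```

The ratio comes out at roughly 2.3x, or a little under 57% of the baseline compute saved on this benchmark.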


The framework is built on the idea of nested models, in which smaller sub-models are contained within bigger ones. MoNE uses a router network to dynamically assign visual tokens to these nested experts of different sizes. This lets larger, more computationally expensive models handle the more important or informative tokens, while smaller nested models handle the less important ones.
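The routing idea described above can be illustrated with a small sketch. This is a simplified, rank-based assignment written for this article, not the paper's actual routing algorithm: the function name `route_tokens`, the per-token scores, and the capacity fractions are all hypothetical, and a real router would learn its scores during training.

```python
from typing import List, Sequence


def route_tokens(scores: Sequence[float], capacities: Sequence[float]) -> List[int]:
    """Assign each token to a nested expert by importance rank.

    scores: one hypothetical router score per token (higher = more important).
    capacities: fraction of tokens each expert may take, ordered from the
        largest expert (index 0) to the smallest; must sum to 1.
    Returns the expert index chosen for each token.
    """
    if abs(sum(capacities) - 1.0) > 1e-9:
        raise ValueError("capacities must sum to 1")
    n = len(scores)
    # Token indices sorted from most to least important.
    order = sorted(range(n), key=lambda i: scores[i], reverse=True)
    # Every token defaults to the smallest expert.
    assignment = [len(capacities) - 1] * n
    start = 0
    # Fill the larger experts first, each up to its capacity.
    for expert, frac in enumerate(capacities[:-1]):
        take = round(frac * n)
        for i in order[start:start + take]:
            assignment[i] = expert
        start += take
    return assignment


# Eight tokens with made-up importance scores and three nested experts.
scores = [0.9, 0.1, 0.5, 0.3, 0.8, 0.2, 0.7, 0.4]
assignment = route_tokens(scores, capacities=[0.25, 0.25, 0.5])
print(assignment)  # [0, 2, 1, 2, 0, 2, 1, 2]
```

Here the two highest-scoring tokens go to the largest expert (index 0), the next two to the middle expert, and the remaining half to the smallest one.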

The team evaluated MoNE on the Kinetics-400 and Something-Something-v2 datasets for video classification and on the ImageNet-21k dataset for image classification. They found that MoNE consistently performed better than baseline models and alternative methods such as Mixture of Depths, particularly at smaller computational budgets.

One of MoNE's main advantages is that a single trained model can adapt to various inference-time compute budgets. This adaptability lets the framework accommodate different compute constraints without retraining.
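To make the budget idea concrete, here is a minimal sketch of how a compute budget could translate into a token split between two nested experts. It is an illustration under stated assumptions, not the paper's procedure: the helper names and the relative cost of 0.25 for the smaller expert are hypothetical.

```python
from typing import List, Sequence


def effective_cost(capacities: Sequence[float], relative_costs: Sequence[float]) -> float:
    """Average per-token cost, as a fraction of the full model's cost."""
    return sum(c * r for c, r in zip(capacities, relative_costs))


def two_expert_split(budget: float, small_cost: float) -> List[float]:
    """Token fractions [full expert, small expert] whose average cost equals budget.

    budget and small_cost are fractions of the full expert's per-token cost.
    """
    if not 0.0 < small_cost < 1.0:
        raise ValueError("small_cost must be between 0 and 1")
    if not small_cost <= budget <= 1.0:
        raise ValueError("budget must lie between small_cost and 1")
    # Solve p * 1 + (1 - p) * small_cost = budget for p.
    p_full = (budget - small_cost) / (1.0 - small_cost)
    return [p_full, 1.0 - p_full]


# The same model served at three different budgets, with no retraining.
for budget in (1.0, 0.5, 0.25):
    split = two_expert_split(budget, small_cost=0.25)
    print(budget, split, effective_cost(split, [1.0, 0.25]))
```

Lowering the budget only changes how many tokens are sent to the full expert; the model weights stay the same, which is the property the article describes.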


Visualizations show that MoNE successfully identified the important regions in images and videos and routed tokens from these regions to the larger nested models. This illustrates how the framework can concentrate compute on the most informative portions of visual inputs.

The researchers point out that MoNE was designed for encoder architectures and that extending it to autoregressive decoding in large language models remains difficult. They also draw attention to the possible societal benefits of MoNE, such as lower energy consumption and carbon emissions during model inference, and the democratization of access to AI through wider use of trained models without the need for significant computational resources.


As deep learning models get bigger and more complicated, methods like MoNE can drastically lower computational costs. Preserving performance in resource-constrained settings is also likely to become increasingly important for real-world AI systems.

Google DeepMind continues to release new AI frameworks and models. Last month it released the MatFormer framework, which improves on-device capabilities by enabling users to mix and match AI models within a single framework to optimize performance for particular tasks. It also introduced Foundational Large Autorater Models (FLAMe) for a variety of quality assessment tasks.

Kumud Sahni Pruthi

A postgraduate in Science with an inclination towards education and technology. She always looks for ways to help people improve their lives by putting complex things into simple words through her writing.

© 2024 Tech Chilli
