Tech Chilli

Sarvam AI OpenHathi: First Hindi Large Language Model

Sarvam AI launches OpenHathi-Hi-v0.1, the first Hindi large language model, rivalling GPT-3.5's prowess for Indic languages. Their strategic approach and collaborations signal a promising frontier in AI innovation.

By Tech Chilli Desk
Thursday, 14 December 2023, 15:36
in AI, News

Indian AI startup Sarvam AI has released OpenHathi-Hi-v0.1, the first Hindi large language model (LLM) in its OpenHathi series. Built on Meta AI’s Llama2-7B architecture and tailored for Indian languages, the model is positioned to perform on par with GPT-3.5 on Hindi tasks.

Built on a version of Llama2-7B’s tokenizer extended to a 48,000-token vocabulary, OpenHathi-Hi-v0.1 is trained in two phases. The first phase, embedding alignment, aligns the randomly initialised Hindi embeddings. The second phase, bilingual language modelling, trains the model to attend cross-lingually across tokens.
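Why an alignment phase is needed can be seen in a minimal sketch of vocabulary extension. The function name `extend_embeddings`, the toy sizes, and the Gaussian initialisation scale are assumptions made for illustration; they are not taken from Sarvam AI's training code. The sketch only shows that tokens added to the vocabulary start with random vectors that carry no meaning until they are trained.

```python
import random
from typing import List, Tuple


def extend_embeddings(
    base: List[List[float]], n_new: int, seed: int = 0, scale: float = 0.02
) -> Tuple[List[List[float]], List[bool]]:
    """Append n_new randomly initialised rows to an embedding table.

    Returns the extended table and a flag per row marking the new tokens.
    Hypothetical helper: the initialisation scheme is an assumption.
    """
    rng = random.Random(seed)
    dim = len(base[0])
    # New rows are drawn from a small Gaussian, so they start unaligned
    # with the existing (already trained) rows.
    new_rows = [[rng.gauss(0.0, scale) for _ in range(dim)] for _ in range(n_new)]
    table = [row[:] for row in base] + new_rows
    is_new = [False] * len(base) + [True] * n_new
    return table, is_new


# Toy example: 4 existing tokens, 2 added Hindi tokens, 3-dimensional vectors.
BASE_TABLE = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6], [0.7, 0.8, 0.9], [1.0, 1.1, 1.2]]
TABLE, IS_NEW = extend_embeddings(BASE_TABLE, n_new=2)
```

The original rows are copied unchanged, and only the rows flagged in `IS_NEW` begin from random values. Those are the embeddings the first training phase has to align.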

Sarvam AI claims that OpenHathi-Hi-v0.1 performs on par with, if not better than, GPT-3.5 across various Hindi tasks while retaining its proficiency in English. The release is a significant milestone for the startup and shows its ability to build language models tailored to specific languages.

Beyond standard Natural Language Generation (NLG) tasks, Sarvam AI also evaluated OpenHathi-Hi-v0.1 on real-world scenarios, underscoring the company’s focus on practical use and the model’s versatility.

In a notable collaboration, Sarvam AI worked with KissanAI to fine-tune its base model on conversational data gathered from a GPT-powered bot that engages with farmers in different languages. The partnership shows how the startup is refining OpenHathi-Hi-v0.1 through real-world interactions to make it more adaptable and effective across languages.

The startup, just five months old, has quickly gained recognition and support in the AI landscape. It recently secured $41 million in a funding round led by Lightspeed Ventures, with participation from Peak XV Partners and Khosla Ventures.

To strengthen OpenHathi-Hi-v0.1’s Hindi capabilities, Sarvam AI outlines steps such as lowering its tokenizer’s fertility score (the average number of tokens produced per word) on Hindi text to improve efficiency. The company details how it trained a SentencePiece tokenizer on a subsample of 100K documents from the Sangraha corpus, in collaboration with AI4Bharat, resulting in a new tokenizer with a 48K vocabulary.
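The fertility score mentioned above can be illustrated with a small sketch. It measures the average number of tokens per whitespace-separated word. Both tokenizers below are toy stand-ins invented for this example (a per-character splitter and a greedy longest-match tokenizer over a two-word vocabulary). They are not the Llama2 or OpenHathi tokenizers, and the sample text is arbitrary.

```python
from typing import Callable, Iterable, List, Set


def fertility(tokenize: Callable[[str], List[str]], texts: Iterable[str]) -> float:
    """Average number of tokens produced per whitespace-separated word."""
    total_tokens = 0
    total_words = 0
    for text in texts:
        total_words += len(text.split())
        total_tokens += len(tokenize(text))
    if total_words == 0:
        raise ValueError("no words to measure")
    return total_tokens / total_words


def char_tokenizer(text: str) -> List[str]:
    """Toy tokenizer with no Hindi vocabulary: one token per character."""
    return [ch for ch in text if not ch.isspace()]


def make_vocab_tokenizer(vocab: Set[str]) -> Callable[[str], List[str]]:
    """Toy greedy longest-match tokenizer; falls back to single characters."""
    def tokenize(text: str) -> List[str]:
        tokens: List[str] = []
        for word in text.split():
            i = 0
            while i < len(word):
                # Try the longest piece first; a single character always matches.
                for j in range(len(word), i, -1):
                    if word[i:j] in vocab or j == i + 1:
                        tokens.append(word[i:j])
                        i = j
                        break
        return tokens
    return tokenize


SAMPLE = ["नमस्ते दुनिया"]          # arbitrary two-word Hindi sample
HINDI_VOCAB = {"नमस्ते", "दुनिया"}  # hypothetical added vocabulary entries

BASELINE_FERTILITY = fertility(char_tokenizer, SAMPLE)
EXTENDED_FERTILITY = fertility(make_vocab_tokenizer(HINDI_VOCAB), SAMPLE)
```

With whole Hindi words in the vocabulary, each word maps to a single token, so `EXTENDED_FERTILITY` is lower than `BASELINE_FERTILITY`. Fewer tokens per word means more text fits in the same context window and each sentence needs fewer generation steps.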

With its focus on linguistic diversity and practical applications, and with strategic partnerships behind OpenHathi-Hi-v0.1, Sarvam AI is positioning itself as a key player in large language models for Hindi and other Indian languages. The release of OpenHathi-Hi-v0.1 sets a promising trajectory for that work.
