Sarvam AI OpenHathi: First Hindi Large Language Model

Sarvam AI launches OpenHathi-Hi-v0.1, the first Hindi large language model, rivalling GPT-3.5's prowess for Indic languages. Their strategic approach and collaborations signal a promising frontier in AI innovation.

Indian AI startup Sarvam AI has released OpenHathi-Hi-v0.1, the first Hindi large language model (LLM) in the OpenHathi series. Built on Meta AI’s Llama2-7B architecture, the model is tailored for Indian languages and, according to the company, performs on par with GPT-3.5.

Built on a 48,000-token extension of Llama2-7B’s tokenizer, OpenHathi-Hi-v0.1 is trained in two phases. The first phase, embedding alignment, aligns the randomly initialised Hindi embeddings. The second phase, bilingual language modelling, trains the model to attend cross-lingually across tokens.
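To make the tokenizer extension concrete, here is a minimal sketch in plain Python of how a base vocabulary might be extended with new tokens while the embedding table gains randomly initialised rows for them. This is an illustration, not Sarvam AI's code: the function names, the toy vocabulary, the embedding size and the idea that only the new rows are updated during alignment are all assumptions made for the example.

```python
import random

def extend_vocab(base_vocab, new_tokens):
    """Append tokens that are not already present; existing ids stay unchanged."""
    merged = dict(base_vocab)
    for tok in new_tokens:
        if tok not in merged:
            merged[tok] = len(merged)
    return merged

def extend_embeddings(base_emb, new_size, dim, seed=0, scale=0.02):
    """Copy the base rows and add small random rows for the new token ids."""
    rng = random.Random(seed)
    extended = [row[:] for row in base_emb]
    while len(extended) < new_size:
        extended.append([rng.gauss(0.0, scale) for _ in range(dim)])
    return extended

# Toy stand-ins (hypothetical): a 3-token base vocabulary with 2-dim embeddings.
base_vocab = {"<unk>": 0, "the": 1, "model": 2}
base_emb = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]

# "the" is already in the base vocabulary, so only two tokens are added.
hindi_tokens = ["भाषा", "मॉडल", "the"]
vocab = extend_vocab(base_vocab, hindi_tokens)
emb = extend_embeddings(base_emb, len(vocab), dim=2)

# Assumption for illustration: an alignment phase would update only these rows.
trainable_rows = list(range(len(base_vocab), len(vocab)))
```

In this sketch the original token ids and embedding rows are left untouched, so the base model's English behaviour is a reasonable starting point, and only the new Hindi rows begin from random values.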

Sarvam AI claims that OpenHathi-Hi-v0.1 performs comparably to, if not better than, GPT-3.5 across various Hindi tasks while remaining proficient in English. The release is a milestone for the startup and demonstrates its ability to build language models tailored to specific languages.

Beyond standard Natural Language Generation (NLG) tasks, Sarvam AI also evaluated OpenHathi-Hi-v0.1 in real-world scenarios, underscoring the company’s focus on practical applications.

In a notable collaboration, Sarvam AI joined forces with KissanAI to refine its base model using conversational data gathered from a GPT-powered bot engaging with farmers in different languages. This strategic partnership demonstrates the startup’s dedication to refining and enhancing OpenHathi-Hi-v0.1 through real-world interactions, contributing to its adaptability and effectiveness in dynamic linguistic environments.

The startup, a mere five months old, has rapidly gained recognition and support in the AI landscape. Securing $41 million in a recent funding round led by Lightspeed Ventures, with contributions from Peak XV Partners and Khosla Ventures, Sarvam AI is positioned for continued growth and innovation.

To enhance OpenHathi-Hi-v0.1’s Hindi capabilities, Sarvam AI outlines steps such as reducing its tokenizer’s fertility score (the average number of tokens produced per word) on Hindi text to improve efficiency. The company describes training a SentencePiece tokenizer on a subsample of 100K documents from the Sangraha corpus, in collaboration with AI4Bharat, resulting in a new tokenizer with a 48K vocabulary.
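Fertility is usually measured as the average number of tokens a tokenizer produces per word, so a lower score means shorter sequences for the same text. The sketch below shows the calculation with two toy tokenizers; both tokenizers, the two-word sample and every name in the code are hypothetical stand-ins, not the Llama2 or OpenHathi tokenizers.

```python
def fertility(tokenize, texts):
    """Average number of tokens per whitespace-separated word."""
    total_tokens = 0
    total_words = 0
    for text in texts:
        for word in text.split():
            total_tokens += len(tokenize(word))
            total_words += 1
    return total_tokens / total_words if total_words else 0.0

def char_tokenize(word):
    """Stand-in for a tokenizer with no Hindi vocabulary: one token per code point."""
    return list(word)

def make_greedy_tokenizer(vocab):
    """Stand-in for an extended tokenizer: greedy longest match over `vocab`,
    falling back to a single code point when nothing matches."""
    def tokenize(word):
        pieces, i = [], 0
        while i < len(word):
            for j in range(len(word), i, -1):
                if word[i:j] in vocab or j == i + 1:
                    pieces.append(word[i:j])
                    i = j
                    break
        return pieces
    return tokenize

# Hypothetical sample: two Hindi words that the extended vocabulary covers whole.
words = ["भाषा", "मॉडल"]
texts = [" ".join(words)]

base_fertility = fertility(char_tokenize, texts)
extended_fertility = fertility(make_greedy_tokenizer(set(words)), texts)
```

Here the extended tokenizer emits one token per word, while the character-level stand-in emits several, which is the kind of gap a larger Hindi vocabulary is meant to close.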

Sarvam AI’s commitment to linguistic diversity and practical applications, coupled with the strategic partnerships and cutting-edge technology underpinning OpenHathi-Hi-v0.1, positions the startup as a key player in advancing the landscape of large language models, particularly tailored for the nuances of Hindi and other Indian languages. As Sarvam AI continues to evolve, the unveiling of OpenHathi-Hi-v0.1 sets a promising trajectory for the future of AI-driven linguistic innovation.

This post was last modified on December 14, 2023 3:36 pm

Tech Chilli Desk

Tech Chilli News Desk is a conglomeration of Tech enthusiasts who are committed to delving deep into the evolving new-age technology of Web 3.0, Artificial Intelligence (AI), Robotics, Fintech, Crypto and more. This desk brings the latest information on Digital Transformation through use cases, implementations, coverage, case studies, reporting and deep analysis.
