Cognitive Lab Presents Devanagari Text Tokenizer Arena

Cognitive Lab has launched Tokenizer Arena on Hugging Face. Built with Transformers.js, it lets users compare several tokenizers side by side. The arena includes tokenizers for models such as Gemma, Mistral, Grok-1, GPT-3, GPT-4, Claude, Phi-3, and Command R.

Tokenization is one of the most discussed topics in the development of Indic language models, because the way text is split into tokens varies greatly from one model's tokenizer to another. Cognitive Lab introduced Tokenizer Arena on Hugging Face to make those differences easy to inspect.

The arena, which is built on the Transformers.js package, allows users to compare many tokenizers at once.

Adithya SK, the founder and CEO of Cognitive Lab, announced Tokenizer Arena in a post on X, formerly Twitter.

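This is useful for developers who build Indic LLMs on top of open-source models and need to see how those models' tokenizers handle Devanagari text. Devanagari is a script, used for Hindi and Marathi among other languages, that differs greatly from the Latin script used for English.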
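As a rough illustration of why side-by-side comparison matters for Devanagari, the sketch below runs the same text through three toy tokenizers and reports the token count for each. These toy tokenizers (`whitespace_tokens`, `codepoint_tokens`, `utf8_byte_tokens`) and the `compare` helper are assumptions made for this example; they are not the arena's code and not the tokenizer of any model named above. The byte-level variant is included because each Devanagari code point takes three bytes in UTF-8, so a tokenizer that falls back to raw bytes tends to spend far more tokens on Hindi than on English text of similar length.

```python
from __future__ import annotations

# Toy stand-ins for real tokenizers. These are assumptions for illustration
# only; none of them is the tokenizer of a model listed in the article.

def whitespace_tokens(text: str) -> list[str]:
    """Split on whitespace."""
    return text.split()

def codepoint_tokens(text: str) -> list[str]:
    """One token per Unicode code point."""
    return list(text)

def utf8_byte_tokens(text: str) -> list[int]:
    """One token per UTF-8 byte, as in a pure byte-level fallback."""
    return list(text.encode("utf-8"))

TOKENIZERS = {
    "whitespace": whitespace_tokens,
    "code points": codepoint_tokens,
    "utf-8 bytes": utf8_byte_tokens,
}

def compare(text: str) -> dict[str, int]:
    """Return the number of tokens each toy tokenizer produces for `text`."""
    return {name: len(tokenize(text)) for name, tokenize in TOKENIZERS.items()}

if __name__ == "__main__":
    # An English greeting and its Hindi counterpart ("namaste duniya").
    for sample in ("hello world", "नमस्ते दुनिया"):
        print(sample, compare(sample))
```

Both samples contain two words, but the Hindi sample's 13 code points become 37 bytes while the English sample's 11 code points stay at 11 bytes. Real subword tokenizers fall somewhere between these extremes depending on how much Devanagari their vocabulary covers, which is the kind of gap a comparison tool makes visible.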

Tokenizer Arena is available on Hugging Face.

Cognitive Lab's founder, Adithya S. Kolavi, recently developed the Indic LLM Leaderboard to track the growing number of Indic LLMs emerging in India. The team also released Ambari, the first bilingual (Kannada-English) model built on top of Llama 2.

The Indic LLM Leaderboard serves as an evaluation tool for seven Indic languages: Hindi, Kannada, Tamil, Telugu, Malayalam, Marathi, and Gujarati. It is hosted on Hugging Face and currently supports four Indic benchmarks, with plans to add more.

Adithya Kolavi, the founder and CEO of Cognitive Lab, has drawn wide attention in the AI community, most recently for the Indic LLM Leaderboard.

This post was last modified on May 10, 2024 8:01 am

Kumud Sahni Pruthi

A postgraduate in Science with an inclination towards education and technology. She always looks for ways to help people improve their lives by putting complex things into simple words through her writing.
