Tech Chilli


What is Llama 3.1? New MetaAI LLM Model Performance, Benchmarks, Price and Other Details

Llama 3.1 is the latest version of Meta's large language model (LLM), offering improved benchmark performance over its predecessors. This article covers its performance, benchmarks, pricing, and accessibility.

by Raya
Wednesday, 24 July 2024, 2:32 AM
in AI

What is Llama 3.1?

A few months back, Facebook’s parent company Meta hinted that it was working on an open-source large language model that would outperform proprietary models. On July 23, 2024, it released the model.

Meta’s recently released Llama 3.1 405B is, according to the company, the largest openly available artificial intelligence (AI) model to date. Announcing the release in a blog post, Meta said, “We’re publicly releasing Meta Llama 3.1 405B, which we believe is the world’s largest and most capable openly available foundation model.”

According to Meta, Llama 3.1 405B is the first openly available model that can compete with the best AI models in general knowledge, steerability, math, tool use, and multilingual translation. Meta is also releasing upgraded versions of the 8B and 70B models. These are multilingual, have a longer context length of 128K tokens, and offer stronger overall reasoning ability.

Meta’s CEO Mark Zuckerberg expects that, starting next year, the Llama models will be the most advanced in the industry.


Key Features

These are the key features of the newly released Llama 3.1 models: 

  1. Context length: Llama 3.1 models have an expanded context length of 128K tokens, which allows for longer and more complex interactions.
  2. Multilingual support: The models support eight languages.
  3. Llama 3.1 405B: Meta’s flagship model is, by Meta’s account, the first open-source AI model of its caliber, with flexibility, control, and capabilities that rival the best closed-source models.

Performance and Benchmarks

Llama 3.1 uses a standard decoder-only transformer architecture with minor modifications. The 405B-parameter model was trained on over 15 trillion tokens using more than 16 thousand H100 GPUs, making it the first Llama model trained at this scale.
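
For a rough sense of that scale, a common rule of thumb estimates the training compute of a dense transformer as about 6 × parameters × tokens floating-point operations. The sketch below applies it to the figures quoted above. The 6 × N × D approximation and the rounded token count of 15 trillion are assumptions for illustration, not numbers reported in this article.

```python
# Back-of-the-envelope training compute using the 6 * N * D rule of thumb.
# N = parameter count, D = training tokens (both rounded from the text above).
# This is an approximation, not a figure published by Meta.

def approx_training_flops(params: float, tokens: float) -> float:
    """Estimate total training FLOPs for a dense transformer (assumed 6*N*D rule)."""
    return 6.0 * params * tokens

llama_405b_flops = approx_training_flops(params=405e9, tokens=15e12)
print(f"Approximate training compute: {llama_405b_flops:.2e} FLOPs")
```

The result is on the order of 10^25 operations, which helps explain why a cluster of thousands of GPUs was needed.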

Meta evaluated the Llama 3.1 models on more than 150 benchmark datasets spanning a variety of languages. It also conducted human evaluations comparing Llama 3.1 with competing models in real-world scenarios.

Meta claims the results show that Llama 3.1 405B can compete with leading foundation models, including GPT-4, GPT-4o, and Claude 3.5 Sonnet, across a range of tasks. Let’s take a detailed look:

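| Category | Benchmark | Llama 3.1 405B | Nemotron 4 340B Instruct | GPT-4 (0125) | GPT-4 Omni | Claude 3.5 Sonnet |
|---|---|---|---|---|---|---|
| General | MMLU (0-shot, CoT) | 88.6 | 78.7 (non-CoT) | 85.4 | 88.7 | 88.3 |
| General | MMLU PRO (5-shot, CoT) | 73.3 | 62.7 | 64.8 | 74.0 | 77.0 |
| General | IFEval | 88.6 | 85.1 | 84.3 | 85.6 | 88.0 |
| Code | HumanEval (0-shot) | 89.0 | 73.2 | 86.6 | 90.2 | 92.0 |
| Code | MBPP EvalPlus (base) (0-shot) | 88.6 | 72.8 | 83.6 | 87.8 | 90.5 |
| Math | GSM8K (8-shot, CoT) | 96.8 | 92.3 (0-shot) | 94.2 | 96.1 | 96.4 (0-shot) |
| Math | MATH (0-shot, CoT) | 73.8 | 41.1 | 64.5 | 76.6 | 71.1 |
| Reasoning | ARC Challenge (0-shot) | 96.9 | 94.6 | 96.4 | 96.7 | 96.7 |
| Reasoning | GPQA (0-shot, CoT) | 51.1 | – | 41.4 | 53.6 | 59.4 |
| Tool use | BFCL | 88.5 | 86.5 | 88.3 | 80.5 | 90.2 |
| Tool use | Nexus | 58.7 | – | 50.3 | 56.1 | 45.7 |
| Long context | ZeroSCROLLS/QuALITY | 95.2 | – | – | 95.2 | 90.5 |
| Long context | InfiniteBench/En.MC | 83.4 | – | 72.1 | 82.5 | – |
| Long context | NIH/Multi-needle | 98.1 | – | 100.0 | 100.0 | 90.8 |
| Multilingual | Multilingual MGSM | 91.6 | – | 85.9 | 90.5 | 91.6 |

| Category | Benchmark | Llama 3.1 8B | Gemma 2 9B IT | Mistral 7B Instruct | Llama 3.1 70B | Mixtral 8x22B Instruct | GPT 3.5 Turbo |
|---|---|---|---|---|---|---|---|
| General | MMLU (0-shot, CoT) | 73.0 | 72.3 (5-shot, non-CoT) | 60.5 | 86.0 | 79.9 | 69.8 |
| General | MMLU PRO (5-shot, CoT) | 48.3 | – | 36.9 | 66.4 | 56.3 | 49.2 |
| General | IFEval | 80.4 | 73.6 | 57.6 | 87.5 | 72.7 | 69.9 |
| Code | HumanEval (0-shot) | 72.6 | 54.3 | 40.2 | 80.5 | 75.6 | 68.0 |
| Code | MBPP EvalPlus (base) (0-shot) | 72.8 | 71.7 | 49.5 | 86.0 | 78.6 | 82.0 |
| Math | GSM8K (8-shot, CoT) | 84.5 | 76.7 | 53.2 | 95.1 | 88.2 | 81.6 |
| Math | MATH (0-shot, CoT) | 51.9 | 44.3 | 13.0 | 68.0 | 54.1 | 43.1 |
| Reasoning | ARC Challenge (0-shot) | 83.4 | 87.6 | 74.2 | 94.8 | 88.7 | 83.7 |
| Reasoning | GPQA (0-shot, CoT) | 32.8 | – | 28.8 | 46.7 | 33.3 | 30.8 |
| Tool use | BFCL | 76.1 | – | 60.4 | 84.8 | – | 85.9 |
| Tool use | Nexus | 38.5 | 30.0 | 24.7 | 56.7 | 48.5 | 37.2 |
| Long context | ZeroSCROLLS/QuALITY | 81.0 | – | – | 90.5 | – | – |
| Long context | InfiniteBench/En.MC | 65.1 | – | – | 78.2 | – | – |
| Long context | NIH/Multi-needle | 98.8 | – | – | 97.5 | – | – |
| Multilingual | Multilingual MGSM (0-shot) | 68.9 | 53.2 | 29.9 | 86.9 | 71.1 | 51.4 |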
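
To make the 405B comparison easier to read, the sketch below tallies how often Llama 3.1 405B has the highest or tied-highest score in the first table. The scores are copied from that table, missing entries are stored as `None`, and the helper name `llama_leads` is purely illustrative. Some competitor scores use different prompting settings (for example 0-shot instead of 8-shot), so the tally is only a rough summary.

```python
# Scores from the first table, in column order:
# (Llama 3.1 405B, Nemotron 4 340B, GPT-4 (0125), GPT-4 Omni, Claude 3.5 Sonnet)
# None marks a score that is not reported in the table.
SCORES = {
    "MMLU": (88.6, 78.7, 85.4, 88.7, 88.3),
    "MMLU PRO": (73.3, 62.7, 64.8, 74.0, 77.0),
    "IFEval": (88.6, 85.1, 84.3, 85.6, 88.0),
    "HumanEval": (89.0, 73.2, 86.6, 90.2, 92.0),
    "MBPP EvalPlus": (88.6, 72.8, 83.6, 87.8, 90.5),
    "GSM8K": (96.8, 92.3, 94.2, 96.1, 96.4),
    "MATH": (73.8, 41.1, 64.5, 76.6, 71.1),
    "ARC Challenge": (96.9, 94.6, 96.4, 96.7, 96.7),
    "GPQA": (51.1, None, 41.4, 53.6, 59.4),
    "BFCL": (88.5, 86.5, 88.3, 80.5, 90.2),
    "Nexus": (58.7, None, 50.3, 56.1, 45.7),
    "ZeroSCROLLS/QuALITY": (95.2, None, None, 95.2, 90.5),
    "InfiniteBench/En.MC": (83.4, None, 72.1, 82.5, None),
    "NIH/Multi-needle": (98.1, None, 100.0, 100.0, 90.8),
    "Multilingual MGSM": (91.6, None, 85.9, 90.5, 91.6),
}

def llama_leads(row):
    """True if the first score (Llama 3.1 405B) is >= every reported competitor score."""
    llama, *others = row
    return all(score is None or llama >= score for score in others)

leading = [name for name, row in SCORES.items() if llama_leads(row)]
print(f"Llama 3.1 405B leads or ties on {len(leading)} of {len(SCORES)} benchmarks:")
for name in leading:
    print(f"  - {name}")
```

On the numbers above, Llama 3.1 405B is at or tied for the top on roughly half of the listed benchmarks, which fits Meta's framing of the model as competitive with, rather than uniformly ahead of, the closed models.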

Human Evaluation

Meta also ran human evaluations of its flagship Llama 3.1 405B against GPT-4-0125-Preview, GPT-4o, and Claude 3.5 Sonnet. Llama 3.1 405B won 23.3% of comparisons against GPT-4-0125-Preview, 19.1% against GPT-4o, and 24.9% against Claude 3.5 Sonnet, with ties in 52.2%, 51.7%, and 50.8% of comparisons, respectively. In other words, about half of all comparisons ended in a tie.
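
The win and tie rates also imply how often Llama 3.1 405B lost. The sketch below works that out, assuming every comparison ends in a win, a tie, or a loss so that the three rates sum to 100%. That assumption is not stated in the text, and the function name is illustrative.

```python
# Win and tie percentages for Llama 3.1 405B, as quoted above.
RESULTS = {
    "GPT-4-0125-Preview": {"win": 23.3, "tie": 52.2},
    "GPT-4o": {"win": 19.1, "tie": 51.7},
    "Claude 3.5 Sonnet": {"win": 24.9, "tie": 50.8},
}

def implied_loss(win: float, tie: float) -> float:
    """Loss rate implied by win and tie rates, assuming win + tie + loss = 100%."""
    return round(100.0 - win - tie, 1)

losses = {model: implied_loss(r["win"], r["tie"]) for model, r in RESULTS.items()}

for model, loss in losses.items():
    win = RESULTS[model]["win"]
    print(f"vs {model}: win {win}%, implied loss {loss}%")
```

Under that assumption, wins and losses are close to even against GPT-4-0125-Preview and Claude 3.5 Sonnet, while GPT-4o is preferred noticeably more often than Llama 3.1 405B.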

Pricing 

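The table below shows public Llama 3.1 inference API pricing, in US dollars per million tokens:

| Provider | 8B Input | 8B Output | 70B Input | 70B Output | 405B Input | 405B Output |
|---|---|---|---|---|---|---|
| AWS | $0.30 | $0.60 | $2.65 | $3.50 | – | – |
| Azure | $0.30 | $0.61 | $2.68 | $3.54 | $5.33 | $16.00 |
| Databricks | – | – | $1.00 | $3.00 | $10.00 | $30.00 |
| Fireworks.ai | $0.20 | $0.20 | $0.90 | $0.90 | $3.00 | $3.00 |
| IBM | $0.60 | $0.60 | $1.80 | $1.80 | $35.00 | $35.00 |
| Octo.ai | $0.15 | $0.15 | $0.90 | $0.90 | $3.00 | $9.00 |
| Snowflake | $0.57 | $0.57 | $3.63 | $3.63 | $15.00 | $15.00 |
| Together.AI | $0.18 | $0.18 | $0.88 | $0.88 | $5.00 | $15.00 |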
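
Because input and output tokens are billed separately, the cheapest provider depends on the shape of the workload. The sketch below estimates the cost of a hypothetical job on the 70B model using the prices from the table above. The workload size (2 million input tokens, 0.5 million output tokens) and the helper name are assumptions chosen for illustration, and actual prices may have changed since publication.

```python
# Per-million-token prices in USD for the 70B model, copied from the table above:
# provider -> (input price, output price)
PRICES_70B = {
    "AWS": (2.65, 3.50),
    "Azure": (2.68, 3.54),
    "Databricks": (1.00, 3.00),
    "Fireworks.ai": (0.90, 0.90),
    "IBM": (1.80, 1.80),
    "Octo.ai": (0.90, 0.90),
    "Snowflake": (3.63, 3.63),
    "Together.AI": (0.88, 0.88),
}

def workload_cost(input_tokens: int, output_tokens: int,
                  input_price: float, output_price: float) -> float:
    """Cost in USD, given token counts and per-million-token prices."""
    return (input_tokens / 1_000_000) * input_price + (output_tokens / 1_000_000) * output_price

# Hypothetical workload: 2M input tokens and 0.5M output tokens.
INPUT_TOKENS = 2_000_000
OUTPUT_TOKENS = 500_000

costs = {
    provider: workload_cost(INPUT_TOKENS, OUTPUT_TOKENS, in_price, out_price)
    for provider, (in_price, out_price) in PRICES_70B.items()
}
cheapest = min(costs, key=costs.get)

for provider, cost in sorted(costs.items(), key=lambda item: item[1]):
    print(f"{provider:<13} ${cost:.2f}")
print(f"Cheapest for this workload: {cheapest}")
```

A workload with a different ratio of input to output tokens can change the ranking among providers that price the two differently, such as Databricks.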


Accessibility

Llama 3.1 models are available for download on llama.meta.com and Hugging Face. They are also ready for immediate development on various partner platforms, including AWS, NVIDIA, and Databricks. 

The Bottom Line

The release of the Llama 3.1 models has the tech community buzzing. From Twitter to Reddit, enthusiasts are talking about Meta and its latest open-source LLM. If you would like to try the model, head over to https://llama.meta.com/ and see for yourself.




Raya

Raya is a tech enthusiast diving deep into New-Age technology, especially Artificial Intelligence (AI) and Machine Learning (ML). She is passionate about decoding the complexities and uses of new-age tech. Raya is on a mission to write articles that bridge the gap between technical jargon and everyday understanding, making AI and ML accessible to a wider audience.

© 2024 Tech Chilli
