• About Us
  • Privacy Policy
  • Disclaimers
  • Terms and Conditions
  • Contact Us
  • DMCA Policy
Tech Chilli
  • News
  • AI
  • Fintech
  • Crypto
  • AI India
  • Robotics
  • Courses
  • How-To
  • Puzzles
  • Gaming
  • Contact Us
No Result
View All Result
  • News
  • AI
  • Fintech
  • Crypto
  • AI India
  • Robotics
  • Courses
  • How-To
  • Puzzles
  • Gaming
  • Contact Us
No Result
View All Result
Tech Chilli
No Result
View All Result

Home » News » Stanford Researchers Develop Solution to AI Hallucinations, Enhancing Accuracy

Stanford Researchers Develop Solution to AI Hallucinations, Enhancing Accuracy

Researchers at Stanford University have developed a new method to detect AI hallucinations, improving the reliability of generative AI tools like ChatGPT. This groundbreaking approach offers a significant step toward more accurate AI systems.

tech chilli logo by Tech Chilli Desk
Saturday, 22 June 2024, 0:45 AM
in News
AI Hallucinations Tackled: New Detection Method Increases AI Reliability by 79%

AI Hallucinations Tackled: New Detection Method Increases AI Reliability by 79%

In the innovative world of AI, generative AI tools like ChatGPT have the problem of hallucinating and yielding the wrong output: In the emerging, innovative world of AI, a constant issue is that AI might confidently generate wrong information—a situation known as “hallucination “ Recent AI hallucinations include Air Canada implementing the wrong discount-based on ChatGPT’s output and Google’s AI stating that ingesting rocks is safe.

Sebastian Farquhar, an author of the study, is a senior research fellow and research scientist on Google DeepMind’s safety team “I hope that this opens up ways for large language models to be deployed where they can’t currently be deployed—where a little bit more reliability than is currently available is needed,” says at Oxford University’s department of computer science. 

Though a recent advancement holds prospects for resolving this question, in a work that was published in Nature Scientific magazine, the researchers have discovered a new way of identifying the presence of AI hallucinations. This approach is designed to ask a question, and then analyze both the question and the AI-generated correct answer to determine if the student’s answer is correct or not; It can work with approximately 79 % accuracy, which is higher than current solutions. This may only address one of the causes of the AI hallucinations and the new approach demands more computations but this development may lead to more accurate AI systems in the future.

The research team chose to investigate one form of hallucination known as “confabulations,” which is a phenomenon in which an AI model generates inaccurate and inconsistent responses to well-defined questions. Through such confabulations, the researchers hope to enhance the correctness or fitness of the AI-derived responses.

Procedure

The method used in the study involves creating multiple responses to a specific question by a chatbot, and then using an LM to group the responses into equivalent meanings. To measure the relatedness of the meanings of string vectors, researchers use a concept called semantic entropy. For generating a high semantic entropy score, the model is considered to be confabulating, while a low score means that the answer decided has been consistent, and hence, it’s less probable that it is a hallucination.

Despite these challenges, the development of a method to detect AI hallucinations is a significant step forward in more accurate and reliable AI systems. As AI continues to permeate various aspects of our lives, the importance of ensuring that these tools provide accurate and trustworthy information cannot be overstated. This groundbreaking research brings us one step closer to a future where AI can be relied upon to deliver consistently.

Automated Evaluation Method for Assessing Hallucination in RAG Models

Previous Post

Discover JASCO: Meta FAIR’s Innovative AI Model for High-Quality Text-to-Music Generation

Next Post

Who is Venkat Venkataramani, and why did OpenAI acquire Rockset?

tech chilli logo

Tech Chilli Desk

Tech Chilli News Desk is a conglomeration of Tech enthusiasts who are committed to delving deep into the evolving new-age technology of Web 3.0, Artificial Intelligence (AI), Robotics, Fintech, Crypto and more. This desk brings the latest information on Digital Transformation through use cases, implementations, coverage, case studies, reporting and deep analysis.

Next Post
Venkat Venkatramani

Who is Venkat Venkataramani, and why did OpenAI acquire Rockset?

  • Trending
  • Comments
  • Latest
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2025: Maximize APY with Secure and Trusted Crypto Tools

April 17, 2025
scott wu net worth

Scott Wu Net Worth: Devin AI Software Engineer, CEO of Cognition Labs

April 17, 2025
Artificial Intelligence (AI) Glossary and Terminologies

Artificial Intelligence (AI) Glossary and Terminologies – Complete Cheat Sheet List

April 18, 2025
TurbolearnAI

Turbolearn AI: How to Use It for FREE, Features and Pricing Models

April 3, 2025
What is Blockchain Technology

What is Blockchain Technology And How Does It Work?

Enterprise AI

What is Enterprise AI? Meaning, Companies, Examples and More Details

Cosine Genie AI Software Engineer

What is Cosine Genie and How to Use? Check Benchmark, Functions, and Access Details

PhonePe Leads UPI Market in August 2024, Claims 50% Share by Value and 48% by Volume

PhonePe Partners with Liquid Group to Bring UPI Payments to Singapore for Indian Travelers

Google is moving Android news to a virtual event before I/O

Google is moving Android news to a virtual event before I/O

April 29, 2025
Generative AI Companies

Top Generative AI Companies of the World 2025

April 28, 2025
Veo 2 extends access to more Gemini Advanced Users

Veo 2 extends access to more Gemini Advanced Users

April 25, 2025
Perplexity launches the iPhone voice assistant

Perplexity launches the iPhone voice assistant

April 24, 2025

Recent News

Google is moving Android news to a virtual event before I/O

Google is moving Android news to a virtual event before I/O

April 29, 2025
Generative AI Companies

Top Generative AI Companies of the World 2025

April 28, 2025
Veo 2 extends access to more Gemini Advanced Users

Veo 2 extends access to more Gemini Advanced Users

April 25, 2025
Perplexity launches the iPhone voice assistant

Perplexity launches the iPhone voice assistant

April 24, 2025

Trending in AI

  • Perplexity CEO Net Worth
  • Grammarly AI Detection
  • What is LangChain
  • Canva AI Tool
  • Koupon AI
Tech Chilli

Tech Chilli is a beacon of knowledge, a relentless purveyor of the latest information, news, and groundbreaking research in the realm of cutting-edge technology.

We are dedicated to curating and delivering the most relevant, accurate, and up-to-the-minute information on the technologies that are shaping our world.
Contact us – [email protected]

Follow Us

Browse by Category

  • AI
  • AI India
  • Courses
  • Crypto
  • Featured
  • FinTech
  • Gaming
  • How-To
  • News
  • Puzzles
  • Robotics

Top Searches

  • Scott Wu Net Worth
  • Mira Murati Net Worth
  • Online Games for Couples
  • Amazon Q vs Microsoft Copilot
  • DarkGPT

Recent News

Google is moving Android news to a virtual event before I/O

Google is moving Android news to a virtual event before I/O

April 29, 2025
Generative AI Companies

Top Generative AI Companies of the World 2025

April 28, 2025
Veo 2 extends access to more Gemini Advanced Users

Veo 2 extends access to more Gemini Advanced Users

April 25, 2025
Perplexity launches the iPhone voice assistant

Perplexity launches the iPhone voice assistant

April 24, 2025
  • About Us
  • Privacy Policy
  • Disclaimers
  • Terms and Conditions
  • Contact Us
  • DMCA Policy

© 2024 Tech Chilli

No Result
View All Result
  • News
  • AI
  • Fintech
  • Crypto
  • AI India
  • Robotics
  • Courses
  • How-To
  • Puzzles
  • Gaming
  • Contact Us

© 2024 Tech Chilli

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.OK