CriticGPT is a tool to find error in AI output. This new model critiques are preferred by trainers over ChatGPT critiques in 63% of cases on naturally occurring bugs, in part because the new critic produces fewer “nitpicks” (small complaints that are unhelpful) and hallucinates problems less often.
All About CriticGPT
OpenAI recently introduced CriticGPT to find GPT-4’s mistakes. As per the official blog, CriticGPT is a step towards evaluating outputs from advanced AI systems that can be difficult for people to rate without better tools. This GPT-4 series of models, which powers ChatGPT, is aligned to be helpful and interactive through “Reinforcement Learning from Human Feedback” (RLHF).
Now, read this article to learn and understand how CriticGPT works and what are its current limitations.
CriticGPT is a model based on GPT-4, which writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF. CriticGPT helps trainers write more comprehensive critiques than they do without help, while producing fewer hallucinations than critiques from the model alone. According to the OpenAI blog, “ As we make advances in reasoning and model behaviour, ChatGPT becomes more accurate and its mistakes become more subtle. This can make it hard for AI trainers to spot inaccuracies when they do occur, making the comparison task that powers RLHF much harder. This is a fundamental limitation of RLHF, and it may make it increasingly difficult to align models as they gradually become more knowledgeable than any person that could provide feedback.”
So, this is now when CriticGPT enters the picture. CriticGPT is trained to write critiques that highlight inaccuracies in ChatGPT answers. For Example
Global IndiaAI Summit 2024: Date, Place, Speakers and Discussion Pointers
OpenAI LLM critics are auto-regressive Transformer policies similar to InstructGPT and ChatGPT. They are trained or prompted to accept a (question, answer) pair as input. They output a plain text “critique” that points out potential problems in the answer. The critiques output by the model follow a particular format by attaching comments to quotes from the answer, but each critique can contain multiple such quotes with comments about each problem.
CriticGPT’s suggestions are not always correct, but we find that they can help trainers catch many more problems with model-written answers than they would without AI help. Various limitations of CriticGPT as per OpenAI are:
To align AI systems that are increasingly complex, we’ll need better tools. CriticGPT is just the first step, and applying RLHF to GPT-4 has the promise to help humans produce better RLHF data for GPT-4. Hence, OpenAI plans to scale this work further and put it into practice.
This post was last modified on June 28, 2024 1:16 pm
Google is launching The Android Show: I/O Edition, featuring Android ecosystem president Sameer Samat, to…
The top 11 generative AI companies in the world are listed below. These companies have…
Google has integrated Veo 2 video generation into the Gemini app for Advanced subscribers, enabling…
Perplexity's iOS app now makes its conversational AI voice assistant compatible with Apple devices, enabling…
Bhavish Aggarwal is in talks to raise $300 million for his AI company, Krutrim AI…
The Beijing Humanoid Robot Innovation Center won the Yizhuang Half-Marathon with the "Tiangong Ultra," a…