• About Us
  • Privacy Policy
  • Disclaimers
  • Terms and Conditions
  • Contact Us
  • DMCA Policy
Tech Chilli
  • News
  • AI
  • Fintech
  • Crypto
  • AI India
  • Robotics
  • Courses
  • How-To
  • Puzzles
  • Gaming
  • Contact Us
No Result
View All Result
  • News
  • AI
  • Fintech
  • Crypto
  • AI India
  • Robotics
  • Courses
  • How-To
  • Puzzles
  • Gaming
  • Contact Us
No Result
View All Result
Tech Chilli
No Result
View All Result

Home » AI » What is OpenAI System Card and How is GPT-4o Following AI Safety Measures?

What is OpenAI System Card and How is GPT-4o Following AI Safety Measures?

OpenAI has released GPT-4o System Card for it latest flagship model GPT-4o. It provides a detailed look into a specific AI model's capabilities, limitations, and most importantly, the safety measures implemented during its development and deployment.

raya-author-image by Raya
Friday, 9 August 2024, 8:35 AM
in AI
OpenAI

OpenAI

AI startup, OpenAI has released GPT-4o System Card, a report that outlines the safety measures that the company carried out before the release of GPT-4o. GPT-4o, where “o” stands for “omni”, was released to the public earlier this year in May. Before releasing a large language model like GPT, it is standard procedure to examine and evaluate the model for any potential risks or safety concerns. These evaluations are typically carried out by a group of red teamers or security experts. 

OpenAI has been battling security and privacy allegations for quite some time now. Earlier in July, it was reported by an anonymous source that OpenAI hurried through their safety tests to meet the launch date. “We basically failed the test,” the source said.  

GPT-4o vs GPT-4o Mini: Check the Key Differences Here

What is OpenAI System Card?

The OpenAI System Card is a report that provides a detailed look into a specific AI model’s capabilities, limitations, and most importantly, the safety measures implemented during its development and deployment. It provides insights into the model’s behavior and the steps taken to mitigate potential risks. 

According to the System Card, OenAI’s latest flagship model GPT-4o was rated as having a “medium” risk. OpenAI conducted a thorough evaluation of GPT-4o’s text, vision, and audio capabilities. There are four categories for risk assessment- cybersecurity, biological threats, persuasion, and model autonomy. 

Overall, three of the four risk categories—cybersecurity, biological threats, and model autonomy—were rated as low risk. The only category with a higher risk rating was persuasion.

The GPT-4o System Card is not the first system card released by OpenAI. The startup earlier released similar reports for GPT-4, GPT-4 with vision, and DALL-E 3. 

OpenAI’s GPT-4o Mini: Check Features, Capabilities and Pricing

How is GPT-4o Following AI Safety Measurements?

Some of the key safety measures for GPT-4o include:   

  • External Red Teaming: According to the System Card, OpenAI engaged over 100 external red teamers from 29 different countries to stress-test the model for potential vulnerabilities and risks. It was carried out in four phases. External red teaming covered categories “that spanned violative & disallowed content (illegal erotic content, violence, self-harm, etc), mis/disinformation, bias, ungrounded inferences, sensitive trait attribution, private information, geolocation, person identification, emotional perception and anthropomorphism risks, fraudulent behavior and impersonation, copyright, natural science capabilities, and multilingual observations.”
  • Preparedness Framework Evaluation: The model was assessed against a framework evaluating risks in cybersecurity, biological threats, persuasion, and model autonomy. GPT-4o scored low in three categories and medium in persuasion.  After reviewing the Preparedness evaluations, the company’s Safety Advisory Group recommended classifying the LLM as the borderline medium risk for persuasion and low risk in all other areas. Since the overall risk score is based on the highest risk category, GPT-4o’s overall risk was classified as medium.
  • Risk Identification, Assessment, and Mitigation: OpenAI identified key risk areas such as unauthorized voice generation, speaker identification, ungrounded inference, and generation of explicit or violent content. The company then developed specific mitigations and implemented them to address these risks. 
  • Continuous Evaluation: Facing the onslaught of criticism for its safety standards by its own employees as well as the Senate, the company continues to monitor and evaluate its model’s performance and safety. However, the company needs to be more transparent about its model training data as well as safety testing. 

Some of the prominent safety measures that OpenAI has implemented in GPT-4o are:

  • Preventing Unauthorized Voice Generation: OpenAI has restricted to use of pre-selected voices, following the Scarlett Johansson lawsuit. It is now using a classifier to detect deviations from these approved voices.   
  • Protecting Privacy: GPT-4o is trained to refuse requests for speaker identification based on voice input. However, it still complies with requests to identify famous personalities “associated with famous quotes.” 
  • Mitigating Bias: The model is designed to avoid making unfounded inferences about individuals and to provide safe responses to requests for sensitive trait attribution. 
  • Blocking Harmful Content: The company has also placed filters to prevent the generation of violent, erotic, or otherwise harmful content. Their moderation classifier runs over text transcriptions of audio prompts and blocks the output if the prompt contains explicit or violent language.

OpenAI’s AI Detection Tool Sparks Debate Over ChatGPT Watermarking

The Bottom Line

As stated above, OpenAI needs to be more transparent about its model training data as well as safety testing. The company has been called out numerous times for its safety issues, including founder and CEO Sam Altman’s dismissal in 2023. 

Moreover, the company is reportedly developing a highly capable multimodal AI model, right before the Presidential Elections of the United States take place. We have already discussed how AI models like GPT can pose a severe threat to democratic elections and electoral processes. These models can easily mitigate misinformation and influence public opinion. 

It is crucial for OpenAI as well as other firms operating in this domain to address these concerns and prioritize the safety and ethical implications of their technology. Transparency and accountability are key in ensuring that their AI models are not used to perpetuate harm or misinformation. 

 Open AI Search Engine: How this AI-Powered Search Product is Different from Google? Check Here

Previous Post

What is Digital Manufacturing? Check Examples with Definition

Next Post

Finding Words Puzzle: Find the word “ignore” in 15 seconds!

raya-author-image

Raya

Raya is a tech enthusiast diving deep into New-Age technology, especially Artificial Intelligence (AI) and Machine Learning (ML). She is passionate about decoding the complexities and uses of new-age tech. Raya is on a mission to write articles that bridge the gap between technical jargon and everyday understanding, making AI and ML accessible to a wider audience.

Next Post
word puzzle find ignore in 15 seconds

Finding Words Puzzle: Find the word “ignore” in 15 seconds!

  • Trending
  • Comments
  • Latest
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2025: Maximize APY with Secure and Trusted Crypto Tools

April 17, 2025
scott wu net worth

Scott Wu Net Worth: Devin AI Software Engineer, CEO of Cognition Labs

April 17, 2025
Artificial Intelligence (AI) Glossary and Terminologies

Artificial Intelligence (AI) Glossary and Terminologies – Complete Cheat Sheet List

April 18, 2025
TurbolearnAI

Turbolearn AI: How to Use It for FREE, Features and Pricing Models

April 3, 2025
What is Blockchain Technology

What is Blockchain Technology And How Does It Work?

Enterprise AI

What is Enterprise AI? Meaning, Companies, Examples and More Details

Cosine Genie AI Software Engineer

What is Cosine Genie and How to Use? Check Benchmark, Functions, and Access Details

PhonePe Leads UPI Market in August 2024, Claims 50% Share by Value and 48% by Volume

PhonePe Partners with Liquid Group to Bring UPI Payments to Singapore for Indian Travelers

Google is moving Android news to a virtual event before I/O

Google is moving Android news to a virtual event before I/O

April 29, 2025
Generative AI Companies

Top Generative AI Companies of the World 2025

April 28, 2025
Veo 2 extends access to more Gemini Advanced Users

Veo 2 extends access to more Gemini Advanced Users

April 25, 2025
Perplexity launches the iPhone voice assistant

Perplexity launches the iPhone voice assistant

April 24, 2025

Recent News

Google is moving Android news to a virtual event before I/O

Google is moving Android news to a virtual event before I/O

April 29, 2025
Generative AI Companies

Top Generative AI Companies of the World 2025

April 28, 2025
Veo 2 extends access to more Gemini Advanced Users

Veo 2 extends access to more Gemini Advanced Users

April 25, 2025
Perplexity launches the iPhone voice assistant

Perplexity launches the iPhone voice assistant

April 24, 2025

Trending in AI

  • Perplexity CEO Net Worth
  • Grammarly AI Detection
  • What is LangChain
  • Canva AI Tool
  • Koupon AI
Tech Chilli

Tech Chilli is a beacon of knowledge, a relentless purveyor of the latest information, news, and groundbreaking research in the realm of cutting-edge technology.

We are dedicated to curating and delivering the most relevant, accurate, and up-to-the-minute information on the technologies that are shaping our world.
Contact us – [email protected]

Follow Us

Browse by Category

  • AI
  • AI India
  • Courses
  • Crypto
  • Featured
  • FinTech
  • Gaming
  • How-To
  • News
  • Puzzles
  • Robotics

Top Searches

  • Scott Wu Net Worth
  • Mira Murati Net Worth
  • Online Games for Couples
  • Amazon Q vs Microsoft Copilot
  • DarkGPT

Recent News

Google is moving Android news to a virtual event before I/O

Google is moving Android news to a virtual event before I/O

April 29, 2025
Generative AI Companies

Top Generative AI Companies of the World 2025

April 28, 2025
Veo 2 extends access to more Gemini Advanced Users

Veo 2 extends access to more Gemini Advanced Users

April 25, 2025
Perplexity launches the iPhone voice assistant

Perplexity launches the iPhone voice assistant

April 24, 2025
  • About Us
  • Privacy Policy
  • Disclaimers
  • Terms and Conditions
  • Contact Us
  • DMCA Policy

© 2024 Tech Chilli

No Result
View All Result
  • News
  • AI
  • Fintech
  • Crypto
  • AI India
  • Robotics
  • Courses
  • How-To
  • Puzzles
  • Gaming
  • Contact Us

© 2024 Tech Chilli

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.OK