• About Us
  • Privacy Policy
  • Disclaimers
  • Terms and Conditions
  • Contact Us
  • DMCA Policy
Tech Chilli
  • News
  • AI
  • Fintech
  • Crypto
  • AI India
  • Robotics
  • Courses
  • How-To
  • Puzzles
  • Gaming
  • Contact Us
No Result
View All Result
  • News
  • AI
  • Fintech
  • Crypto
  • AI India
  • Robotics
  • Courses
  • How-To
  • Puzzles
  • Gaming
  • Contact Us
No Result
View All Result
Tech Chilli
No Result
View All Result

Home » Robotics » Innovative Humanoid Robot Alter3 Uses GPT-4 to Execute Detailed Commands

Innovative Humanoid Robot Alter3 Uses GPT-4 to Execute Detailed Commands

GPT-4 integration allows Alter3, a humanoid robot, to exhibit spontaneous motion generation, exhibit sophisticated zero-shot learning capabilities, and execute intricate movements, including capturing self-portraits, imitating a ghost, and even enacting intricate scenarios without the need for explicit programming.

Kumud Sahni Pruthi by Kumud Sahni Pruthi
Wednesday, 26 June 2024, 3:05 AM
in Robotics
Researchers Develop Alter3: A Humanoid Robot Powered by GPT-4 for Complex Tasks

Researchers Develop Alter3: A Humanoid Robot Powered by GPT-4 for Complex Tasks

A humanoid robot system that can translate commands from natural language straight into robot behaviours has been developed by researchers at Alternative Machine and the University of Tokyo. The robot, called Alter3, is intended to leverage the extensive information found in large language models (LLMs) like GPT-4 to carry out intricate tasks like posing as a ghost or snapping a selfie.

This research is the most recent in an expanding series that combines robotics systems and foundation model power. Although scalable commercial solutions for these systems are still a ways off, they have spurred advancements in robotics research in recent years and hold great potential.

Also Read: Humanoid Robots and the Role of AI in Transforming Technology

Robot Controlled by LLMs

GPT-4 is the backend model used by Alter3. A natural language command that either explains an action or a circumstance that the robot needs to react to is given to the model.

The LLM plans a sequence of steps that the robot needs to execute to accomplish its objective using an “agentic framework.” The model serves as a planner in the first stage, figuring out the steps needed to carry out the intended action.

Also Read: IEEE Forms Group for Humanoid Robot Safety and Performance Standards

After that, a coding agent receives the action plan and uses it to create the commands needed for the robot to carry out each step. Because GPT-4 has yet to be taught on Alter3’s programming commands, the researchers utilize its capacity for in-context learning to modify its behaviour to match the robot’s API. This indicates that a list of commands and a series of examples demonstrating the use of each command are included in the prompt. Next, the model associates each step with one or more API commands that are transmitted to the robot for implementation.

The researchers state, “We had to control all 43 axes in a specific order before the LLM appeared to mimic a person’s pose or to pretend a behaviour such as serving tea or playing chess.” “We are no longer burdened with the repetitive tasks because of LLM.“

Gaining knowledge via user feedback

The most precise vehicle for describing physical positions is not language. As a result, the action sequence that the model generates may not precisely cause the robot to behave as intended.

The researchers have included functionality that enables others to offer comments, such as “Raise your arm a bit more,” to encourage repairs. Another GPT-4 agent receives these instructions, analyzes the code, makes the required adjustments, and sends the action sequence back to the robot. For later use, the code and improved action recipe are kept in a database.

Also Read: Is China’s Humanoid Robot Industry a marketing ploy or a sign of the Robotic Revolution?

Alter3 was put to the test by the researchers using a variety of tasks, including mimicking gestures like posing as a snake or a ghost and commonplace actions like drinking tea and snapping selfies. The model’s capacity to react to situations requiring meticulous action planning was also put to the test.

A wide variety of language representations of movements are included in the LLM’s instruction. According to the researchers, GPT-4 can map these representations precisely onto Alter3’s anatomy.

GPT-4’s vast understanding of human motions and behaviours allows for the development of more realistic behaviour plans for humanoid robots, such as Alter3. According to the researchers’ experiments, they were also able to replicate in the robot feelings like delight and humiliation.

The researchers believe that “the LLM can infer adequate emotions and reflect them in Alter3’s physical responses, even from texts where emotional expressions are not explicitly stated.”

More sophisticated models

In robotics research, the use of foundation models is growing in popularity. For instance, the $2.6 billion Figure leverages OpenAI models in the background to comprehend commands from humans and perform tasks in the actual world. Robotics systems will become more capable of reasoning about their surroundings and making decisions as multi-modality becomes the standard in foundation models.

Also Read: China Ex-Robots Develop Humanoids With Facial Movement & Emotional Intelligence

Alter3 is among a group of initiatives that leverage commercially available foundation models as modules for planning and reasoning in robotic control systems. The researchers note that Alter3 does not employ an optimized version of GPT-4 and that other humanoid robots can utilize the code.

Some projects, like OpenVLA and RT-2-X, use specific foundation models that are intended to generate robotic commands directly. These models typically yield more consistent outcomes and exhibit greater task and environmental generalization. However, they cost more to produce and call for specialized knowledge.

A common oversight in these initiatives is the fundamental difficulties involved in building robots capable of simple functions like gripping objects, keeping their equilibrium, and moving.

Also Read: AI Humanoid ‘Reachy2’ by Hugging Face and Pollen Robotics Debuts in New Video

Previous Post

Global IndiaAI Summit 2024: Date, Place, Speakers and Discussion Pointers

Next Post

Alan Cowen Net Worth – CEO of Hume AI (Emotional Intelligence Company)

Kumud Sahni Pruthi

Kumud Sahni Pruthi

A postgraduate in Science with an inclination towards education and technology. She always looks for ways to help people improve their lives by putting complex things into simple words through her writing.

Next Post
alan cowen net worth

Alan Cowen Net Worth - CEO of Hume AI (Emotional Intelligence Company)

  • Trending
  • Comments
  • Latest
top Yield Farming Platforms

Top 13 Yield Farming Platforms in 2025: Maximize APY with Secure and Trusted Crypto Tools

April 17, 2025
scott wu net worth

Scott Wu Net Worth: Devin AI Software Engineer, CEO of Cognition Labs

April 17, 2025
TurbolearnAI

Turbolearn AI: How to Use It for FREE, Features and Pricing Models

April 3, 2025
Artificial Intelligence (AI) Glossary and Terminologies

Artificial Intelligence (AI) Glossary and Terminologies – Complete Cheat Sheet List

April 18, 2025
What is Blockchain Technology

What is Blockchain Technology And How Does It Work?

Enterprise AI

What is Enterprise AI? Meaning, Companies, Examples and More Details

PhonePe Leads UPI Market in August 2024, Claims 50% Share by Value and 48% by Volume

PhonePe Partners with Liquid Group to Bring UPI Payments to Singapore for Indian Travelers

Cosine Genie AI Software Engineer

What is Cosine Genie and How to Use? Check Benchmark, Functions, and Access Details

AI and Crypto

How Will Artificial Intelligence (AI) Transform the Crypto Industry?

May 30, 2025

Top 10 AI Chatbots for Mental Health in 2025 (Rank-wise)

May 28, 2025
What is Threat Intelligence

What is Threat Intelligence? Tools, Meaning and Sources

May 27, 2025
Cryptocurrency exchanges in India

10 Best Low-Fee Crypto Exchanges India 2025

May 26, 2025

Recent News

AI and Crypto

How Will Artificial Intelligence (AI) Transform the Crypto Industry?

May 30, 2025

Top 10 AI Chatbots for Mental Health in 2025 (Rank-wise)

May 28, 2025
What is Threat Intelligence

What is Threat Intelligence? Tools, Meaning and Sources

May 27, 2025
Cryptocurrency exchanges in India

10 Best Low-Fee Crypto Exchanges India 2025

May 26, 2025

Trending in AI

  • Perplexity CEO Net Worth
  • Grammarly AI Detection
  • What is LangChain
  • Canva AI Tool
  • Koupon AI
Tech Chilli

Tech Chilli is a beacon of knowledge, a relentless purveyor of the latest information, news, and groundbreaking research in the realm of cutting-edge technology.

We are dedicated to curating and delivering the most relevant, accurate, and up-to-the-minute information on the technologies that are shaping our world.
Contact us – [email protected]

Follow Us

Browse by Category

  • AI
  • AI India
  • Courses
  • Crypto
  • Featured
  • FinTech
  • Gaming
  • How-To
  • News
  • Puzzles
  • Robotics

Top Searches

  • Scott Wu Net Worth
  • Mira Murati Net Worth
  • Online Games for Couples
  • Amazon Q vs Microsoft Copilot
  • DarkGPT

Recent News

AI and Crypto

How Will Artificial Intelligence (AI) Transform the Crypto Industry?

May 30, 2025

Top 10 AI Chatbots for Mental Health in 2025 (Rank-wise)

May 28, 2025
What is Threat Intelligence

What is Threat Intelligence? Tools, Meaning and Sources

May 27, 2025
Cryptocurrency exchanges in India

10 Best Low-Fee Crypto Exchanges India 2025

May 26, 2025
  • About Us
  • Privacy Policy
  • Disclaimers
  • Terms and Conditions
  • Contact Us
  • DMCA Policy

© 2024 Tech Chilli

No Result
View All Result
  • News
  • AI
  • Fintech
  • Crypto
  • AI India
  • Robotics
  • Courses
  • How-To
  • Puzzles
  • Gaming
  • Contact Us

© 2024 Tech Chilli

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.OK