What Is Cosine Genie and How to Use It? Check Benchmark, Functions, and Access Details

In March 2024, Cognition Labs unveiled Devin, billed as the world’s first AI software engineer. Since then, the race to build artificial intelligence-powered engineers has been on.

And now, a new and stronger contender has appeared in this race. 

Recently, Cosine, a human reasoning lab co-founded by Alistair Pullen, unveiled Genie, an AI software engineer that the company describes as the best in the world, surpassing even Devin.

Genie can autonomously solve bugs, build features, and refactor code. It scored 30.08% on SWE-Bench evaluations.

“We believe that if you want a model to behave like a software engineer, it has to be shown how a human software engineer works,” Alistair Pullen, Cosine’s CEO, said.

This article will cover everything you need to know about Cosine Genie, its performance benchmarks, functionalities, and how you can gain access to it.

What is Cosine Genie?

Cosine Genie is an advanced AI software engineering model designed to handle a wide array of tasks. Whether the job is debugging, feature development, or code refactoring, Genie can work either fully autonomously or in collaboration with a user.

Cosine says, “Genie is the world’s first AI Software Engineering colleague trained on data that perfectly emulates the cognitive processes, logic, and workflow of human engineers.”

Unlike other AI tools that only serve as copilots, Genie actively participates in the software development process.

Cosine Genie: Benchmark Performance

According to Cosine, Genie achieved the highest score among similar models on SWE-Bench, a benchmark designed to evaluate the coding abilities of large language models across different software engineering tasks. Genie scored 30.08%, outperforming its competitors.

To put this in perspective, Devin was evaluated on a 570-task subset of the 2,294 SWE-Bench tasks and resolved 13.86% of that subset. Counted against the full 2,294 tasks, that amounts to a success rate of a mere 3.44%. Genie’s performance isn’t just a step forward; it represents a dramatic leap in the capabilities of AI within software development.
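
The percentages above are plain ratios of resolved tasks to tasks counted, so they are easy to sanity-check. The snippet below is a minimal sketch of that arithmetic, not an official scoring script. The task counts it prints are back-derived from the published percentages (an assumption; neither count is stated in this article), and the helper names `tasks_from_rate` and `resolve_rate` are made up for illustration.

```python
# Sanity check of the SWE-Bench percentages quoted in the article.
# Assumption: resolved-task counts are back-derived from the percentages;
# they are not figures published in this article.

FULL_SET = 2294  # total SWE-Bench tasks, as quoted in the article


def tasks_from_rate(rate_percent: float, total: int) -> int:
    """Estimate how many tasks a percentage score corresponds to."""
    return round(rate_percent / 100 * total)


def resolve_rate(resolved: int, total: int) -> float:
    """Return the share of resolved tasks as a percentage, to two decimals."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * resolved / total, 2)


genie_tasks = tasks_from_rate(30.08, FULL_SET)  # Genie's quoted score
devin_tasks = tasks_from_rate(3.44, FULL_SET)   # Devin's score on the full set

print(genie_tasks, resolve_rate(genie_tasks, FULL_SET))
print(devin_tasks, resolve_rate(devin_tasks, FULL_SET))
```

Under these assumptions, the quoted scores correspond to roughly 690 resolved tasks for Genie and 79 for Devin out of 2,294.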

Core Functions

These are some of the key functionalities of AI engineer Genie:

  • Autonomous Bug Solving: Genie can identify and fix bugs in your code without human intervention. It is expected to reduce the time and effort required for debugging.
  • Feature Development: Genie can assist in the entire process, from planning to implementation. It can also help with adding new functionalities to your software or enhancing existing ones.
  • Code Refactoring: The AI software engineer can clean up and optimize your codebase, making it more efficient and easier to maintain.
  • Testing and Validation: Genie excels in validation by running comprehensive tests and analyzing outcomes to ensure the reliability of its solutions. It continuously iterates and improves until the desired results are achieved.

How to access Genie?

Currently, Genie is not available for public use. If you wish to access the AI software engineer, you need to join Cosine’s waitlist.

In the meantime, you can verify Genie’s success rate yourself: Cosine has made Genie’s final outputs publicly available on GitHub for independent verification. For more information, you can read the official technical report.

Future Prospects

Cosine recently closed a $2.5 million seed funding round. The round was led by U.S.-based venture capital firms Uphonest and SOMA Capital, with participation from Lakestar, Focal, and other investors. Cosine plans to expand its model portfolio to “include smaller models for simpler tasks and larger models for complex challenges.”

The company also aims to be able to transform any state-of-the-art foundational model into a Genie model. Its plans include extending the context of an open-source model and pre-training a foundational model on its extensive dataset to achieve better generalization and better reconciliation of specialized data.

This post was last modified on August 13, 2024 6:16 am

Raya

Raya is a tech enthusiast diving deep into New-Age technology, especially Artificial Intelligence (AI) and Machine Learning (ML). She is passionate about decoding the complexities and uses of new-age tech. Raya is on a mission to write articles that bridge the gap between technical jargon and everyday understanding, making AI and ML accessible to a wider audience.
