Elon Musk's xAI has launched the Grok 1.5 Vision preview, an AI model that could see and process information from images, documents, screenshots, diagrams, and the list goes on.
xAI Grok 1.5 Model
xAI, owned by Elon Musk, has launched first generation multimodal model, Grok 1.5 Vision for preview. It will be available to early testers and Grok users soon. Grok 1.5 V has very strong text and visual capabilities. It can process information from diagrams, documents, charts, photographs, and screenshots.
In addition to better image understanding, Grok 1.5 v also introduced the RealWorldQA module, which helps it better understand the physical world using the images uploaded by users.
Also Read: Grok 1.5 Release Date, Price, Key Features and Other Details
As per the official blog, Grok, when evaluated in a zero-shot setting without chain-of-thought prompting, outperformed its peers in their new RealWorldQA benchmark that measures real-world spatial understanding.
xAI Grok 1.5 can write code with a diagram. This is among the amazing features unveiled by Elon Musk during the announcement of the upcoming Grok Model to compete with ChatGPT 4 and Google Gemini 1.5 pro.
Also Read: Grok 1.5 vs Mistral vs Claude vs GPT-4 vs Gemini: What are the Benchmark Differences?
According to the official announcement, Grok-1.5V is competitive with existing frontier multimodal models in several domains, ranging from multi-disciplinary reasoning to understanding documents, science diagrams, charts, screenshots, and photographs. We are particularly excited about Grok’s capabilities in understanding our physical world. Grok outperforms its peers in our new RealWorldQA benchmark, that measures real-world spatial understanding. For all datasets below, we evaluate Grok in a zero-shot setting without chain-of-thought prompting. Here is a screenshot of its performance.
Grok 1.5 Vision with improvements in both multimodal modalities and generation capabilities will become a significant tool in advancing multimodal AI interactions.
This post was last modified on April 15, 2024 5:26 am
Rish Gupta is an Indian entrepreneur who serves as the chief executive officer (CEO) of…
Are you looking to advance your engineering career in the field of robotics? Check out…
Artificial intelligence is a topic that has recently made internet users all over the world…
Boost your learning journey with the power of AI communities. The article below highlights the…
Demystify the world of Artificial Intelligence with our comprehensive AI Glossary and Terminologies Cheat Sheet.…
Scott Wu is the co-founder and Chief Executive Officer of Cognition Labs, an artificial intelligence…