News

Gemini 1.5 Pro launched in 180+ countries with advanced features: check details

Gemini 1.5 Pro will now be available in 180+ countries as announced by Google. Gemini 1.5 pro will have advanced features including native audio understanding, system instructions, JSON mode, and more.

Google has announced that Gemini 1.5 Pro will be available in 180+ countries via the Gemini API in a public preview, with a first-ever native audio (speech) understanding capability and a new File API to make it easy to handle files. 

Google is also launching new features, like system instructions and JSON mode, to give developers more control over the model’s output. In addition, Google is releasing its next-generation text embedding model that outperforms comparable models.

Also Read: What Is Google Vids? All About Fourth Big Productivity App For Workspace

Google is also expanding the input modalities for Gemini 1.5 Pro to include audio (speech) understanding in both the Gemini API and Google AI Studio. Additionally, Gemini 1.5 Pro is now able to reason across both image (frames) and audio (speech) for videos uploaded in Google AI Studio, and Google might add API support for this in the future.

What are Gemini API Improvements?

Google is addressing the requests by top developers by giving out below mentioned Gemini API improvements:

1. System instructions: Guide the model’s responses with system instructions, now available in Google AI Studio and the Gemini API. Define roles, formats, goals, and rules to steer the model’s behaviour for your specific use case.

2. JSON mode: Instruct the model to only output JSON objects. This mode enables structured data extraction from text or images. You can get started with cURL, and Python SDK support is coming soon.

3. Improvements to function calling: You can now select modes to limit the model’s outputs, improving reliability. Choose text, function call, or just the function itself.

Also Read: Facebook Meta AI Tool Fails to Create Accurate Images like Google Gemini

As per the official blog, developers can access Google’s next-generation text embedding model via the Gemini API. The new model, text-embedding-004, (text-embedding-preview-0409 in Vertex AI), achieves a stronger retrieval performance and outperforms existing models with comparable dimensions, on the MTEB benchmarks. Here is a screenshot report of the Gecko model.

These are the first set of improvements to Gemini 1.5 pro, Google promises to come up with many more improvements as the technology advances.

Also Read: What is Gemini Nano AI Feature Drop in the Google Pixel 8?

This post was last modified on April 10, 2024 9:44 am

Kumud Sahni Pruthi

A postgraduate in Science with an inclination towards education and technology. She always looks for ways to help people improve their lives by putting complex things into simple words through her writing.

Recent Posts

Google is moving Android news to a virtual event before I/O

Google is launching The Android Show: I/O Edition, featuring Android ecosystem president Sameer Samat, to…

April 29, 2025

Top Generative AI Companies of the World 2025

The top 11 generative AI companies in the world are listed below. These companies have…

April 28, 2025

Veo 2 extends access to more Gemini Advanced Users

Google has integrated Veo 2 video generation into the Gemini app for Advanced subscribers, enabling…

April 25, 2025

Perplexity launches the iPhone voice assistant

Perplexity's iOS app now makes its conversational AI voice assistant compatible with Apple devices, enabling…

April 24, 2025

Ola’s AI arm Krutrim intends to raise $300 million

Bhavish Aggarwal is in talks to raise $300 million for his AI company, Krutrim AI…

April 22, 2025

World’s first humanoid half-marathon pits people against robots

The Beijing Humanoid Robot Innovation Center won the Yizhuang Half-Marathon with the "Tiangong Ultra," a…

April 22, 2025