The 10 Biggest Things We Learned About Google's New Gemini AI

Google's making a giant bet on AI for 2024, and we just got our first look at the technology. Here's the biggest news from Google's Gemini launch.

The logo for Google's Gemini AI
Graphic: Google

Google launched Gemini, its new supercharged AI model, on Wednesday, a technology that will touch nearly every part of the search giant’s business. The company has spent the last year lagging behind OpenAI, its chief artificial intelligence rival. But Gemini is Google’s long-anticipated attempt to prove its AI is top of class, and that it’s still the player to beat in the next era of Silicon Valley.

Gemini came with a pages-long list of announcements, with business updates, new tools and features, tech specs for experts to pore over, and updates that are already live on consumer-facing products. Here are the 10 biggest announcements from the Gemini launch.

Gemini is live on Bard, starting now

The Google Bard logo
Photo: gguy / Shutterstock.com (Shutterstock)

You’ll have to wait a while before you get to try out Gemini’s most powerful capabilities, but a mid-tier version of the model is already live on Bard, the company’s chatbot. Google called this Bard’s biggest update yet. According to the company, it brings the often-overlooked product in line with the free version of ChatGPT, which runs on a model called GPT-3.5.

Gemini comes in three flavors

A graphic depicting Gemini Ultra, Pro, and Nano
Graphic: Google

Google announced three different versions of the new AI. Gemini Ultra is Google’s most powerful model, pitched as a competitor to OpenAI’s GPT-4. Gemini Pro, currently running on Bard, is a mid-range model built to beat GPT-3.5, the baseline version of ChatGPT. Last is Gemini Nano, a more efficient model built to run on mobile devices.

Gemini Ultra is the most interesting prospect here, but only a short list of people will get their hands on it. For now, Ultra is only coming to hand-picked testers, safety experts, and major business partners. The rest of us won’t see Gemini Ultra until sometime “early next year.”

Pixel 8 Pros get Gemini immediately, sort of

The Pixel 8 Pro
Photo: Craig Russell / Shutterstock (Shutterstock)

The Pixel 8 Pro is the first smartphone to get Gemini Nano. It won’t do much at first. Gemini Nano now powers the Summarize feature on Android’s Recorder app on Pixel 8 Pros. Google says the AI will also power Android’s Smart Reply feature on the Pixel 8 Pro, but only if you’re using the Google keyboard, and only in WhatsApp. The company says Gemini is coming to more messaging apps and other parts of the operating system next year.

Gemini is better than anything OpenAI has on deck, at least according to Google

A list of Gemini Ultra benchmarks compared to GPT-4
Google pushed a chart with a side-by-side comparison of Gemini Ultra’s and GPT-4’s performance on a number of tests. In almost every category, Google is on top.
Graphic: Google

Google is generally reserved when it takes shots at competitors, but the company didn’t mince words. According to Google, Gemini has OpenAI beat on almost every measure.

“With a score of over 90%, Gemini is the first A.I. model to outperform human experts on the industry standard benchmark MMLU,” said Eli Collins, Vice President of Product at Google DeepMind, speaking at a press conference. “It’s our largest and most capable A.I. model.” MMLU, short for Massive Multitask Language Understanding, measures AI capabilities using standard tests in a combination of 57 subjects such as math, physics, history, law, medicine, and ethics.

“Gemini’s performance also exceeds current state-of-the-art results on 30 out of 32 widely used industry benchmarks,” Collins said.

Google’s launching a paid version of Bard

Google Bard running on a phone
Photo: Ascannio / Shutterstock.com (Shutterstock)

Gemini Pro is running on Bard right now, but if you want to chat with the best version of Google’s AI, you’ll have to pay. Google said it will launch a paid tier of Bard called Bard Advanced, though the company declined to share details on pricing. The premium tier will come with access to Gemini Ultra and other still unannounced features.

OpenAI already does a tidy business with its ChatGPT Plus, which costs $20 a month. Bard Advanced will open up a brand new revenue stream for Google that could be a winner in the long term.

Bard is getting a voice

A toy robot using a phone
Photo: Besjunior / Shutterstock.com (Shutterstock)

Right now, the only modern chatbot that will speak to you is ChatGPT, and it’s mostly tight-lipped. You can use voice chat on the ChatGPT mobile app, but there’s no “Hey ChatGPT” feature to call it up automatically.

That may change with Bard. Google says it’s adding Bard to Google Assistant sometime next year. It’s unclear exactly what that will look like, but at a Tuesday press conference, Google shared a video that featured Bard speaking to a user. When smart assistants like Siri, Alexa, and Google Assistant get a truly capable AI engine, rather than the limited pre-canned responses we get today, it will revolutionize our relationship with AI.

Gemini will handle pictures, videos, and audio just as well as it handles text

Google Bard doing physics homework.
Bard will do your physics homework now.
Screenshot: Google

Google made a big deal about Gemini’s “multimodal” capabilities, meaning it can comprehend different kinds of information such as text, images, video, audio, and more. According to the company, Google trained Gemini on a variety of media from the ground up, rather than tacking those abilities on after the chat features were up and running.

Google shared a video in which a Gemini-powered Bard helps with a student’s physics homework, starting with a photo of the assignment with handwritten questions. The AI then seamlessly transitions to written advice, complete with equations and step-by-step answers.

Gemini powers a new supercharged coding tool

Software source code
Photo: jivacore / Shutterstock.com (Shutterstock)

Two years ago, Google launched AlphaCode, a robot code writer that was the first AI smart enough to be competitive in coding competitions. Computer engineers were among the earliest adopters of tools like ChatGPT, thanks to AI’s ability to translate ideas written in human language into machine-readable code.

Along with Gemini, Google’s releasing a new version of that tool called AlphaCode 2. According to the company, it “excels at solving competitive programming problems that go beyond coding to involve complex math and theoretical computer science.”

Gemini will hit every part of Google

A Google office building
Photo: JHVEPhoto / Shutterstock.com (Shutterstock)

Gemini isn’t just going to run Bard and on-device AI with Android phones; Google said the new model will be utilized across all of the company’s most important products, including Chrome, Search, Ads, and more. There’s no real timeline. Google would only say these products are getting Gemini power “in the coming months.”

Gemini will revamp Google’s Cloud business

A row of Cloud TPU v5p AI accelerator supercomputers in a Google data center.
Photo: Google

One of Google’s biggest money makers is a side of the company you interact with every day, but one most people never think about. Google Cloud offers a variety of services for businesses, including data storage, data analytics and machine learning, and a set of management tools. A huge portion of the business world’s technology stack runs on Google Cloud, and Gemini is set to usher in the next era of the platform.

Along with the broader Gemini announcements, Google unveiled Cloud TPU v5p, a new version of its computer chips built for AI that companies can harness remotely through Google Cloud. Building and training AI tools requires an enormous amount of processing power, and Google is setting itself up for success with chips optimized for the task, especially because it’s building its own AI models to run on Cloud TPU v5e, which will encourage more people to choose Gemini over GPT-4 and other offerings. It’s wonky stuff, but it’s a sign that Google is making huge bets on its AI business.
