Google VP Sissie Hsiao: the Gemini AI demo video ‘is completely real,’ though Google ‘did shorten parts for brevity’

Date:

Share:

[ad_1]

Amidst the frenzy that is the generative AI market, major players are fiercely vying for the shiniest product. For its part, Google, traditionally a more measured participant in this race, unveiled a teaser video for their Gemini large language model this week. However, things took a controversial turn when reports revealed the video was not actually a real time representation of the AI in action.

In the demo video released by Google, the showcased AI model shows its multimodal capabilities, demonstrating an ability to deftly decipher and handle information gleaned from live video and audio. It’s a formidable achievement for Google, particularly in the fierce arena of competition against the likes of OpenAI, where it has lagged behind. However, as reported by Bloomberg, the showcased demo was crafted by “using still image frames from the footage, and prompting via text,” rather than the real-time and vocal and video processing it seemed to achieve.

On stage at Fortune‘s Brainstorm AI conference in San Francisco on Monday, vice president and general manager of Google Assistant and Bard Sissie Hsiao spoke about the contentious demo video, focusing on the benchmarks Gemini reached as a model, and how it’ll propel Google’s chatbot Bard.

“The video is completely real. All the prompts and the model responses are real,” Hsiao said. “We did shorten parts for brevity, which we put in the video as information on making the video,” she noted.

The demo video displays the new AI model’s multimodal capabilities, identifying a squiggly line, then the curves of new lines, culminating in the creation of the drawing of a duck. Throughout this process, the model consistently recognizes each element, offering duck-related facts and answers in real-time.

Hsiao highlighted the milestones conquered by Gemini, showcasing its abilities in benchmarks that put AI models to the test, spanning high school physics, professional legal quandaries, and moral scenarios. According to the Verge, Gemini Ultra beat OpenAI’s GPT-4 in 30 out of 32 benchmarks—an achievement worth boasting about, although Gemini Ultra will not be released until next year. For now, Bard uses the less advanced Gemini Pro, which is roughly akin to GPT 3.5.

Hsiao said these Gemini models will continue to improve Google search as well as the Google Bard chatbot, which she said is “the most preferred free chat bot now in the market.”

Subscribe to the new Fortune CEO Weekly Europe newsletter to get corner office insights on the biggest business stories in Europe. Sign up for free.

[ad_2]

Source link

Subscribe to our magazine

━ more like this

Sports Betting Reddit Trends: What Smart Bettors Are Doing Differently

Introduction Over the past few years, Reddit has become one of the most active platforms for bettors looking to improve their strategies. What started as...

The Rise of Specialist Executive Recruitment Firms in the UK

Finding the right senior leader has never been easy. But in today’s fast-moving UK business environment, it has become even harder. Companies face rapid digital...

Why Non-Executive Directors Are Essential for Strong Governance and Business Growth

Did you know that companies with effective non-executive directors (NEDs) can outperform their competitors by up to 20%? This remarkable statistic underscores the vital...

What Canadian Bettors Look for in a Great Sports Betting Experience

What Canadian Bettors Look for in a Great Sports Betting Experience Sports betting has grown quickly across Canada. From casual fans placing weekend wagers to...

How Professional Bettors Manage Risk and Bankroll

Professional betting is often misunderstood. Many assume success comes from predicting winners more accurately than everyone else. In reality, long-term profitability depends far more...