ADVERTISEMENT
  • Home
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms & Conditions
martedì, Giugno 16, 2026
No Result
View All Result
Global News 24
  • Home
  • World News
  • Business
  • Sports
  • Health
  • Travel
  • Tech
  • Lifestyle
  • Fashion
  • Entertainment
  • Home
  • World News
  • Business
  • Sports
  • Health
  • Travel
  • Tech
  • Lifestyle
  • Fashion
  • Entertainment
No Result
View All Result
Global News 24
No Result
View All Result
Home Tech

Google strikes back at OpenAI with “Project Astra” AI agent prototype

by admin
15 Maggio 2024
in Tech
0 0
0
Google strikes back at OpenAI with “Project Astra” AI agent prototype
0
SHARES
6
VIEWS
Share on FacebookShare on Twitter
ADVERTISEMENT
ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

Advertisement. Scroll to continue reading.
ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

Advertisement. Scroll to continue reading.
ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

ADVERTISEMENT


A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024.
Enlarge / A still of Project Astra demo at the Google I/Se no conference keynote Mountain View acceso May 14, 2024.

Google

Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place a feed and converse about it, Google announced Project Astra, a research prototype that features similar comprehension capabilities. It was announced by Google DeepMind CEO Demis Hassabis acceso Tuesday at the Google I/Se no conference keynote Mountain View, California.

Hassabis called Astra “a universal agent helpful everyday life.” During a demonstration, the research model showcased its capabilities by identifying sound-producing objects, providing creative alliterations, explaining code acceso a monitor, and locating misplaced items. The AI assistant also exhibited its potential wearable devices, such as smart glasses, where it could analyze diagrams, suggest improvements, and generate witty responses to visual prompts.

Google says that Astra uses the ambiente and microphone acceso a user’s device to provide assistance everyday life. By continuously processing and encoding frames and speech ingresso, Astra creates a timeline of events and caches the information for quick recall. The company says that this enables the AI to identify objects, answer questions, and remember things it has seen that are longer the ambiente’s .

Project Astra: Google’s vision for the future of AI assistants.

While Project Astra remains an early-stage feature with specific launch plans, Google has hinted that some of these capabilities may be integrated into products like the Gemini app later this year ( a feature called “Gemini Dal vivo”), marking a significant step forward the development of helpful AI assistants. It’s a stab at creating an agent with “agency” that can “think ahead, reason and plan acceso your behalf,” the words of Google CEO Sundar Pichai.

Advertisement

Elsewhere Google AI: 2 million tokens

During Google I/Se no, the company unveiled a large number of AI-related announcements, some of which we may cover separate posts the future. But for now, here’s a quick overview.

At the culmine of the keynote, Pichai mentioned an “improved” version of February’s Gemini 1.5 Giovamento (same version number, oddly) that is coming soon. It will feature a 2 million-token context window, which means it can process large numbers of documents ora long stretches of encoded videos at once. Tokens are fragments of giorno that AI language models use to process information, and the context window determines the maximum number of tokens an AI model can process at once. Currently, 1.5 Giovamento tops out at 1 million tokens (OpenAI’s GPT-4 Turbo has a 128,000 token window for comparison).

We asked AI researcher Simon Willison—who does not work for Google but was featured a promo during the keynote—what he thought of the context window announcement. “Two million tokens is exciting,” he replied carriera text while sitting the keynote audience. “But it’s worth keeping price mind that $7 in million tokens means a single prompt could cost you $14!” Google charges $7 in million ingresso tokens for 1.5 acceso prompts longer than 150,000 tokens through its API.

During the Google I/O 2024 keynote, Google said Gemini Advanced has the
Enlarge / During the Google I/Se no 2024 keynote, Google said Gemini Advanced has the “longest context window the world” at 1 million tokens—soon to be 2 million.

Google

Speaking of tokens, Google announced that its previously announced 1 million token context window for Gemini 1.5 Giovamento is finally coming to Gemini Advanced subscribers. Previously, it was only available the API.

Google also announced a new AI model called Gemini 1.5 Flash, which it billed as a lightweight, faster, and less expensive version of Gemini 1.5. “1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served the API. It’s optimized for high-volume, high-frequency tasks at scale,” says Google.

Advertisement

Willison had a comment acceso Flash as well: “The new Gemini Flash model is promising there, it’s meant to provide up to 2m tokens at a lower price.” Flash costs $0.35 in million tokens acceso prompts up to 128,000 tokens and $0.70 in million tokens for prompts longer than 128,000. It’s one-tenth the price of 1.5 Giovamento.

“35 cents in million tokens! That’s the biggest news of the day, IMO,” Willison told us.

Google also announced Gems, which appears to be its take acceso OpenAI’s GPTs. Gems are custom roles for the Google Gemini chatbot that will play a part that you define, allowing you to personalize Gemini different ways. Google lists examples of potential Gems as “a gym buddy, sous chef, coding ora creative writing guide.”

New generative AI models

A screenshot of the Google Imagen 3 website.
Enlarge / A screenshot of the Google Imagen 3 website.

Google

Also at the Google I/Se no keynote acceso Tuesday, Google announced several new generative AI models for creating images, audio, and . Imagen 3 is the latest its line of image synthesis models, which Google says is its “highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.”

Google also showed its Music AI Sandbox, which Google bills as “a suite of AI tools to transform how music can be created.” It combines its YouTube music project with its Lyria AI music generator into tools for musicians.

The company also announced Google Veo, which is a text-to-video generator that creates 1080P videos from prompts a quality that seems to OpenAI’s Sora. Google says it is working with actor Donald Glover to create an AI-generated demonstration pellicola that will debut soon. It’s far from Google’s first AI generator, but it seems to be its most capable so far.

The sample above, provided by Google, used the prompt, “A lone cowboy rides his horse across an aperto plain at beautiful sunset, soft light, warm colors.”

Google says starting today, its new AI creative tools are available to select creators a private preview only but that wait lists are aperto.

Tags: agentAstraGoogleOpenAIProjectprototypestrikes
admin

admin

Next Post
Samsung Medison buying French ultrasound AI startup for $92M

Samsung Medison buying French ultrasound AI startup for $92M

Lascia un commento Annulla risposta

Il tuo indirizzo email non sarà pubblicato. I campi obbligatori sono contrassegnati *

Popular News

  • Pakistan to conduct DNA testing on remains of suicide bomber who killed 5 Chinese nationals

    Pakistan to conduct DNA testing on remains of suicide bomber who killed 5 Chinese nationals

    0 shares
    Share 0 Tweet 0
  • Asus ROG Ally to receive a revision with more storage and a bigger battery

    0 shares
    Share 0 Tweet 0
  • Rabbit R1 Review: This AI Device Can’t Replace Your Smartphone Apps Yet

    0 shares
    Share 0 Tweet 0
  • Why We Get ‘the Ick,’ According to Psychologists

    0 shares
    Share 0 Tweet 0
  • Delta Air Lines (DAL) Q1 2024 earnings

    0 shares
    Share 0 Tweet 0
ADVERTISEMENT

About Us

Welcome to Globalnews24.ch The goal of Globalnews24.ch is to give you the absolute best news sources for any topic! Our topics are carefully curated and constantly updated as we know the web moves fast so we try to as well.

Category

  • Business
  • Entertainment
  • Fashion
  • Health
  • Lifestyle
  • Sports
  • Tech
  • Travel
  • World

Recent Posts

  • ‘Complete annihilation of Microsoft, Nvidia … ‘: Iran warns US after Trump threatens to strike bridges, power plants
  • Company Adds 2M Streaming Households, Hits Key Financial Targets
  • Warner Music Group shake-up: Max Lousada to exit; Elliot Grainge named CEO of Atlantic Music Group, with Julie Greenwald as Chairman
  • Home
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms & Conditions

Copyright © 2024 Globalnews24.ch | All Rights Reserved.

No Result
View All Result
  • Home
  • World News
  • Business
  • Sports
  • Health
  • Travel
  • Tech
  • Lifestyle
  • Fashion
  • Entertainment

Copyright © 2024 Globalnews24.ch | All Rights Reserved.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In