Project Astra, AI assistant, DeepMind Technologies Limited, London, United Kingdom


Project Astra: Our vision for the future of AI assistants

May 14, 2024

Introducing Project Astra. We created a demo in which a tester interacts with a prototype of AI agents supported by our multimodal foundation model, Gemini.

There are two continuous takes: one with the prototype running on a Google Pixel phone and another on a prototype glasses device.

The agent takes in a constant stream of audio and video input. It can reason about its environment in real time and interact with the tester in a conversation about what it is seeing.
 

Project Astra | Exploring the future capabilities of a universal AI assistant

Dec 11, 2024

Project Astra is our research prototype that explores the future capabilities of a universal AI assistant. Using capabilities like multimodal understanding, multilinguality, tool use, native audio, and memory, it helps you understand your world, live.
 

Project Astra: Exploring a Universal AI Assistant with Greg Wayne

Dec 20, 2024

In our final episode for the year, we explore Project Astra, a research prototype exploring future capabilities of a universal AI assistant that can understand the world around you. Host Hannah Fry is joined by Greg Wayne, Director in Research at Google DeepMind. They discuss the inspiration behind the research prototype, its current strengths and limitations, as well as potential future use cases. Hannah even gets the chance to put Project Astra's multilingual skills to the test.

Timecodes

00:00 Intro to Project Astra
03:00 Hannah demo
07:00 Hardware and what's under the hood
16:56 Languages
23:00 Inspiration for Project Astra
33:55 Latency and memory
46:00 What's next
47:00 Hannah's thoughts

Thanks to everyone who made this possible, including but not limited to:

Presenter: Professor Hannah Fry
Series Producer: Dan Hardoon
Editor: Rami Tzabar, TellTale Studios
Commissioner & Producer: Emma Yousif
Music composition: Eleni Shaw
Camera Director and Video Editor: Bernardo Resende
Audio Engineer: Perry Rogantin
Video Studio Production: Nicholas Duke
Video Editor: Bilal Merhi
Video Production Design: James Barton
Visual Identity and Design: Eleanor Tomlinson

Commissioned by Google DeepMind
 