Hi, this is Wayne again with a topic “Google Hints at New Google Glasses with Project Astra”.
End up powering a new set of transformative experiences. Today we have some exciting new progress to share about the future of AI assistance that we’re calling project Astra for a long time. We’Ve wanted to build a universal AI agent that can be truly helpful in everyday life. Our work, making this Vision a reality goes back many years.
It’S why we made Gemini multimodal from the very beginning an agent like this, has to understand and respond to our complex and dynamic world. Just like we do. It would need to take in and remember what it sees so it can understand context and take action and it would have to be proactive, teachable and personal, so you can talk to it, naturally without lag or delay. While we’ve made some great strides in developing AI systems that can understand multimodal information getting response time down to something, conversational is a difficult engineering challenge. Building on our Gemini model, we’ve developed agents that can process information Faster by continuously encoding video frames.
Combining the video and speech input into a timeline of events and caching, this for efficient, recall, we’ve also enhanced how they sound with a wider range of intonations. These agents better understand the context you’re in and can respond quickly in conversation, making the pace and quality of interaction feel much more natural. Here’S here, a video of our prototype, which we’ll see has two parts each part was captured in a single take in real time. Okay, let’s do some tests tell me when you see something that makes sound.
I see a speaker which makes sound. What is that part of the speaker called that is the Tweeter it produces. High frequency sounds give me a creative alliteration about these creative crayons color cheerfully. They certainly craft colorful Creations. What does that part of the code? Do this code defines encryption and decryption functions? It seems to use aebc encryption to en code and decode data based on a key and an initialization Vector IV. That’S right! What neighborhood do you think I’m in this appears to be the king’s cross area of London? It is known for its railway station and transportation connections. Do you remember where you saw my glasses? Yes, I do. Your glasses were on the desk near a red, [ Applause.
] apple, what can I add here to make this system faster? Adding a cache between the server and database could improve speed. What does this remind you of shring cat? All right, uh give me a band name for this Duo: golden Stripes nice thanks, Gemini, [ Applause, ], .