DeepMind's wet dream is M3-Agent's reality: how long-term multimodal memory is modelling the real world

About this audio

Google DeepMind's Demis Hassabis and his team have a bold mission: building AI that can play the 4D chess game of engaging with our ever-changing biological and physical world.

Taking a snapshot is one thing. Remembering the world's structures and their constantly changing states is what truly separates fact from fiction.

It seemed like an impossible target to hit. Then M3-Agent, the work of researchers from ByteDance and Shanghai Jiao Tong University, showed up with long-term multimodal memory, allowing an agent to see, hear, remember, and reason much like humans do.

M3-Agent's potential is groundbreaking.

Here are just three use cases that will blow all our minds:

  • Autonomous robotics: Robots in homes or warehouses remember object locations, user habits, and past errors, adapting tasks dynamically, such as a caregiver bot recalling a patient's routines for personalised aid.
  • Enhanced surveillance: Security systems analyse live video/audio feeds, building memory of normal patterns to detect anomalies, predict threats, and reason through scenarios, like identifying intruders based on historical behaviours.
  • Personalised education: AI tutors process student interaction videos, remember progress and misconceptions over time, and deliver tailored lessons, such as adapting maths explanations from weeks of observed struggles.
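All three use cases rest on the same idea: observations from multiple modalities accumulate into a memory the agent can later query. As a toy illustration only (the class and method names below are hypothetical, and M3-Agent's actual system is far richer, using multimodal models to build and reason over its memory), a minimal episodic-plus-semantic store might look like this:

```python
from dataclasses import dataclass, field

@dataclass
class MemoryEntry:
    """One observation extracted from a video or audio stream."""
    timestamp: int
    modality: str   # e.g. "video" or "audio"
    entity: str     # who or what the observation is about
    content: str    # textual summary of what was observed

@dataclass
class LongTermMemory:
    # Episodic memory: a time-ordered log of raw observations.
    episodic: list = field(default_factory=list)
    # Semantic memory: facts accumulated per entity across observations.
    semantic: dict = field(default_factory=dict)

    def remember(self, entry: MemoryEntry) -> None:
        """Store an observation in both memory views."""
        self.episodic.append(entry)
        self.semantic.setdefault(entry.entity, []).append(entry.content)

    def recall(self, entity: str) -> list:
        """Retrieve everything remembered about an entity."""
        return self.semantic.get(entity, [])

# A caregiver bot accumulating knowledge about a patient over time:
mem = LongTermMemory()
mem.remember(MemoryEntry(1, "video", "patient", "takes medication at 8am"))
mem.remember(MemoryEntry(2, "audio", "patient", "prefers tea over coffee"))
print(mem.recall("patient"))
# → ['takes medication at 8am', 'prefers tea over coffee']
```

The point of the sketch is the separation of a time-stamped episodic log from an entity-keyed semantic view: the first preserves what happened and when, the second makes recall about a specific person or object cheap, which is what lets the agent personalise its behaviour weeks later.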


Read the paper: Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory.