What if your LLM firewall could learn which safety system to trust—on the fly?
In this episode, we dive deep into the evolving landscape of content moderation for large language models (LLMs), exploring five competing paradigms built for scale. From the principle-driven structure of Constitutional AI to OpenAI’s GPT-4-based real-time moderation, and from open-source tools like Llama Guard to Salesforce’s BingoGuard, we unpack the strengths, trade-offs, and deployment realities of today’s AI safety stack. At the center of it all is AEGIS, a new architecture that blends modular fine-tuning with real-time routing driven by regret minimization, an approach that may redefine how we handle moderation in dynamic environments.
Whether you're building AI-native products, managing risk in enterprise applications, or simply curious about how moderation frameworks work under the hood, this episode provides a practical and technical walkthrough of where we’ve been—and where we're headed.
- 🧠 What makes Constitutional AI a scalable alternative to human feedback in RLHF, and how it bootstraps safety through model self-critique and revision.
- ⚙️ Why OpenAI’s GPT-4-based moderation offers real-time, inference-level control through custom policy rubrics, and how it trades nuance for flexibility.
- 🧩 How Llama Guard laid the groundwork for open-source LLM safeguards with safe/unsafe classification over a customizable risk taxonomy.
- 🧪 What “Watch Your Language” reveals about human+AI hybrid moderation systems in real-world settings like Reddit.
- 🛡️ Why BingoGuard defines a severity taxonomy across 11 high-risk topics and 7 content dimensions, and how it uses synthetic data to train severity-aware moderators.
- 🚀 How AEGIS uses regret minimization over an ensemble of LoRA-finetuned experts to route moderation requests dynamically, with no retraining required (see the sketch after this list).
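To make the routing idea concrete, here is a minimal, hypothetical sketch of regret-minimization routing in the spirit of AEGIS: a Hedge-style (exponential-weights) router over a pool of moderation experts. The `HedgeRouter` class, the `eta` parameter, and the stand-in keyword experts are illustrative assumptions, not AEGIS’s actual implementation; in a real deployment each callable would wrap a LoRA-finetuned safety model.

```python
import math
import random

class HedgeRouter:
    """Exponential-weights (Hedge) routing over a pool of moderation experts."""

    def __init__(self, experts, eta=0.5):
        self.experts = experts              # callables: text -> "safe" | "unsafe"
        self.eta = eta                      # learning rate for multiplicative updates
        self.weights = [1.0] * len(experts) # one adaptive weight per expert

    def route(self, text):
        """Sample an expert with probability proportional to its weight; return its verdict."""
        idx = random.choices(range(len(self.experts)), weights=self.weights, k=1)[0]
        return idx, self.experts[idx](text)

    def update(self, text, feedback_label):
        """Shrink the weight of every expert that disagrees with observed feedback (0/1 loss)."""
        for i, expert in enumerate(self.experts):
            loss = 0.0 if expert(text) == feedback_label else 1.0
            self.weights[i] *= math.exp(-self.eta * loss)

# Stand-in experts for illustration only; real experts would be LoRA-finetuned
# safety models exposed behind the same text -> verdict interface.
experts = [
    lambda text: "unsafe" if "attack" in text.lower() else "safe",
    lambda text: "unsafe" if len(text) > 500 else "safe",
]

router = HedgeRouter(experts)
expert_id, verdict = router.route("How do I attack this problem?")
router.update("How do I attack this problem?", feedback_label="safe")
print(expert_id, verdict, router.weights)
```

Because only the weights change in response to feedback, the ensemble can adapt online to shifting traffic without retraining any individual expert, which is the property the episode highlights.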
If you care about AI alignment, content safety, or building LLMs that operate reliably at scale, this episode is packed with frameworks, takeaways, and architectural insights.
Prefer a visual version? Watch the illustrated breakdown on YouTube here:
https://youtu.be/ffvehOz2h2I
👉 Follow Machine Learning Made Simple to stay ahead of the curve. Share this episode with your team or explore our back catalog for more on AI tooling, agent orchestration, and LLM infrastructure.
References:
[2212.08073] Constitutional AI: Harmlessness from AI Feedback
Using GPT-4 for content moderation | OpenAI
[2309.14517] Watch Your Language: Investigating Content Moderation with Large Language Models
[2312.06674] Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
[2404.05993] AEGIS: Online Adaptive AI Content Safety Moderation with Ensemble of LLM Experts
[2503.06550] BingoGuard: LLM Content Moderation Tools with Risk Levels