
DeepSeek (and before it became DeepSeek)



About this audio

DeepSeek is the Chinese large language model (LLM) that stunned the AI world with its low training costs and open-weight approach. This episode dives into the extraordinary founder story of Liang Wenfeng, the secretive quant trader who pivoted his billion-dollar firm, High-Flyer Capital, into a top AI competitor.

How did a quantitative finance empire become the birthplace of DeepSeek R1 and V3? We unpack the innovations, the GPU hoarding strategy, the DeepSeek architecture, and the controversial pivot that positions the company as a serious challenger to ChatGPT and Llama in the global AGI race.

👉 Subscribe & Follow: ASCENT Podcast on Substack

📖 Episode Chapters

00:00:00 Intro

00:04:23 Liang Wenfeng's early life and education

00:08:16 Inception of the quant trading journey

00:18:47 Becoming quant king: building a billion-dollar empire

00:31:00 The hoarding (of GPUs) begins

00:39:53 Liang Wenfeng's vision for China's quant future

00:46:48 The 2021 challenge and fund drawdown

00:50:29 The pivot: from trading to AGI

00:57:17 Innovation under constraint

01:08:31 DeepSeek's unconventional hiring philosophy

01:12:35 Future uncertain: can DeepSeek outlast the giants?

01:20:50 Ascent with open source

Correction: at 59:24, the figure should be 600 billion parameters, not 6 billion.

P.S. Yes, these show notes were also generated by DeepSeek!
