エピソード

  • The Illusion of Thinking: Do AI Models Really Reason?
    2025/06/28

    It looks incredibly impressive when a large language model explains its step-by-step thought process, giving us a window into its "mind." But what if that visible reasoning is a sophisticated illusion? This episode dives deep into a groundbreaking study on the new generation of "Large Reasoning Models" (LRMs)—AIs specifically designed to show their work.

    We explore the surprising and counterintuitive findings that challenge our assumptions about machine intelligence. Discover the three distinct performance regimes where these models can "overthink" simple problems, shine on moderately complex tasks, and then experience a complete "performance collapse" when things get too hard. We'll discuss the most shocking discoveries: why models paradoxically reduce their effort when problems get harder, and why their performance doesn't improve even when they're given the exact algorithm to solve a puzzle. Is AI's reasoning ability just advanced pattern matching, or are we on the path to true artificial thought?

    Reference:
    This discussion is based on the findings from the Apple Machine Learning Research paper, "The Illusion of Thinking: Understanding the Strengths and Limitations of Large Language Models with Pyramids of Thought."
    https://machinelearning.apple.com/research/illusion-of-thinking

    続きを読む 一部表示
    14 分
  • Charting the Course for Safe Superintelligence
    2025/05/10

    What happens when AI becomes vastly smarter than humans? It sounds like science fiction, but researchers are grappling with the very real challenge of ensuring Artificial General Intelligence (AGI) is safe for humanity. Join us for a deep dive into the cutting edge of AI safety research, unpacking the technical hurdles and potential solutions. We explore the core risks – from intentional misalignment and misuse to unintentional mistakes – and the crucial assumptions guiding current research, like the pace of AI progress and the "approximate continuity" of its development. Learn about the key strategies being developed, including safer design patterns, robust control measures, and the concept of "informed oversight," as we navigate the complex balance between harnessing AGI's immense potential benefits and mitigating its profound risks.


    An Approach to Technical AGI Safety and

    Security: https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/evaluating-potential-cybersecurity-threats-of-advanced-ai/An_Approach_to_Technical_AGI_Safety_Apr_2025.pdf


    Google Deepmind AGI Safety Course: https://youtube.com/playlist?list=PLw9kjlF6lD5UqaZvMTbhJB8sV-yuXu5eW

    続きを読む 一部表示
    29 分
  • Algorithms for Artificial Intelligence: Understanding the Building Blocks
    2025/04/26

    Ever tried to understand how AI actually learns, only to get lost in a sea of equations and jargon? This episode is your fast track through the fundamentals of machine learning, breaking down complex concepts into understandable nuggets.

    Drawing inspiration from Stanford course materials, we ditch the dense textbook approach and offer a clear, conversational deep dive into the core mechanics of AI learning. Join us as we explore:

      • Linear Predictors: The versatile workhorses of early ML, from classifying spam to predicting prices.

      • Feature Extraction: The art of turning raw data (like an email) into numbers the algorithm can understand.

      • Weights & Scores: How AI weighs different information (like ingredients in a recipe) to make a prediction using the dot product.

      • Loss Minimization & Margin: How do we measure when AI gets it wrong, and how does it use that feedback (like the concept of 'margin') to improve?

      • Optimization Powerhouses: Unpacking Gradient Descent and its faster cousin, Stochastic Gradient Descent (SGD) – the engines that drive the learning process.

    Whether you're curious about AI or need a refresher on the basics, this episode provides a solid foundation, explaining how machines learn without needing an advanced degree. Get ready to understand the building blocks of artificial intelligence!

    Stanford's Algorithms for Artificial Intelligence: https://web.stanford.edu/~mossr/pdf/alg4ai.pdf

    続きを読む 一部表示
    25 分
  • C: The Bedrock of Modern Tech
    2025/04/12

    It's over 50 years old, yet the C programming language remains a fundamental building block beneath countless technologies we use daily, from operating systems to embedded devices. Join us as we dive deep into the core principles of C, guided by the classic Kernighan & Ritchie text. We explore its history, key syntax, control flow, functions, and the low-level power that explains why C still matters profoundly today.


    The C Programming Language (2nd Edition): http://cslabcms.nju.edu.cn/problem_solving/images/c/cc/The_C_Programming_Language_%282nd_Edition_Ritchie_Kernighan%29.pdf

    続きを読む 一部表示
    27 分
  • The Hidden Value of Open Source: Beyond Free Code
    2025/03/29

    Open source software powers our lives, from our phones to our fridges. We uncover its enormous economic impact, revealing how much businesses save (trillions of dollars!), the importance of different programming languages, and how to ensure its sustainable future.


    The Value of Open Source Software: https://www.hbs.edu/faculty/Pages/item.aspx?num=65230

    続きを読む 一部表示
    24 分
  • AI Agents: From Smart Homes to Deep Learning
    2025/03/15

    Ever wondered how AI is transforming our lives, from smart homes to the stock market? In this episode, we explore the fascinating world of AI agents – systems that can perceive, reason, and act to achieve goals. We'll unpack the core concepts from "AI Foundations of Computational Agents," delve into real-world examples like trading and tutoring agents, and discuss the blend of neural networks and logical reasoning that powers them. Get ready to understand the building blocks of AI's future!

    Artificial Intelligence: Foundations of Computational Agents (3rd Edition): https://mrce.in/ebooks/AI%20Foundations%20of%20Computational%20Agents%203rd%20Ed.pdf

    続きを読む 一部表示
    31 分
  • Decoding Databases: Indexes, Tradeoffs, and the Quest for Speed
    2025/03/01

    Unlock the secrets behind lightning-fast data retrieval! We explore the inner workings of database storage engines, from simple bash scripts to complex B-trees. Discover how indexes work, the trade-offs between speed and storage space, and how data warehouses organize vast amounts of historical data.

    Designing Data-Intensive Applications (free chapters): https://www.scylladb.com/wp-content/uploads/ScyllaDB-Designing-Data-Intensive-Applications.pdf

    DDIA book: https://dataintensive.net

    続きを読む 一部表示
    39 分
  • LLMs: Magic or Math? A Deep Dive into Language Models
    2025/02/15

    Large Language Models are everywhere, but how do they actually work? Join us as we unravel the science and engineering behind these powerful AIs that can write, translate, and even create. From pre-training to fine-tuning, we explore how they learn and what it means for the future of tech.


    Foundations of Large Language Models: https://arxiv.org/pdf/2501.09223

    続きを読む 一部表示
    15 分