『Snacks Weekly on Data Science』のカバーアート

Snacks Weekly on Data Science

Snacks Weekly on Data Science

著者: Pan Wu
無料で聴く

今ならプレミアムプランが3カ月 月額99円

2026年5月12日まで。4か月目以降は月額1,500円で自動更新します。

概要

This podcast is about making data science and machine learning knowledge accessible and less intimidating. Every week, I will handpick one selected industrial tech blog to break it down. We will discuss some key data science concepts and machine learning algorithms, and how they are applied in those real-world applications. Subscribe to the channel and enjoy Snacks Weekly on Data Science!Pan Wu
エピソード
  • Customized AI System for Subtitle Translation [Vimeo]
    2026/04/20

    In this episode, we explore how Vimeo built a customized AI system for subtitle translation—one that goes beyond basic text translation to tackle the much more challenging problem of synchronizing language with timing. We discuss how the team designed a split-brain architecture to separate translation quality from timing constraints, and how they implemented fallback mechanisms to ensure the system remains reliable in real-world scenarios.

    For more details, you can refer to their published tech blog, linked here for your reference: https://medium.com/vimeo-engineering-blog/how-we-built-ai-powered-subtitles-at-vimeo-ff11f1d64b2a

    続きを読む 一部表示
    9 分
  • Scaling Unit Test Coverage with AI Tools [NYTimes]
    2026/04/13

    In this episode, we explore how the New York Times engineering team used AI agents to scale unit test coverage across their News site. They accomplished this by building a custom coverage measurement tool, designing a two-loop human–AI workflow, and investing heavily in prompt engineering, including strict guardrails to prevent the agent from cheating or drifting. The key takeaway is that AI works best when it is tightly constrained, carefully monitored, and used to amplify human judgment.
    For more details, you can refer to their published tech blog, linked here for your reference: https://open.nytimes.com/how-the-new-york-times-is-scaling-unit-test-coverage-using-ai-tools-fa796bf9b8d2

    続きを読む 一部表示
    9 分
  • Product classification evolution [Shopify]
    2026/04/06

    In this episode, we explore how Shopify evolved its product classification system across three major stages: from a traditional logistic regression model with TF-IDF features, to a multi-modal approach combining text and images, and finally to Vision Language Models built on top of a standardized and evolving product taxonomy. We also look at how architectural design and inference optimization are just as important as model accuracy in real-world machine learning systems.
    For more details, you can refer to their published tech blog, linked here for your reference: https://shopify.engineering/evolution-product-classification

    続きを読む 一部表示
    8 分
まだレビューはありません