『Last Week in AI』のカバーアート

Last Week in AI

Last Week in AI

著者: Skynet Today
無料で聴く

このコンテンツについて

Weekly summaries of the AI news that matters!Copyright 2024 All rights reserved. 政治・政府
エピソード
  • #213 - Midjourney video, Gemini 2.5 Flash-Lite, LiveCodeBench Pro
    2025/06/26

    Our 213nd episode with a summary and discussion of last week's big AI news! Recorded on 06/21/2025

    Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/.

    In this episode:

    • Midjourney launches its first AI video generation model, moving from text-to-image to video with a subscription model offering up to 21-second clips, highlighting the affordability and growing capabilities in AI video generation.
    • Google's Gemini AI family updates include high-efficiency models for cost-effective workloads, and new enhancements in Google's search function now allow for voice interactions.
    • The introduction of two new benchmarks, Live Code Bench Pro and Abstention Bench, aiming to test and improve the problem-solving and abstention capabilities of reasoning models, revealing current limitations.
    • OpenAI wins a $200 million US defense contract to support various aspects of the Department of Defense, reflecting growing collaborations between tech companies and government for AI applications.

    Timestamps + Links:

    • (00:00:10) Intro / Banter
    • (00:01:32) News Preview
    • Tools & Apps
    • (00:02:12) Midjourney launches its first AI video generation model, V1
    • (00:05:52) Google’s Gemini AI family updated with stable 2.5 Pro, super-efficient 2.5 Flash-Lite
    • (00:07:59) Google’s AI Mode can now have back-and-forth voice conversations
    • (00:10:13) YouTube to Add Google’s Veo 3 to Shorts in Move That Could Turbocharge AI on the Video Platform
    • Applications & Business
    • (00:11:10) The ‘OpenAI Files’ will help you understand how Sam Altman’s company works
    • (00:12:29) OpenAI drops Scale AI as a data provider following Meta deal
    • (00:13:28) Amazon’s Zoox opens its first major robotaxi production facility
    • Projects & Open Source
    • (00:15:20) LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?
    • (00:19:45) AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions
    • (00:22:49) MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
    • Research & Advancements
    • (00:24:33) Scaling Laws of Motion Forecasting and Planning -- A Technical Report
    • Policy & Safety
    • (00:28:07) Universal Jailbreak Suffixes Are Strong Attention Hijackers
    • (00:30:52) OpenAI found features in AI models that correspond to different ‘personas’
    • (00:33:25) OpenAI wins $200 million U.S. defense contract
    続きを読む 一部表示
    37 分
  • #212 - o3 pro, Cursor 1.0, ProRL, Midjourney Sued
    2025/06/17
    Our 212th episode with a summary and discussion of last week's big AI news! Recorded on 06/33/2025 Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/. In this episode: OpenAI introduces O3 PRO for ChatGPT, highlighting significant improvements in performance and cost-efficiency.Anthropic sees an influx of talent from OpenAI and DeepMind, with significantly higher retention rates and competitive advantages in AI capabilities.New research indicates that reinforcing negative responses in LLMs significantly improves performance across all metrics, highlighting novel approaches in reinforcement learning.A security flaw in Microsoft Copilot demonstrates the growing risk of AI agents being hacked, emphasizing the need for robust protection against zero-click attacks. Timestamps + Links: (00:00:11) Intro / Banter(00:01:31) News Preview(00:02:46) Response to Listener ReviewsTools & Apps(00:04:48) OpenAI adds o3 Pro to ChatGPT and drops o3 price by 80 per cent, but open-source AI is delayed(00:09:10) Cursor AI editor hits 1.0 milestone, including BugBot and high-risk background agents(00:13:07) Mistral releases a pair of AI reasoning models(00:16:18) Elevenlabs' Eleven v3 lets AI voices whisper, laugh and express emotions naturally(00:19:00) ByteDance's Seedance 1.0 is trading blows with Google's Veo 3(00:22:42) Google Reveals $20 AI Pro Plan With Veo 3 Fast Video Generator For Budget Creators Applications & Business(00:25:42) OpenAI and DeepMind are losing engineers to Anthropic in a one-sided talent war(00:34:32) OpenAI slams court order to save all ChatGPT logs, including deleted chats(00:37:24) Nvidia’s Biggest Chinese Rival Huawei Struggles to Win at Home(00:43:06) Huawei Expected to Break Semiconductor Barriers with Development of High-End 3nm GAA Chips; Tape-Out by 2026(00:45:21) TSMC’s 1.4nm Process, Also Called Angstrom, Will Make Even The Most Lucrative Clients Think Twice When Placing Orders, With An Estimate Claiming That Each Wafer Will Cost $45,000(00:47:43) Mistral AI Launches Mistral Compute To Replace Cloud Providers from US, China Projects & Open Source(00:51:26) ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Research & Advancements(00:57:27) Kinetics: Rethinking Test-Time Scaling Laws(01:05:12) The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning(01:10:45) Predicting Empirical AI Research Outcomes with Language Models(01:15:02) EXP-Bench: Can AI Conduct AI Research Experiments? Policy & Safety(01:20:07) Large Language Models Often Know When They Are Being Evaluated(01:24:56) Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence(01:31:16) Exclusive: New Microsoft Copilot flaw signals broader risk of AI agents being hacked—‘I would be terrified’(01:35:01) Claude Gov Models for U.S. National Security Customers Synthetic Media & Art(01:37:32) Disney And NBCUniversal Sue AI Company Midjourney For Copyright Infringement(01:40:39) AMC Networks is teaming up with AI company Runway
    続きを読む 一部表示
    1 時間 46 分
  • #211 - Claude Voice, Flux Kontext, wrong RL research?
    2025/06/03
    Our 211th episode with a summary and discussion of last week's big AI news! Recorded on 05/31/2025 Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/. Join our Discord here! https://discord.gg/nTyezGSKwP In this episode: Recent AI podcast covers significant AI news: startups, new tools, applications, investments in hardware, and research advancements.Discussions include the introduction of various new tools and applications such as Flux's new image generating models and Perplexity's new spreadsheet and dashboard functionalities.A notable segment focuses on OpenAI's partnership with the UAE and discussions on potential legislation aiming to prevent states from regulating AI for a decade.Concerns around model behaviors and safety are discussed, highlighting incidents like Claude Opus 4's blackmail attempt and Palisade Research's tests showing AI models bypassing shutdown commands. Timestamps + Links: (00:00:10) Intro / Banter(00:01:39) News Preview(00:02:50) Response to Listener Comments Tools & Apps (00:07:10) Anthropic launches a voice mode for Claude(00:10:35) Black Forest Labs’ Kontext AI models can edit pics as well as generate them(00:15:30) Perplexity’s new tool can generate spreadsheets, dashboards, and more(00:18:43) xAI to pay Telegram $300M to integrate Grok into the chat app(00:22:42) Opera’s new AI browser promises to write code while you sleep(00:24:17) Google Photos debuts redesigned editor with new AI tools Applications & Business (00:25:13) Top Chinese memory maker expected to abandon DDR4 manufacturing at the behest of Beijing(00:30:04) Oracle to Buy $40 Billion Worth of Nvidia Chips for First Stargate Data Center(00:31:47) UAE makes ChatGPT Plus subscription free for all residents as part of deal with OpenAI(00:35:34) NVIDIA Corporation (NVDA) to Launch Cheaper Blackwell AI Chip for China, Says Report(00:38:39) The New York Times and Amazon ink AI licensing deal Projects & Open Source (00:41:11) DeepSeek’s distilled new R1 AI model can run on a single GPU(00:45:19) Google Unveils SignGemma, an AI Model That Can Translate Sign Language Into Spoken Text(00:47:08) Open-sourcing circuit tracing tools(00:49:42) Hugging Face unveils two new humanoid robots Research & Advancements (00:52:33) PANGU PRO MOE: MIXTURE OF GROUPED EXPERTS FOR EFFICIENT SPARSITY(00:58:55) DataRater: Meta-Learned Dataset Curation(01:05:05) Incorrect Baseline Evaluations Call into Question Recent LLM-RL Claims (01:10:17) Maximizing Confidence Alone Improves Reasoning(01:11:00) Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence(01:11:44) One RL to See Them All(01:15:05) Efficient Reinforcement Finetuning via Adaptive Curriculum Learning Policy & Safety (01:17:58) Trump's 'Big Beautiful Bill' could ban states from regulating AI for a decade(01:24:31) Researchers claim ChatGPT o3 bypassed shutdown in controlled test(01:30:10) Anthropic’s new AI model turns to blackmail when engineers try to take it offline(01:31:09) Anthropic Faces Backlash As Claude 4 Opus Can Autonomously Alert Authorities(01:35:37) Claude helps users make bioweapons(01:35:49) The Claude 4 System Card is a Wild Read
    続きを読む 一部表示
    1 時間 38 分

Last Week in AIに寄せられたリスナーの声

カスタマーレビュー:以下のタブを選択することで、他のサイトのレビューをご覧になれます。