『Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys — 2026-05-24』のカバーアート

Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys — 2026-05-24

Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys — 2026-05-24

無料で聴く

ポッドキャストの詳細を見る
## Short Segments NVIDIA's Gated DeltaNet-2 introduces a new linear attention layer that decouples erase and write operations, enhancing memory management in AI models. Today, we'll explore how this innovation improves performance and what it means for developers. Later, we'll dive into Microsoft's Webwright, a terminal-native web agent framework that significantly boosts task performance. But first, let's break down NVIDIA's latest release. NVIDIA AI has unveiled Gated DeltaNet-2, a linear attention layer that separates erase and write operations in the Delta Rule, addressing a key bottleneck in memory management. This model, trained on 100 billion FineWeb-Edu tokens, outperforms its predecessors like Mamba-2 and Gated DeltaNet across various benchmarks. By decoupling the active memory edit into two channel-wise gates, Gated DeltaNet-2 allows for more precise control over memory updates, enhancing both speed and efficiency. This development is particularly significant for developers working with large-scale AI models, as it offers a more efficient way to manage memory without compromising on performance. The practical consequence is a more streamlined process for handling complex data sets, making it easier to implement advanced AI solutions in real-world applications. ## Feature Story Microsoft Research's Webwright framework redefines web automation by using a terminal-native approach, significantly improving task performance. Unlike traditional web agents that operate one action at a time, Webwright allows agents to write and refine Playwright code, offering a more flexible and efficient method for web interactions. This shift from a stateful browser session to a terminal environment enables agents to launch, inspect, and discard browsers while focusing on code and logs in the local workspace. This approach mirrors how developers create Robotic Process Automation scripts, allowing for reusable and adaptable solutions. Webwright's architecture consists of three core components: a Runner, a Model Endpoint, and a terminal Environment, totaling just over a thousand lines of code. This simplicity and efficiency make it accessible for developers looking to integrate AI-driven web automation into their workflows. The framework's ability to score 60.1% on the Odysseys benchmark, a significant improvement from the base GPT-5.4's 33.5%, highlights its potential to transform how web tasks are automated. For developers, this means a more robust toolset for creating and deploying web agents, ultimately leading to faster and more reliable automation solutions. As AI continues to evolve, frameworks like Webwright will play a crucial role in bridging the gap between AI capabilities and practical applications, offering new possibilities for innovation and efficiency in web-based tasks.
adbl_web_anon_alc_button_suppression_c
まだレビューはありません