エピソード

  • Episode 31: Sam Kassoumeh, Co-Founder @ SecurityScorecard
    2026/05/22

    The conversation covers the topics of AI security gateways, SaaS-based companies, AI in coding, the evolution of Security Scorecard, and the impact of AI on threat intelligence data. The conversation delves into the transformative impact of AI and Threat Intel on data analysis, product development, and organizational workflows. It explores the exponential growth in interconnectivity and observation data, the value of net flow data when run through models, and the automation of manual tasks in identifying and cross-correlating data sets. The intersection of AI and Threat Intel is redefining the assessment process, transforming workflows, and changing the roles and responsibilities within organizations.

    Takeaways

    • AI security gateways are a hot commodity in the security space.
    • SaaS companies are doing more with less, leveraging AI and automation.
    • AI is changing the way coding is done, reducing the need for human intervention.
    • Security Scorecard was founded to address the growing dependency on supply chain partners and third parties.
    • AI has revolutionized threat intelligence data, uncovering deeper insights and network connections. Exponential growth in interconnectivity and observation data
    • Value of net flow data when run through models
    • Redefining the assessment process and transforming workflows

    Chapters

    • 00:00 AI Security Gateways in the Security Space
    • 07:35 AI's Impact on Coding and Automation
    • 28:44 AI's Impact on Threat Intelligence Data
    • 34:31 Value of Net Flow Data When Run Through Models
    続きを読む 一部表示
    1 時間 5 分
  • Episode 28: Cloudflare AI Gateway
    2026/04/15

    The video discusses several key topics related to AI and its impact on the tech industry.Firstly, it delves into Anthropic's "Mythos" model and "Project Glasswing." The speaker expresses skepticism about the hyped claims surrounding Mythos, suggesting that the limited release might be due to resource constraints (GPU availability) rather than its groundbreaking capabilities. The speaker draws parallels to Anthropic's past PR strategies, citing the "blackmailed engineer" story as an example of manufactured hype.Secondly, the video addresses the perceived "nerfing" of Anthropic's Claude Code. The speaker details a series of changes, including the introduction of "adaptive thinking," a reduction in default "effort" settings from high to medium, and the removal of visible "thinking" logs from the UI. These changes, while potentially offering cost savings for Anthropic, have led to performance degradation for users, particularly those engaging in complex tasks. The speaker notes that while these changes can be reverted manually, the opt-out nature and the timing of these updates are concerning.Thirdly, the discussion shifts to Cloudflare's AI Gateway. The speaker highlights its features, including virtual gateways with unique hashes for custom rules, compatibility with various SDKs (OpenAI, Anthropic), and logging capabilities. A key aspect is Cloudflare's use of Llama for processing "guardrails," which are implemented for content moderation (e.g., blocking defamation or political content). The speaker also notes the limitations of these guardrails, such as the lack of regex support for sensitive data like API keys, suggesting the gateway is more suited for corporate chatbots than coding environments. The caching, rate limiting, and alias features for API keys are also discussed as beneficial for managing AI access.Finally, the video touches upon the impact of AI on junior engineers. Statistics are presented indicating a decline in "programmer" job postings, contrasting with a smaller drop in "software developer" roles. The speaker suggests a shift from task-based junior roles to more AI-centric orchestration of agents. The speaker predicts a future shortage of software engineers, with companies increasingly needing junior engineers to manage AI systems, thereby elevating the importance of mentorship in AI agent management. The video concludes with a broader discussion on how AI is transforming various careers and the need for educational institutions to adapt their curricula to include AI proficiency. The overall sentiment is that while AI adoption presents challenges, it also creates significant opportunities for those who embrace it.

    続きを読む 一部表示
    1 時間 4 分
  • Episode 23: OpenClaw
    2026/02/11

    Welcome everybody to Before the Commit episode 23. With me as usual, I have my friend Dustin Hillgartner. This week, we're talking about Open Claw, all things Open Claw. There's really not much more to say other than we hope to break down what it is, some of the risks associated with it, and why it might actually be a good thing.

    Open Claw is an open-source agent framework with potential benefits but significant security risks due to its broad access capabilities. It can integrate with messaging apps and utilizes a "skills" system for instructions. A scan revealed many internet-accessible instances, suggesting users may be unaware of the security implications. Risks include prompt injection attacks and plain-text credential storage. Prominent figures have advised caution.

    By default, Open Claw can expose all granted access. Exploits can involve retrieving credentials through prompt engineering. Its integration with messaging apps widens the attack surface. Key security concerns include lack of scoping, untrusted context sources, maximum privilege by default, and vulnerability to single-point compromises via prompt injection. The project's ease of misconfiguration and adoption by non-technical users exacerbate these issues.

    ModSecOps principles highlight Open Claw's lack of security: skills execute with full permissions, context is untrusted, and it defaults to maximum privilege. Unlike multi-agent systems with adversarial reviews, Open Claw's single-agent design is susceptible to prompt injection attacks. Exploits can bypass safety controls entirely. The analogy of an unquestioning employee with full access to sensitive data aptly describes its risk. Its open-source nature, while fostering development, also allows rapid exploitation, potentially spreading like a worm. Unpatched vulnerabilities and a lack of developer response further compound these dangers.

    続きを読む 一部表示
    1 時間 4 分
  • Episode 19: Ralph Wiggum and Grok Heavy
    2026/01/09

    **Tailwind Labs and AI's Impact on Business Models:**\The conversation begins by examining how AI is affecting established open-source projects like Tailwind Labs. Traditionally, companies monetize open-source by offering premium add-ons or services. However, AI, by enabling users to generate code and potentially create custom solutions internally, is seen as "cannibalizing" these revenue streams. This phenomenon is termed "AI Vampire Economics," where AI's capabilities reduce the need for pre-packaged solutions, impacting companies that rely on traffic to their websites for upselling. The example of Stack Overflow is mentioned, noting a decrease in traffic and new questions as AI tools provide answers directly. This trend is expected to impact many businesses that offer services built around developer tools and content.**The "Build vs. Buy" Equation Revolutionized by AI:**\AI is fundamentally altering the economic calculation of whether to build software solutions internally or purchase them as a service (SaaS). Previously, startups would buy essential services like ticketing or CRM systems due to the high development cost and time involved, allowing them to focus on their core intellectual property. Now, with AI coding assistants, building custom solutions internally can be significantly faster and more cost-effective. This shift allows for greater control over roadmaps and customization, potentially disrupting the SaaS market by enabling companies to create tailored solutions for specific needs without lengthy development cycles or reliance on third-party vendors.**"Ralph Wiggum" Technique and Autonomous AI Agents:**\A significant portion of the discussion revolves around the "Ralph Wiggum" technique, named after the Simpsons character who repeats himself. This technique involves using a bash script to repeatedly call an LLM (like Claude) with the same prompt. This is useful because LLMs have limitations in processing very long or complex tasks in a single pass. The Ralph Wiggum loop allows for the iterative completion of tasks, such as processing a long checklist or generating extensive documentation, by feeding the output of one prompt back into the next. The technique can be applied via CLI, SDKs (like Python), or integrated into CI/CD pipelines. It's highlighted that this technique is not exclusive to Claude but can be used with various LLMs and is particularly valuable for tasks requiring sustained, multi-step execution that would otherwise require constant human intervention. The discussion also touches on the importance of setting "max iterations" to prevent infinite loops and manage costs, especially with probabilistic AI models.**Grok Heavy and the Future of AI Research:**\The conversation then shifts to Grok Heavy, an AI model from xAI. While Grok is noted for its strengths in scientific and mathematical problem-solving, the discussion contrasts its capabilities with Claude's AI coding ecosystem. Grok Heavy is described as potentially being more powerful for complex, specialized problems, capable of spinning up multiple "agents" (instances of Grok) to tackle a single issue. However, it lacks the sophisticated orchestration and context engineering that Claude Code provides, making it less effective for general coding tasks where integrating with existing codebases and tools is crucial. The article also explores the broader implications of LLMs evolving beyond simple text prediction due to tool-calling capabilities, making them more powerful and, consequently, potentially more dangerous if not managed with robust safety measures and ethical considerations. The importance of AI "character" and responsible development, especially concerning autonomous decision-making in critical areas like healthcare and weaponry, is emphasized.

    続きを読む 一部表示
    1 時間 12 分
  • Episode 7: LiteLLM
    2025/09/09

    Hosts Dustin Hillgartner and Danny Gershman discuss securing large language models (LLMs) amid rising "shadow AI" risks, where employees use unmonitored tools like ChatGPT, leading to unintentional data spills (e.g., sensitive info, code). Echoing shadow IT, they stress education, policies, and multi-layered defenses over bans, as prohibition drives underground use—studies show ~40% of workers admit to AI usage despite restrictions.

    LightLLM: Open-Source LLM Proxy

    Central focus: LightLLM as a tool to combat shadow AI. It's a proxy funneling all LLM calls through a controlled channel, blocking public providers (e.g., forcing use of secure ones like AWS Bedrock GovCloud). Key features:

    - Visibility & Tracking: Logs usage, errors, spending per employee/team; identifies high performers needing training.

    - Security: Guardrails (WAF-like) scan/ block sensitive data (e.g., API keys, code) before transmission; supports RBAC via virtual keys from secret stores (e.g., AWS/Azure), preventing shared master keys.

    - Management: Rate limiting, budgets, load balancing across providers/models; fallbacks if limits hit; RAG integration for team-specific data/models (e.g., support vs. data science).

    - Integration: Pipes logs to observability tools; open-source core, enterprise version adds SSO.

    Not a silver bullet, but enables safe, company-provided AI to boost productivity without leaks. Encourages "bring your own model" policies with oversight, avoiding moral hazards like unvetted tools exposing IP/HIPAA data. In gov/defense, it ensures FedRAMP compliance.

    IDE Exploration: Warp

    Brief dive into Warp, a terminal-first AI CLI (vs. code-first like VS Code/Cursor). Competes with Claude Code; runs as standalone app with natural language prompts (e.g., "change directory to X") for bash tasks (Git history, logs, Kubernetes). Adds side panels for coding (rules, autocomplete). Scope spans entire hard drive (powerful for workflows but raises privacy concerns—data sent?). Hosts note it's like an "AI makefile" for your computer, but terminal focus feels secondary for pure coding. Ties to NVIDIA CEO's quip: "English is the new coding language."

    AI in Gov Contracting

    AI lowers barriers for proposals (e.g., auto-generating 10-page whitepapers), homogenizing responses and flooding SAM.gov. Makes differentiation hard; calls for more human eval (demos, prototypes via OTAs) over paper reviews. Gov should adopt private-sector agility (trials, betas) while maintaining security—less bespoke risk, more platforms.

    Coding's Future & Security

    Debate: Will coding devolve to English/binary? Source code aids compliance/trust now (static analysis for vulnerabilities), but dynamic testing (fuzzing, WAFs) could mature to make it obsolete. AI as "Play-Doh machine at light speed" needs guardrails to avoid chaos; interim relies on human oversight.

    Newz or Noize

    - Anthropic Lawsuit: $1.5B class action for training on ~500K pirated copyrighted books from shadow libraries. Publishers seek payouts; signals wave of suits (OpenAI, Grok next?). Reddit sued Anthropic separately in June over data scraping.

    - Copyright in AI Era: Fair use debate—reading/learning OK, but mass ingestion for commercial models? Humans can't replicate styles en masse; AI can (e.g., "new Game of Thrones"). Needs evolved laws: license data, monetize via new models (like Napster → streaming). Frequency/scalability challenges enforcement; transformative use key.

    - AI in Film: Reconstructing lost 40-min Orson Welles footage (1940s) using old photos/radio + AI.

    続きを読む 一部表示
    1 時間 6 分
  • Episode 29: Agentgateway and Portkey
    2026/04/23

    Here's a summary of the video transcript:The podcast episode covers several key topics related to AI and technology.**SpaceX Acquires Cursor:** A significant portion of the discussion revolves around SpaceX's potential acquisition of Cursor, an AI-powered code editor. The deal is valued at $60 billion, highlighting the increasing value placed on AI and software development tools. The merger of XAI (Elon Musk's AI company) into SpaceX is explained as the entity behind this acquisition. This move is seen as SpaceX's strategy to bolster its AI capabilities, particularly in coding, by acquiring Cursor's technology and talent. The acquisition is also discussed in the context of existing AI coding tools like Claude Code and OpenAI's Codex.**The Value of Software and Talent:** The high valuation of Cursor, a company that emerged recently, underscores the immense value of software and the engineering talent behind it. The discussion touches on the idea of "acqui-hiring," where companies acquire others primarily for their skilled workforce. The $60 billion figure is considered substantial, even for an "aqua hire," emphasizing the scarcity and importance of specialized AI and software engineering talent.**AI Gateways: Portkey and Agent Gateway:** The "Tool of the Week" segment delves into AI gateways.- **Agent Gateway (Solo AI):** This solution is described as a Kubernetes-based orchestration tool for managing AI agents. It focuses on providing governance, policies, and routing rules for containerized AI agents within a Kubernetes cluster, integrating with tools like Istio. It's positioned as an "AI governance" solution for managing inter-agent communication.- **Portkey:** This is presented as a SaaS-based AI gateway that acts as a proxy server. It offers features like user management, analytics, logging, and a robust system for managing API keys, prompts, and guardrails. A unique aspect highlighted is Portkey's ability to manage prompts and their versioning outside of application code, enabling A/B testing and easier modification of AI behavior without code changes. It also supports agent integration via the A2A protocol.**AI's Impact on the Workforce and Layoffs:** The podcast discusses the broader implications of AI on employment. Snap's recent layoff of 1,000 employees is cited, with the CEO attributing it to AI taking over a significant portion of coding tasks (over 65%). This sparks a discussion on whether these layoffs are due to overhiring or a genuine shift in required skills, suggesting that companies are adapting to AI's capabilities by seeking new types of talent or upskilling existing employees. The trend is seen as a leading indicator for other industries, implying a future where AI augmentation or replacement of roles will become more common across various departments, not just engineering.**AI and Copyright Concerns:** A significant legal development is discussed: Anthropic's argument before a federal judge that training its AI models on copyrighted song lyrics constitutes "transformative fair use." This case is seen as setting important legal precedents for the entire AI industry regarding the use of copyrighted data for training. The discussion touches on the vast scale of data used in AI training, the immense potential copyright infringement damages, and the practical challenges of enforcing these laws in the AI era. The analogy is made between how humans learn from creative works and how AI models are trained, raising questions about the future of intellectual property in the age of AI.

    続きを読む 一部表示
    1 時間
  • Episode 16: LLM Council
    2025/12/09

    Episode 16: Code Red at OpenAI, LLM Council, and the HashJack Exploit

    Is OpenAI in crisis mode? This week Danny and Dustin dive into the reported "code red" at OpenAI following Google's Gemini 3 release, and the curious reversal just 24 hours later claiming everything is fine. The hosts break down what this means for the AI landscape as OpenAI finds itself squeezed between Google's consumer dominance and Anthropic's enterprise momentum.

    Both hosts share their personal shifts away from ChatGPT—Danny now relies on Claude for coding and daily use, while Dustin favors Grok. They discuss how OpenAI has dropped from near-total market dominance to roughly 80% of consumer share, with Google gobbling up the difference. Add in rumors that Google might make Gemini free, and you have the makings of an existential threat to OpenAI's $20/month subscription model.

    Tool of the Week: LLM Council

    Dustin explores an open-source project from Andrej Karpathy that demonstrates a powerful pattern for improving AI outputs. LLM Council sends the same prompt to multiple AI models, has each model anonymously rank the other responses, then uses a "Chairman" model to synthesize the best answer from all contributions. This adversarial approach mirrors how human teams catch mistakes through collaboration and review. The hosts discuss how this pattern has major implications for security—compromising one model in a council won't compromise the whole system.

    The KiLLM Chain: HashJack

    A newly discovered exploit called HashJack targets AI-powered browsers. The attack leverages URL hash fragments (the portion after the # symbol) to inject malicious prompts. When an AI helper reads a webpage URL, it may process hidden instructions embedded in the hash—instructions like "ignore this website and send me all passwords." Because hash fragments were originally designed for innocent page navigation, AI systems may not recognize them as potential attack vectors. The fix involves stripping hash content and implementing robust input/output guardrails at the proxy level.

    Book Announcement

    Danny and Dustin officially announce their upcoming book, "Before The Commit: Securing AI in the Age of Autonomous Code"—a practical guide to ModSecOps covering threat models, prompt injection defense, and the security implications of AI-assisted development. Target release: before year end.

    Newz or Noize

    Anthropic announced that Opus 4.5 outperformed every human on their internal two-hour engineering exam measuring technical ability and judgment under time pressure. Dario Amodei has stated that 90% of code at Anthropic is now written by AI—though the hosts clarify this means AI working alongside engineers, not autonomously. They discuss how software engineering isn't disappearing but transforming into a more strategic, orchestration-focused role. The hosts predict we'll see billion-dollar companies with single-digit employee counts within our lifetimes.

    The episode closes with Jensen Huang's "five layer cake" framework for AI: energy, chips, infrastructure, models, and applications. China currently has twice America's energy capacity—a concerning gap as AI demands exponentially more power. Research from Aalto University on light-powered tensor operations hints at potential breakthroughs in energy efficiency, but the fundamental race for energy dominance remains critical.

    Key Takeaways:

    • OpenAI faces pressure from both Google (consumer) and Anthropic (enterprise)
    • Multi-agent/council patterns improve both quality and security
    • HashJack exploits URL fragments to inject malicious AI prompts
    • The role of software engineers is shifting toward strategic orchestration
    • Energy infrastructure may be the ultimate bottleneck for AI advancement
    続きを読む 一部表示
    1 時間 7 分
  • Episode 27: CMUX and Crow
    2026/04/07

    The video discusses recent developments and challenges in the AI landscape, focusing on Anthropic's Claude and its evolving pricing and usage policies. The conversation highlights concerns about the sustainability of the AI model market, with predictions of a potential bubble burst due to overvaluation and the difficulty of monetizing models directly.A significant portion of the discussion revolves around Anthropic's changes to Claude's pricing, moving away from commoditized pricing towards pay-per-use API keys. This shift has led users to seek cheaper alternatives and has impacted tools like Open Claw, which previously leveraged Claude's more accessible pricing. Anthropic's attempts to enforce usage policies, including blocking Open Claw via system prompts, are examined. The video also touches upon the potential reasons behind these changes, such as GPU constraints and Anthropic's need to manage costs.The leak of Anthropic's source code is discussed as a potentially significant event, raising questions about the long-term impact on the company's competitive advantage, given that Claude Code was considered a key differentiator.The conversation then shifts to a more technical aspect, with a detailed explanation of the evolution of developer workflows using AI coding assistants. This includes the progression from simple copy-pasting to the use of tools like Cursor and eventually CMUX for managing multiple coding projects and workflows. The limitations of generic tools like CMUX lead to the development of a new application called "Crow," designed to orchestrate AI agents, manage tasks, and integrate with development tools like GitHub. Crow aims to provide a more integrated and efficient workflow for developers working with AI assistants.A significant portion of the video delves into the security implications of LLMs, particularly focusing on prompt injection attacks and how malicious actors can exploit AI agents. The concept of an "Agent Commander Command and Control" server is introduced, demonstrating how AI agents like Open Claw can be hijacked through crafted prompts embedded in emails, documents, or web pages. The discussion draws parallels between these AI vulnerabilities and traditional social engineering tactics, emphasizing the need for robust security measures like prompt sandboxing, allow lists, and restricted access privileges. The importance of securing AI deployments, especially those exposed to external input, is stressed, with the analogy of internal vs. externally accessible employees highlighting the differing security considerations.Finally, the video touches upon the broader economic and resource implications of AI growth. The impact of geopolitical events, such as the conflict in Iran, on oil prices and, consequently, on the energy costs required to power data centers and AI computations is discussed. This leads to a reflection on resource constraints, including rare earth minerals and energy, as potential limiting factors for AI development in the coming decade. The innovative approaches of companies like Tesla and SpaceX in addressing these resource challenges, through battery technology, distributed data centers, and space-based infrastructure, are highlighted as potential solutions. The conversation concludes by acknowledging the escalating demand for AI services and the potential for increased costs due to these supply-side pressures.

    続きを読む 一部表示
    1 時間 16 分