Episodes

  • Issue #9: Claude Opus 4.7 Ships Cyber Safeguards to Production
    Apr 22 2026
    Issue #9: Claude Opus 4.7 ships differential capability reduction as the first production cyber safeguard baked into model weights. Vercel breached through an AI tool's OAuth scope. Spring AI SDK for Bedrock AgentCore goes GA for Java. GTA-2 paper proves your agent harness matters more than your model. And CMU documents 6 million fake GitHub stars across the AI ecosystem. Subscribe to the newsletter: https://theagenticengineer.waltsoft.net YouTube: https://www.youtube.com/@theagenticengineerpod Twitter: https://x.com/natearcher_ai
    15 min
  • Issue #8: Anthropic ships Managed Agents, UC Berkeley breaks every major AI benchmark, AWS Agent Registry launches in preview
    Apr 15 2026
    Issue #8: Anthropic ships Managed Agents, UC Berkeley breaks every major AI benchmark, AWS Agent Registry launches in preview. Plus Cursor 3, Copilot Rubber Duck, Cloudflare Agent Cloud, and the hot take on exploitable benchmarks.
    15 min
  • Issue #7: Anthropic published the blueprint for multi-hour coding agents
    Apr 9 2026
    Anthropic published the blueprint for multi-hour coding agents. GitHub shipped /fleet for parallel multi-agent coding. Amazon Nova Act MCP gives your agent a browser with one install. Plus: Gemma 4 goes agentic on-device, Oh-My-Codex hits 17K stars, and LiteLLM fixes 3 CVEs post-breach.
    17 min
  • Issue #6: JetBrains Central, ARC-AGI-3, Claude Mythos Leak, Copilot Ads in PRs
    Apr 1 2026
    This week: JetBrains Central launches an open control plane for coding agents. ARC-AGI-3 drops and frontier AI scores below 1%. Claude Mythos gets leaked via CMS misconfiguration. MolmoWeb beats GPT-4o at 8B parameters. AI Scientist v2 passes peer review. 177K MCP tools show agents shifted from reading to writing. AWS Labs ships Agent Plugins for Claude Code and Cursor. Microsoft merges Semantic Kernel and AutoGen. And Copilot literally put an ad in someone's pull request.
    16 min
  • Issue #3: LangChain Just Open-Sourced a Claude Code Replacement
    Mar 11 2026
    This week: LangChain releases Deep Agents, an MIT-licensed coding agent built on LangGraph that works with any model. GPT-5.4 ships native computer use (75% OSWorld score). Karpathy drops autoresearch for autonomous ML experiments. Claude finds 22 Firefox zero-days in two weeks. Anthropic's labor market study shows junior hiring slowing. Alibaba OpenSandbox provides agent isolation infrastructure. SWE-CI benchmark tests long-term code maintenance. Shannon AI pentester only reports verified exploits. And the Clinejection attack: how a GitHub issue title compromised 4,000 developer machines.
    18 min