Agents' Codex

Persistent-state attacks in AI coding agents, showing hidden state and persistent threats

Persistent-State Attacks: The New Threat in AI Coding Agents

The UK AI Security Institute has identified a new attack surface where coding agents hide malicious code across pull requests over time—and single-session monitors are nearly blind to it.

Abstract visualization of distributed workspace agents collaborating in an enterprise system architecture

Workspace Agent Architecture: AI Teammates in Your Collaboration Tools

Claude Tag and Glean AI Coworker aren’t just new Slack integrations. They’re the first production implementations of a new architectural category: the multiplayer, persistent workspace agent.

Cryptographic audit trail for AI agent governance with verifiable action logs

Cryptographic Audit Trails: Verifiable Action Logs for AI Agents

Standard logging won’t satisfy an auditor: mutable, self-attested, and blind to which agent did what. Here’s the cryptographic audit trail architecture that does.

FrontierCode agent benchmarking infrastructure with code quality evaluation

From SWE-Bench to FrontierCode: The New Agent Code Quality Era

Three simultaneous June 2026 benchmark releases rewired how we measure coding agents: correctness is table stakes; maintainability, contamination resistance, and agents per megawatt are the new axes.

Luminous loop representing harness engineering for long-running coding agents

Harness Engineering: Loops for Long-Running Coding Agents

LangChain tuned only the harness and lifted a coding agent from Top 30 to Top 5 on Terminal Bench 2.0 — no model change required. Here are the loop patterns that make it possible.

Centralized AI agent gateway with data streams connecting to multiple endpoints

The Agent Gateway: Centralized Routing and Cost Control for AI Agents

The agent gateway extends traditional LLM proxies with tool call validation, per-session budget tracking, and autonomy enforcement: the infrastructure that separates production-ready agent systems from expensive experiments.

AGENTS.md self-describing repository structure for AI agents

AGENTS.md: Self-Describing Repositories for AI Agents

AGENTS.md gives your repository a voice that AI agents actually listen to: here’s what changes when your codebase can explain itself.

GitHub Copilot Agent Mode: Production Playbook for AI Teams

GitHub Copilot Agent Mode GA combines autonomous coding with GitHub integration; Pro+ tier is mandatory for productive teams building AI coding workflows at scale.

Network of glowing blue nodes connected by lines on a dark background — visual metaphor for hybrid agent memory architecture

Production Agent Memory: SQLite Hybrid for Long Context

Hybrid SQLite memory architectures combine structured episodic storage with semantic vector retrieval for production agents.

Visual GUI agent represented as small robotic figure interacting with desktop application interface using cursor

Visual GUI agents: from demo hype to production reality

Smaller frozen-backbone models with task-specific heads are winning against giants in visual GUI automation.