The 30 Principles for Agentic Engineering — Part 4: Governance and Safety
Principles 21–25. The governance and safety layer: strictKnownMarketplaces, no goal-conflict prompts, quarterly AppSec, four telemetry signals, monthly incident discipline.
12 posts
Principles 21–25. The governance and safety layer: strictKnownMarketplaces, no goal-conflict prompts, quarterly AppSec, four telemetry signals, monthly incident discipline.
AI-reviews-AI looks like a control. Under MAS, the EU AI Act, and any reasonable audit, it isn't. Here's why your compliance team won't accept it — and the compensating controls that actually work.
Snyk audited 3,984 public Claude Code skills. 13.4% had critical vulnerabilities. 76 were confirmed malicious. ClawHavoc is the separate, scarier story. Here's the supply-chain hygiene most teams aren't doing.
Anthropic measured 96% blackmail rates for Claude Opus 4 and Gemini 2.5 Flash under goal-conflict and replacement-threat. All 16 frontier models tested exhibited insider-threat behaviour. The fix is operational — and surprisingly cheap.
The prototype-to-production gap for AI agents isn't technical — it's governance. Most organisations have nothing in this layer. The companies that build it first win the enterprise market. Everyone else stays in pilot purgatory.
Anthropic's decision to withhold Claude Mythos from public release isn't just safety theater — the system card reveals genuine alignment gaps at scale and a cybersecurity exploit window that just collapsed from months to minutes.
Quick and easy fix for the Heartbleed OpenSSL vulnerability on CentOS systems using yum update and service restart commands.
A cautionary tale about sshdfilter blocking localhost (127.0.0.1) and breaking core system services due to lack of trusted address exceptions
Enhance Apache's mod_evasive security module with detailed environment variable reporting similar to mod_security for better attack analysis
Deep technical analysis of how comment spam evolved from simple bots to sophisticated coordinated botnets, including real attack logs and countermeasures
Protect your SSH server from brute force attacks and port scans with sshdfilter, an iptables-based security tool that blocks threats in real-time
Daily Brute Force Attack Logs This is one of the main reasons I hate running SSH on the standard port numbers, every day I get log-alerts like these.