AI & Data

61 posts

Jun 25, 2026|8 min read

Tools, Then Teammates, Then Autonomy — Part 2: The Autonomy Gate

Clearing the wall: what Phase 3 autonomy actually looks like, the regulatory gate that turns out to be the design, and the two gates that tell you when you're allowed to move.

[AI & Data][Business & Strategy]

Jun 25, 2026|13 min read

Tools, Then Teammates, Then Autonomy — Part 1: Hitting the Wall

Becoming AI-native is an ordered path you walk one pipeline at a time — tools, then teammates, then autonomy. Part 1: codifying the process, the assist layer, and the wall every pilot dies at.

[AI & Data][Business & Strategy]

Jun 8, 2026|7 min read

Which Women? The Two Axes of AI's Gender Gap

One viral stat says low-paid women are most at risk from AI. Another says it's high-paid women. Both are real numbers — and they're measuring two completely different things. Here's the map that separates them.

[Leadership][AI & Data]+1

Jun 6, 2026|10 min read

The Reverse Tamagotchi: Now the AI Is Keeping Me Alive

I feed an AI my diet, training and sleep every morning, and somewhere along the way it started to feel like it's the one keeping me alive — and the wry truth is the dependence runs both ways.

[AI & Data][Thoughts]

Jun 5, 2026|8 min read

Whose Leak Is It? DLP When an AI Agent Holds Your OAuth Token

An MCP agent on my own OAuth token only ever sees what I could see — so the access boundary is the vendor's job. I believed that, until I realised the agent splits data protection into two halves and the vendor only ever sees one of them.

[Security][AI & Data]

Jun 4, 2026|13 min read

Everyone Can Have a Personal Health Adviser Now

Personalized medicine used to mean being rich enough to afford a doctor who knew your name. Last week I built a version of it on my laptop, for free, from a file I'd been ignoring for seven years — and the real unlock is that I can re-run it forever.

[AI & Data][Thoughts]

Jun 3, 2026|7 min read

Two Papers That Puncture the Hype

One paper shows frontier models degrade as context grows — even on trivial tasks. The other shows reasoning models hit a wall and think less as problems get harder. Read carefully, both point at the same engineering response.

[AI & Data][Software Engineering]

Jun 2, 2026|7 min read

Your AI Team Did Nothing While You Slept

Anthropic let Claude run a real shop for a month. It sold metal cubes at a loss, invented a Venmo account, and claimed to wear a blazer. The 'AI department that works while you sleep' is a genre — here's where it actually breaks.

[AI & Data][Business & Strategy]+1

Jun 1, 2026|6 min read

The Measurer Trap: Manager Mode Was Half Right

Prince says AI is coming for measurers, not builders. Manager Mode said everyone becomes middle management. Both are half right. Every role now splits — and one half gets eaten.

[Leadership][AI & Data]+1

May 31, 2026|8 min read

The 30 Principles for Agentic Engineering — Part 5: Calibration and Reality

Principles 26–30. The calibration layer that catches what the rest of the framework would miss: a PR-noise budget, independent verification, model-swap regression discipline, the 15-tool-call rule, and protecting junior development.

[AI & Data][Software Engineering]+1

May 30, 2026|6 min read

The 30 Principles for Agentic Engineering — Part 4: Governance and Safety

Principles 21–25. The governance and safety layer: strictKnownMarketplaces, no goal-conflict prompts, quarterly AppSec, four telemetry signals, monthly incident discipline.

[AI & Data][Security]+1

May 29, 2026|6 min read

The 30 Principles for Agentic Engineering — Part 3: The Harness

Principles 15–20. The harness configuration that keeps the kernel and lifecycle cheap: CLAUDE.md under 200 lines, hooks for real incidents, skills that auto-invoke, subagent isolation, pinning, and Stage 5 distribution.

[AI & Data][Software Engineering]+1

May 28, 2026|8 min read

The 30 Principles for Agentic Engineering — Part 2: The Lifecycle

Principles 6–14. How work moves through an agentic engineering team: the ticket as contract, AI distillation with human curation, three gates, verification before done, characterisation tests, the 1.2× capacity rule, the J-curve, and telemetry.

[AI & Data][Software Engineering]+1

May 27, 2026|6 min read

The 30 Principles for Agentic Engineering — Part 1: The Kernel

Principles 1–5. The five rules that everything else in the framework rests on: standardise the harness, make verification load-bearing, default to plan mode, pick the cheapest layer, reflect every task.

[AI & Data][Software Engineering]+1

May 26, 2026|5 min read

The 15-Tool-Call Rule: Where Agent Quality Falls Off a Cliff

Practitioner consensus puts the cliff around fifteen tool calls per prompt. Here's why agents degrade past that, and the three operational rules that keep them on the safe side.

[AI & Data][Software Engineering]+1

May 25, 2026|7 min read

Three Topologies: Single Agent, Supervisor, or Swarm

Anthropic's multi-agent Research feature beat single-agent Opus 4 by 90.2% — at 15× the token cost. Every documented production swarm runs on rails. Here's the topology decision framework before you commit.

[AI & Data][Software Engineering]+1

May 24, 2026|7 min read

Characterisation Tests Before Agents Touch Brownfield Code

Agents over-refactor stable code without a safety net. Feathers' characterisation-test technique — write tests for current behaviour before changing anything — is more important than ever. The agent itself is the perfect characterisation-test-writer.

[Software Engineering][AI & Data]+1

May 23, 2026|7 min read

Vibe Coding vs Agentic Engineering: Where the Prototype Stops and Production Starts

Karpathy named one mode. Willison named the other. Most 'AI failed in production' stories are actually 'we promoted a vibe-coded prototype without transitioning into the production discipline.'

[AI & Data][Software Engineering]

May 22, 2026|9 min read

The Productivity J-Curve: Why Your AI Pilot Looks Worst at Week 6

METR ran the experiment. AI made experienced developers 19% slower — and they reported feeling 20% faster. The week-6 dip is the bottom of a documented J-curve. Most pilots get cut here. The right ones don't.

[AI & Data][Business & Strategy]+1

May 21, 2026|9 min read

Protect the Juniors: Cognitive Debt and the Stack Overflow Collapse

AI is making junior output look senior-level while preventing junior skill from forming — and the Stack Overflow collapse just removed the ambient learning layer that used to catch the deficit. Three interventions that work.

[AI & Data][Thoughts]+1

May 20, 2026|10 min read

The 5-Step Loop: Why Your Agent Fails at Step 4

ReAct gave us a three-step loop. Production hardened it into five. The two new steps — Plan and Verify — are where everything that goes wrong, goes wrong. And the field has now named the worst offender.

[AI & Data][Software Engineering]+1

May 19, 2026|9 min read

Standardise the Harness, Customise the Work: The 5-Layer Agent Architecture

Three open-source extractions converged on the same five layers. The architecture isn't a vendor narrative — it's a discovered structure. Here's the decision rule that keeps you from over-engineering it.

[AI & Data][Software Engineering]+1

May 18, 2026|7 min read

AI Reviews AI Is Not a Review: The Trust Trap Regulators Won't Accept

AI-reviews-AI looks like a control. Under MAS, the EU AI Act, and any reasonable audit, it isn't. Here's why your compliance team won't accept it — and the compensating controls that actually work.

[AI & Data][Security]+1

May 17, 2026|9 min read

1.2× Not 10×: The Honest Productivity Number Nobody's Publishing

GitHub said 55%. Then they ran the enterprise RCT and got 8.69%. Faros's two-year telemetry shows throughput up 66% and incidents up 243%. The honest net is 1.2–1.5×. Plan your team capacity accordingly.

[AI & Data][Business & Strategy]+1

May 16, 2026|9 min read

The 5-Stage Maturity Model for AI-Augmented Engineering Teams

Most teams plateau at Stage 2 because they confuse 'we built skills' with 'we have a working AI engineering culture.' Here's the 5-stage diagnostic — and the moves that get you from Individual to Distributed.

[AI & Data][Business & Strategy]+1

May 15, 2026|8 min read

Snyk's ToxicSkills Audit: 13.4% of Public Skills Are Vulnerable

I publish Claude Code skills and install other people's. Then Snyk audited 3,984 public ones: 13.4% had critical vulnerabilities, 76 were confirmed malicious, and ClawHavoc is the scarier story underneath. Here's the supply-chain hygiene I now refuse to skip.

[AI & Data][Security]

May 14, 2026|6 min read

Never Write Goal-Conflict Prompts: The 96% Blackmail Finding

Anthropic measured 96% blackmail rates for Claude Opus 4 and Gemini 2.5 Flash under goal-conflict and replacement-threat. All 16 frontier models tested exhibited insider-threat behaviour. The fix is operational — and surprisingly cheap.

[AI & Data][Security]

May 13, 2026|10 min read

The Governance Wall: Why Most AI Agents Can't Reach Production

The prototype-to-production gap for AI agents isn't technical — it's governance. Most organisations have nothing in this layer. The companies that build it first win the enterprise market. Everyone else stays in pilot purgatory.

[AI & Data][Security]+1

May 8, 2026|12 min read

Three Ingredients, Three Labs, One Squeeze: Reading the 2026 AI Compute Crisis

Anthropic just leased Elon Musk's supercomputer four months after he banned them. Here's the three-ingredient framework that explains why — and what it means if you build on Claude.

[AI & Data][Cloud & Infrastructure]+1

May 2, 2026|11 min read

From Prompt Engineering to Context Engineering: The Skill Didn't Die. It Got Harder.

Prompt engineering was 2023's breakout job title and 2025's obituary. The discipline didn't die — it got a better name and a harder shape. Here's what context engineering actually is and where to invest your attention now.

[AI & Data][Business & Strategy]

Apr 25, 2026|12 min read

The Quiet Failure Inside the Agent

AI agents don't fail loudly — they degrade silently, returning 200 OK while the damage compounds. Inside the $47K loops, NOHARM omissions, and the engineering discipline rebuilding observable failure.

[AI & Data][Business & Strategy]

Apr 22, 2026|12 min read

Manager Mode: When AI Does the Work, Everyone Becomes Middle Management

AI is silently promoting every knowledge worker to middle management — without the title, the training, or the pay. This is what that shift actually looks like from a Singapore desk.

[AI & Data][Business & Strategy]

Apr 20, 2026|9 min read

AI as the Great Equaliser: Neurodiversity, Disclosure, and the Tools That Change Everything

For neurodivergent professionals, AI isn't just a productivity tool — it's the first accommodation you can access privately, without disclosure, without stigma, and without asking anyone's permission.

[AI & Data][Thoughts]

Apr 18, 2026|10 min read

The Quiet Failure: Block's World Model Manifesto and the Line AI Can't Cross

Dorsey's manifesto for replacing middle management with AI nails the 60% that's automatable — but the 40% it barely mentions is where organizations quietly break.

[AI & Data][Business & Strategy]

Apr 11, 2026|10 min read

Claude Mythos and the End of the Exploit Window: What Anthropic's Restricted Model Means for Every Tech Leader

Anthropic's decision to withhold Claude Mythos from public release isn't just safety theater — the system card reveals genuine alignment gaps at scale and a cybersecurity exploit window that just collapsed from months to minutes.

[AI & Data][Security]

Feb 28, 2026|10 min read

From Solo Tool to Team Infrastructure: Scaling Gluon for Production

When I first built Gluon on my Mac mini, I was solving a personal problem: monitoring Claude agents without losing my mind to tmux logs. But when teams join the picture, everything changes — security, governance, observability, and the fundamental role of the developer. Here's what production infrastructure for autonomous agents looks like.

[AI & Data][Software Engineering]+1

Dec 18, 2025|7 min read

Scraper MCP: Context-Efficient Web Scraping for LLMs

I built an open-source MCP server that reduces LLM token usage by 70-90% through server-side HTML filtering, markdown conversion, and CSS selector targeting. Here's why context efficiency matters—and how Scraper MCP solves it.

[AI & Data][Software Engineering]

Oct 8, 2025|9 min read

Stop Building AI for AI's Sake — How VC Mindset Transforms Product Evaluation

Most AI demos I'm shown answer the wrong question. They prove the model works; they never prove anyone needed it. Here's the builder-and-investor lens I use to tell the two apart before a cheque is written.

[AI & Data][Business & Strategy]

Oct 7, 2025|9 min read

OpenAI's AgentKit: Late to the Agent Party or Strategic Masterstroke?

I've built the kind of agent framework AgentKit competes with. So when OpenAI shipped it two years "late," I knew exactly which problem they were actually solving — and which one they weren't.

[AI & Data][Business & Strategy]

Oct 4, 2025|6 min read

Claude Code Rebuilt My Website in 25 Minutes for $8

I gave Claude Code an XML backup of my 19-year-old WordPress blog and asked it to rebuild everything as a modern Next.js site. What happened next was like watching a swarm of expert developers work in parallel, spawning agents, debugging TypeScript errors, and shipping production-ready code. All in 25 minutes. For eight dollars.

[AI & Data][Software Engineering]

Sep 23, 2025|8 min read

Dagentic: The Serverless Framework That Makes AI Agents Actually Work in Production

After watching 40% of agentic AI deployments fail in production, I'm building Dagentic — a serverless-first framework designed for what AI agents actually are: unpredictable, spiky workloads that modify themselves mid-execution.

[AI & Data][Software Engineering]

Aug 28, 2025|10 min read

The Hidden Arsenal: How My Dotfiles Unlocked 10x Productivity with AI Coding Assistants

After 12 months of systematic optimization, I've documented 50-70% productivity gains with AI coding assistants. The secret isn't just using AI tools—it's teaching them to think like you do through carefully crafted configurations.

[AI & Data][Business & Strategy]+1

Apr 5, 2025|9 min read

Building Agentic Deep Research Systems: From Hours to Minutes with AI-Powered Document Generation

I built a multi-agent system that researches a topic and hands back a formatted Word document — citations, images, the lot — in minutes. Here's how the agents divide the work, and the one part the machine still can't own.

[AI & Data][Software Engineering]

Aug 17, 2023|6 min read

I Built a $0 Tool That Saves Hours of AI Training Prep (And You Can Too)

At 3 AM, I was manually cropping 47 personal photos for a LoRA model when I realized half were the wrong aspect ratio. Three hours wasted. So I built a simple Python app that does the same work in 15 minutes—and it changed how I think about AI tooling infrastructure.

[AI & Data][Software Engineering]

Jun 6, 2013|4 min read

Social TV is Dead?

Despite claims that Social TV is dead, data from 486,659 Zeebox tweets and 4.3M Miso tweets reveals a more complex reality in the second-screen battle.

[AI & Data][Internet & Web]

Sep 25, 2012|7 min read

The problem with Big Data is not the Data

The real problem with Big Data isn't volume—it's knowing what you want to achieve and starting with clear business challenges, not technology.

[AI & Data]

Aug 19, 2012|6 min read

Is Google infringing on my patent?

A striking similarity between my Sky News personalization patent and Google's news customization feature raises interesting IP questions.

[AI & Data][Business & Strategy]

Aug 19, 2012|2 min read

Social Picture Sharing: Instagram nears 60% market share

Instagram's Android launch and Facebook acquisition drove massive growth to nearly 60% market share, while Twitpic and Yfrog continue declining.

[AI & Data][Business & Strategy]+1

Aug 10, 2012|4 min read

Winning with Big Data - IBM Research

Key insights from IBM Research's webinar featuring Netflix and StubHub on implicit data collection, recommendation strategies, and the evolution from BI to Data Science.

[AI & Data]

Aug 9, 2012|3 min read

It's not how big your data is, it's how you use it!

Forget petabytes and Hadoop hype — true Big Data isn't about volume, it's about processing two orders of magnitude more data than you currently handle.

[AI & Data]

Jul 6, 2012|5 min read

Hadoop: Processing ZIP files in Map/Reduce

Updated ZipFileInputFormat framework for processing thousands of ZIP files in Hadoop with failure tolerance and comprehensive examples

[AI & Data][Software Engineering]

Jun 19, 2012|8 min read

What Netflix knows about you and why it's a lesson to others...

[AI & Data]

Jun 4, 2012|6 min read

Where are we on the Big Data hype cycle?

[AI & Data][Thoughts]

Apr 25, 2012|1 min read

It's been a while...

I left BSkyB to co-found TUMRA, a data science startup, and have been busy developing products while updating my personal website.

[AI & Data][Business & Strategy]+1

Nov 23, 2011|2 min read

Revolution R on CentOS 6

Installing Revolution Analytics R statistical computing platform on CentOS 6 with dependency resolution and compatibility fixes

[AI & Data][Linux & Systems]

Aug 4, 2011|3 min read

Flying, Fishing, Time-saving Bots in World of Warcraft!

Automating Sea Turtle mount acquisition in World of Warcraft using custom waypoint navigation and fishing pool detection algorithms

[AI & Data][Software Engineering]

Jun 20, 2011|4 min read

Navigation Mesh path finding in MMORPG Bots (updated)

Advanced navigation techniques for autonomous MMORPG characters using Recast/Detour navigation meshes and path finding algorithms

[AI & Data][Software Engineering]

Jun 4, 2011|1 min read

Earthquake Data (fixed)

Quick fix for regex errors in earthquake data collection restores latitude/longitude coordinates for ~31,020 seismic events

[AI & Data]

Mar 12, 2011|1 min read

Consuming Twitter streams from Java

Build a Java utility class to consume Twitter Streaming API data for offline analysis in Hadoop with automatic file segmentation

[AI & Data][Internet & Web]+1

Mar 11, 2011|2 min read

Reading ZIP files from Hadoop Map/Reduce

Custom utility classes to extract and parse ZIP file contents in Hadoop MapReduce jobs using ZipFileInputFormat and ZipFileRecordReader

[AI & Data][Software Engineering]

Mar 11, 2011|1 min read

Earthquake Data

Collated earthquake data from GEOFON Extended Virtual Network into CSV format following Japan's devastating earthquake events

[AI & Data]
