About

I’m Dr. Nobel Khandaker. I hold a PhD in Computer Science with a research focus on multi-agent systems, and I’ve spent the years since applying that foundation to production infrastructure — safety-critical real-time control systems, industrial edge middleware, and distributed architectures where downtime has physical consequences.

Zero Downtime is where I write about what I learn along the way.

Most posts here fall into one of four areas:

  • Multi-agent reliability tooling — trust modeling with subjective logic, semantic caching with knowledge-aware invalidation, deadlock and livelock detection for agent workflows. These are the libraries and frameworks I build to solve problems I hit in production.
  • Agentic engineering practices — how to work effectively with coding agents: spec-driven development, agentic project planning, and structured workflows for using AI in real engineering teams.
  • Reliability patterns — backpressure, idempotency, exactly-once illusions, and where their assumptions quietly break in distributed systems.
  • Tooling notes — small observations on instruments I find indispensable.

Contact