About
I’m Dr. Nobel Khandaker. I hold a PhD in Computer Science with a research focus on multi-agent systems, and I’ve spent the years since applying that foundation to production infrastructure — safety-critical real-time control systems, industrial edge middleware, and distributed architectures where downtime has physical consequences.
Zero Downtime is where I write about what I learn along the way.
Most posts here fall into one of four areas:
- Multi-agent reliability tooling — trust modeling with subjective logic, semantic caching with knowledge-aware invalidation, deadlock and livelock detection for agent workflows. These are the libraries and frameworks I build to solve problems I hit in production.
- Agentic engineering practices — how to work effectively with coding agents: spec-driven development, agentic project planning, and structured workflows for using AI in real engineering teams.
- Reliability patterns — backpressure, idempotency, exactly-once illusions, and where their assumptions quietly break in distributed systems.
- Tooling notes — small observations on instruments I find indispensable.
Contact
- LinkedIn — nobelkhandaker
- GitHub — @nobelk
- RSS — /feed.xml