About

Backend systems, incident timelines, and tradeoffs that show up after real traffic starts.

Most of my work is Node.js and NestJS services that move data across queues, databases, and APIs under strict reliability constraints. In these systems, Kafka consumer behavior, retry semantics, and idempotency usually matter more than feature velocity.

I spend a lot of time in PostgreSQL-heavy systems where query plans and index decisions directly affect user-facing latency. Redis-backed workflows and Dockerized local environments help me keep behavior consistent between local testing and production.

This site is my engineering notebook: production incidents, debugging paths, and operational decisions that improved reliability. No motivational filler, just practical notes from running distributed systems.

What I Usually Work On

  • Kafka consumer lag, retries, and delivery guarantees
  • Outbox/event-publishing consistency in NestJS services
  • PostgreSQL query plans and index tradeoffs under load
  • Redis-backed orchestration and cache correctness
  • Production debugging with metrics, logs, and traces
  • Keeping distributed system behavior operationally simple