AI/ML Engineer (Research) β’ Agentic AI β’ Full-Stack β’ DevOps/MLOps β’ Distributed Systems β’ Computer Vision
I build agentic systems end-to-end: model + retrieval + evaluation, planner β executor loops, UX, and deployment. Reliability is how I ship AI safely β idempotency, retries/backoff, DLQ, reconciliation, audit logs, regression gates.
Proof
- Delivered 3Γ throughput while holding p95 β€ 3.8s by pairing guardrailed agent flows with regression gates + evidence bundles.
- Tuned platform schedulers for ~50% less swap thrash and ~$40K/year infra savings via idempotent queues, retries/backoff, and DLQ hygiene.
- Cut 38% inference cost yet kept 60fps UI at 100k+ rows through caching/batching, virtualization, and observability-led tuning.
Background: NYU MS CS (May 2026) β’ Samsung Research β’ Veach AI β’ Research Assistant β’ ES 2026 full paper accepted.
- Agentic AI systems β planner β executor loops, tool use, eval harnesses, safety/guardrails, and logs that keep auditors happy.
- ML/AI pipelines β RAG + LLM apps, training/eval harnesses, latency/cost optimization, experiment tracking.
- Computer Vision β OpenCV/vision-model pipelines, dataset tooling, scoring dashboards.
- Full-stack product β Next.js/React frontends, FastAPI/REST backends, Postgres/Redis data layers.
- DevOps/MLOps β Docker/Kubernetes, CI/CD (GitHub Actions), observability, deployment runbooks.
- Event-driven workflow orchestrator β workflow-orchestrator-sandbox. Agent + ML pipeline engine (FastAPI + Redis + Postgres) with idempotency keys, retries/backoff, DLQ, reconciliation sweeps, and audit-ready metrics. Demo/Docs: repo README.
- RAG evaluation + latency/cost harness β rag-eval-harness. Deterministic dataset loader, caching vector store, async workers, latency/cost dashboards, and CI-ready eval harnesses that drove the 38% cost win. Demo/Docs: repo README.
- High-volume analytics UI β Portfolio. Full-stack (Next.js + APIs + DB) analytics experience with virtualization, workerized transforms, and instrumentation to keep 60fps at 100k+ rows. Demo/Docs: https://karan-allagh.vercel.app.
- Low-latency C++ prototyping β Samsung/Veach internal (non-public). Near-metal agentic kernels for SIMD batching, pipeline hazard detection, and telemetry to hold p95 β€ 3.8s. Demo/Docs: available under NDA.
- Cloud automation agent β Cloud_Automation_Agent-. Electron + Django + Orion agent stack that plans, executes, captures evidence, and enforces guardrails for cloud operations. Demo/Docs: repo README (video WIP).
- Bake reliability patterns (idempotency, retries/backoff, DLQ, reconciliation, audit logs) into every agentic or ML system to keep rollouts safe.
- Reliability is not a phase; itβs the guardrail for AI/ML features before they reach customers.
- Languages: Python, TypeScript/JavaScript, Java (Spring), C++17/20, Go, SQL, Bash.
- ML/AI: LLM tooling (agentic planners, RAG, eval harnesses), cost/latency tuning, dataset tooling.
- Computer vision: OpenCV + vision-model pipelines, dataset scoring + regression dashboards.
- Web & APIs: React/Next.js frontends, FastAPI/REST services, Postgres + Redis data layers.
- Infra / DevOps / MLOps: Docker, Kubernetes, GitHub Actions, Kafka streams, observability (logs/metrics/traces), runbooks + on-call.
- ES 2026 accepted full paper β Agentic Decomposition for Reliable Long-Horizon AI Planning (public preprint coming soon).
- Interest areas: agentic decomposition, evaluation rigor, retrieval quality, latency/cost trade-offs for LLM + CV workloads.
New Grad roles in AI/ML engineering, agentic AI systems, or backend/platform engineering β including Founding Engineer (0β1) opportunities at early-stage startups (NYC hybrid or remote). I love building reliable, observable systems: idempotency, evals, CI, and production-ready deployments.
π« ka3527@nyu.edu β’ LinkedIn β’ Portfolio β’ GitHub
workflow-orchestrator-sandboxβ shows idempotency, retries/backoff, DLQ, reconciliation.rag-eval-harnessβ demonstrates latency/cost benchmarking and async evaluation.Cloud_Automation_Agent-β agentic automation with plan reviews + audit logs.Portfolioβ high-volume UI + recruiter-ready story.office-submissionβ reliability patterns inside Office add-ins.

