⚠ This page is served via a proxy. Original site: https://github.com
This service does not collect credentials or authentication data.
Skip to content
View Karan-05's full-sized avatar
πŸ˜ƒ
FUN
πŸ˜ƒ
FUN

Block or report Karan-05

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Karan-05/README.md

Karan Allagh

AI/ML Engineer (Research) β€’ Agentic AI β€’ Full-Stack β€’ DevOps/MLOps β€’ Distributed Systems β€’ Computer Vision

I build agentic systems end-to-end: model + retrieval + evaluation, planner β†’ executor loops, UX, and deployment. Reliability is how I ship AI safely β€” idempotency, retries/backoff, DLQ, reconciliation, audit logs, regression gates.

Proof

  • Delivered 3Γ— throughput while holding p95 ≀ 3.8s by pairing guardrailed agent flows with regression gates + evidence bundles.
  • Tuned platform schedulers for ~50% less swap thrash and ~$40K/year infra savings via idempotent queues, retries/backoff, and DLQ hygiene.
  • Cut 38% inference cost yet kept 60fps UI at 100k+ rows through caching/batching, virtualization, and observability-led tuning.

Background: NYU MS CS (May 2026) β€’ Samsung Research β€’ Veach AI β€’ Research Assistant β€’ ES 2026 full paper accepted.

What I build

  • Agentic AI systems – planner β†’ executor loops, tool use, eval harnesses, safety/guardrails, and logs that keep auditors happy.
  • ML/AI pipelines – RAG + LLM apps, training/eval harnesses, latency/cost optimization, experiment tracking.
  • Computer Vision – OpenCV/vision-model pipelines, dataset tooling, scoring dashboards.
  • Full-stack product – Next.js/React frontends, FastAPI/REST backends, Postgres/Redis data layers.
  • DevOps/MLOps – Docker/Kubernetes, CI/CD (GitHub Actions), observability, deployment runbooks.

Featured Projects

  • Event-driven workflow orchestrator β€” workflow-orchestrator-sandbox. Agent + ML pipeline engine (FastAPI + Redis + Postgres) with idempotency keys, retries/backoff, DLQ, reconciliation sweeps, and audit-ready metrics. Demo/Docs: repo README.
  • RAG evaluation + latency/cost harness β€” rag-eval-harness. Deterministic dataset loader, caching vector store, async workers, latency/cost dashboards, and CI-ready eval harnesses that drove the 38% cost win. Demo/Docs: repo README.
  • High-volume analytics UI β€” Portfolio. Full-stack (Next.js + APIs + DB) analytics experience with virtualization, workerized transforms, and instrumentation to keep 60fps at 100k+ rows. Demo/Docs: https://karan-allagh.vercel.app.
  • Low-latency C++ prototyping β€” Samsung/Veach internal (non-public). Near-metal agentic kernels for SIMD batching, pipeline hazard detection, and telemetry to hold p95 ≀ 3.8s. Demo/Docs: available under NDA.
  • Cloud automation agent β€” Cloud_Automation_Agent-. Electron + Django + Orion agent stack that plans, executes, captures evidence, and enforces guardrails for cloud operations. Demo/Docs: repo README (video WIP).

Core strength β€” Distributed Systems & Reliability

  • Bake reliability patterns (idempotency, retries/backoff, DLQ, reconciliation, audit logs) into every agentic or ML system to keep rollouts safe.
  • Reliability is not a phase; it’s the guardrail for AI/ML features before they reach customers.

Reliability patterns I reach for

Reliability patterns diagram

Tech stack

  • Languages: Python, TypeScript/JavaScript, Java (Spring), C++17/20, Go, SQL, Bash.
  • ML/AI: LLM tooling (agentic planners, RAG, eval harnesses), cost/latency tuning, dataset tooling.
  • Computer vision: OpenCV + vision-model pipelines, dataset scoring + regression dashboards.
  • Web & APIs: React/Next.js frontends, FastAPI/REST services, Postgres + Redis data layers.
  • Infra / DevOps / MLOps: Docker, Kubernetes, GitHub Actions, Kafka streams, observability (logs/metrics/traces), runbooks + on-call.

Research

  • ES 2026 accepted full paper – Agentic Decomposition for Reliable Long-Horizon AI Planning (public preprint coming soon).
  • Interest areas: agentic decomposition, evaluation rigor, retrieval quality, latency/cost trade-offs for LLM + CV workloads.

What I’m looking for

New Grad roles in AI/ML engineering, agentic AI systems, or backend/platform engineering β€” including Founding Engineer (0β†’1) opportunities at early-stage startups (NYC hybrid or remote). I love building reliable, observable systems: idempotency, evals, CI, and production-ready deployments.

πŸ“« ka3527@nyu.edu β€’ LinkedIn β€’ Portfolio β€’ GitHub

Pinned repos recommendation

  1. workflow-orchestrator-sandbox β€” shows idempotency, retries/backoff, DLQ, reconciliation.
  2. rag-eval-harness β€” demonstrates latency/cost benchmarking and async evaluation.
  3. Cloud_Automation_Agent- β€” agentic automation with plan reviews + audit logs.
  4. Portfolio β€” high-volume UI + recruiter-ready story.
  5. office-submission β€” reliability patterns inside Office add-ins.

Pinned Loading

  1. Cloud_Automation_Agent- Cloud_Automation_Agent- Public

    Electron + Django agent that executes cloud console tasks with guardrails

    Python

  2. Drowning_Model Drowning_Model Public

    Jupyter Notebook

  3. Improved-Drowning-System Improved-Drowning-System Public

    Forked from Inj3-GT/Improved-Drowning-System

    Garry's Mod / Inj3 - Improved Drowning System

    Lua

  4. Portfolio Portfolio Public

    Next.js + Tailwind portfolio with Home/Projects/Resume/Contact

    TypeScript

  5. reddit-assignment reddit-assignment Public

    TypeScript

  6. StructuredLabs/preswald StructuredLabs/preswald Public

    Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide…

    Python 4.3k 652