Lead AI / ML Engineer @ ZetaGlobal
Building ML/AI systems that move from research to production and scale.
Published Research
I help teams turn AI ideas into reliable, high-impact products with measurable outcomes.
- Distributed training for SLMs, LLMs and LVLMs
- High-throughput, cost-efficient inference at scale
- Data-centric AI pipelines with production-grade observability
- Production-ready agentic AI and evaluation workflows
Flagship Open Source Product
harneXa/nexa-gauge
A graph-based evaluation toolkit for LLM and RAG systems with repeatable quality checks, upfront cost visibility, and clean per-case outputs for analysis.
- Graph-native evaluation flow (scan -> claims -> metrics -> eval)
- Cost visibility before runtime with estimate-first execution
- Cache-aware runs to avoid duplicate spend and recomputation
- Coverage across relevance, grounding, redteam, GEval, and reference scoring
- Production-friendly CLI for run, estimate, and cache management
- Scales with control across utility and metric nodes
BYOM · Ollama support in progress
Next Product
harneXa/nexa-prism
Coming Soon10+
Years in ML & AI Research and Engineering
2–3x
Inference Performance Gains
~50%
ML Infra Cost Reduction
Published
ICCV and AAMAS
Weeks → Hours
Governed pipelines and data workflows that accelerate model iteration
6–10%
Use-case specific model gains through architecture tradeoffs and training pipeline design
Capabilities
What I Work With
Weighted from public GitHub activity with recency, stars, and topic signals.
Languages & Core
Python
Deep Learning
ML
Transfer Learning
C++
LLMs
Modeling Stack
Ray
Triton
PyTorch
Voxel51
RAG
CUDA
TensorFlow
HuggingFace
Keras
vLLM
Infra $ Tools
Docker
WandB
Langfuse
AWS
DataDog
Kubernetes