About

I'm a machine learning engineer working on LLM inference and applied AI. I'm currently building Vajra at Georgia Tech and consulting independently.

2025
ML Systems Engineer · Georgia Tech, Systems for AI Lab

Building Vajra, an open-source LLM inference engine. Worked on quantization & MoE support, experiment infra, CI, and telemetry. Lead developer of Veeksha (LLM performance & quality evaluation).

Applied AI Consultant · Independent

Advising companies on end-to-end applied AI: requirements, data, training, eval, and serving.

2024
Visiting Researcher · Barcelona Supercomputing Center

Studied advanced batching for LLM inference during my MSc.

Data Scientist · CaixaBank

ML systems for asset & credit risk. Productionized work with measured impact in the tens of millions.

2022
Research Assistant · U. of Waikato & BarcelonaTech

~2 years across two projects. Explainable AI research (evaluation of attention-based explanations) under Dr. Alvin Jia and Prof. Albert Bifet. Data analysis and modeling for COPEDI-Cat, a Catalan COVID-19 paediatric response network (~150 collaborating paediatricians).

2021
BSc in Data Science and Engineering · BarcelonaTech

First cohort of Spain's data science program. Thesis and internships centered on ML.

Me and Cookie 🍪