Research Engineer & Machine Learning Engineer

Research engineer focused on deep learning and scalable ML infrastructure. I'm drawn to the unsolved parts of the field, where the science is still being written and the engineering hasn't caught up yet, and to building the systems that make those ideas real.

Skills

Languages
Python C C++ Go Mojo Java JavaScript
DL/ML
PyTorch TensorFlow Keras CUDA scikit-learn
Fine-Tuning
PEFT SFT RLHF RLAIF DPO GRPO LoRA QLoRA Unsloth
GenAI
Transformers Diffusers vLLM LangChain LangSmith LangGraph LlamaIndex llama.cpp
Infrastructure
Docker Kubernetes AWS GCP Azure CDK Terraform

Experience

Research Engineer/Machine Learning Engineer 2025 - Present
Self-employed
Built several systems at the intersection of ML infrastructure and deep learning research:
  • Currently researching YORO, a novel LLM architecture that runs the main reasoning block once and reuses its output across all generated tokens, replacing repeated full-model passes with a lightweight auxiliary network
  • Built a pipelined layer-streaming system enabling full LLM inference at a fraction of the model's VRAM footprint by overlapping disk, CPU, and GPU transfers
  • Developed a runtime correctness checker for CUDA kernels using outlier-biased sampling with zero training graph impact
  • Implemented a GPU-accelerated deep learning framework in Mojo with custom autograd and explicit GPU kernel implementations
  • Built a distributed LLM inference system routing OpenAI-compatible requests across llama.cpp nodes with automatic model distribution and mutual TLS security
Software/AI Engineer 2023 - Present
Spotted Zebra
Designed and shipped agentic AI systems across the full stack, from architecture through production:
  • Built agentic AI pipelines end-to-end, including monitoring and observability
  • Played a major role in the company's agentic AI transition, contributing to technical direction and architecture decisions
  • Implemented embedding model-based semantic indexing and retrieval systems
  • Actively consulted on AI and data science decisions, shaping how the team approached and integrated intelligent systems
Software Development/DevOps Engineer 2021 - 2023
Amazon
Built and maintained the data infrastructure and pipelines powering a deep learning system serving millions of users worldwide:
  • Handled massive data volumes across distributed systems, ensuring data quality, feature reliability, and pipeline integrity that models directly depended on
  • Advised the ML team on causality analysis algorithms and contributed technical input throughout the model design and development process
  • Managed complex DevOps workflows and large-scale deployment pipelines for global production systems
Intern Software Development/DevOps Engineer 2020
Amazon
Designed and built an AI chatbot capable of knowledge retrieval and contextual answer formulation, delivered end-to-end:
  • Architected a scalable, production-grade system from scratch, owning the full pipeline from design through deployment

Education

M.S. in Computer Science
2021 - 2023

Thesis: Efficient Deep Single-Image Super-Resolution on Mobile Devices
A study of deep learning-based methods for efficient mobile SISR across multiple upscaling factors (×2, ×3, ×4), optimizing first for PSNR, then for perceptual loss. Architectures were kept deliberately shallow to minimize inference time, with increased width compensating for the reduced depth and maximizing GPU parallelism.

B.S. in Computer Science
2017 - 2021

Thesis: Resource Streaming using a Peer-to-Peer Architecture
Proposed a decentralized P2P resource streaming framework with no imposed hierarchy, eliminating single points of failure and reducing privacy risk. The architecture uses locality-aware distributed hash tables (LDHTs) for per-node network state and a custom epidemic protocol for network-wide information propagation.

Certifications

LLM Course
Hugging Face
LLM Post Training Course
Hugging Face
Deep Learning Specialization
DeepLearning.AI
Machine Learning Specialization
DeepLearning.AI, Stanford Online
Mathematics for Machine Learning and Data Science Specialization
DeepLearning.AI, Serrano Academy
PyTorch for Deep Learning Specialization
DeepLearning.AI
TensorFlow Developer Professional Certificate
DeepLearning.AI
AI Agents Course
Hugging Face
AI Agents MCP Course
Hugging Face
Generative AI for Software Development Specialization
DeepLearning.AI
Data Engineering Specialization
DeepLearning.AI, Amazon Web Services
Data Analytics Specialization
DeepLearning.AI
AI for Good Specialization
DeepLearning.AI
AWS Partner: Technical
Amazon Web Services
Database Design and Programming with SQL
Oracle