Projects
- Mixture of Experts Small Language Model
From-scratch implementation of the Mixture of Experts architecture for efficient small language models. Implements sparse expert routing, gating networks, and conditional computation to increase model capacity without a proportional increase in compute cost. Explores scaling laws and expert specialization patterns.
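The core idea — a gating network picks the top-k experts per token, and only those experts run — can be sketched in a few lines of NumPy. This is an illustrative toy under assumed names (`top_k_gating`, `moe_forward`, and the expert callables are hypothetical), not the project's PyTorch implementation:

```python
import numpy as np

def top_k_gating(x, w_gate, k=2):
    """Route each token to its top-k experts via a softmax gate.

    x: (n_tokens, d_model) token representations
    w_gate: (d_model, n_experts) gating weights
    Returns (indices, weights): top-k expert ids and renormalized gate scores.
    """
    logits = x @ w_gate                                   # (n_tokens, n_experts)
    z = np.exp(logits - logits.max(axis=-1, keepdims=True))
    probs = z / z.sum(axis=-1, keepdims=True)             # softmax over experts
    idx = np.argsort(probs, axis=-1)[:, -k:]              # ids of the k largest gates
    top = np.take_along_axis(probs, idx, axis=-1)
    top = top / top.sum(axis=-1, keepdims=True)           # renormalize to sum to 1
    return idx, top

def moe_forward(x, w_gate, experts, k=2):
    """Conditional computation: each token is processed only by its top-k experts."""
    idx, gates = top_k_gating(x, w_gate, k)
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        for j in range(k):
            out[t] += gates[t, j] * experts[idx[t, j]](x[t])
    return out
```

The compute saving comes from the inner loop: each token touches k experts rather than all of them, so capacity (number of experts) and per-token FLOPs are decoupled.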
- nanoGPT - Transformer Implementation
Clean, production-quality GPT architecture implemented from first principles in PyTorch. Explores autoregressive language modeling, multi-head self-attention, positional encodings, and layer normalization. Educational deep-dive into transformer mechanics and training dynamics.
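Of the pieces listed, positional encoding is the easiest to show compactly. Below is the classic sinusoidal scheme from "Attention Is All You Need" as a NumPy sketch — note this is one option among several, and GPT-style models (nanoGPT included) typically use learned position embeddings instead:

```python
import numpy as np

def sinusoidal_positions(n_pos, d_model):
    """PE[pos, 2i] = sin(pos / 10000^(2i/d)), PE[pos, 2i+1] = cos(same angle)."""
    pos = np.arange(n_pos)[:, None]                 # (n_pos, 1)
    i = np.arange(0, d_model, 2)[None, :]           # even feature indices
    angles = pos / (10000 ** (i / d_model))         # (n_pos, d_model // 2)
    pe = np.zeros((n_pos, d_model))
    pe[:, 0::2] = np.sin(angles)                    # even dims: sine
    pe[:, 1::2] = np.cos(angles)                    # odd dims: cosine
    return pe
```

Each position gets a unique, bounded vector, and relative offsets correspond to fixed linear transforms of these vectors — the property that lets attention reason about order.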
- Model-Agnostic Meta-Learning (MAML)
Research implementation of the MAML algorithm for few-shot learning, based on Finn et al. (2017). Enables rapid adaptation to new tasks with minimal examples through second-order gradient-based meta-learning. Explores inner/outer loop optimization and task distribution strategies.
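The inner/outer loop structure can be sketched on a toy linear-regression task family. For brevity this sketch uses the first-order approximation (FOMAML) rather than the full second-order update the project implements, and `fomaml_step` / the support-query task format are assumed names:

```python
import numpy as np

def mse_grad(w, X, y):
    """Gradient of mean-squared error for a linear model y_hat = X @ w."""
    return 2.0 * X.T @ (X @ w - y) / len(y)

def fomaml_step(w, tasks, inner_lr=0.01, outer_lr=0.001, inner_steps=1):
    """One meta-update. tasks: list of ((Xs, ys), (Xq, yq)) support/query splits."""
    meta_grad = np.zeros_like(w)
    for (Xs, ys), (Xq, yq) in tasks:
        w_task = w.copy()
        for _ in range(inner_steps):                  # inner loop: adapt to this task
            w_task -= inner_lr * mse_grad(w_task, Xs, ys)
        meta_grad += mse_grad(w_task, Xq, yq)         # outer grad, evaluated post-adaptation
    return w - outer_lr * meta_grad / len(tasks)      # outer loop: improve the initialization
```

The point of the outer step is that it optimizes the initialization for *post-adaptation* query loss, so a few inner gradient steps from the meta-learned weights already fit a new task.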
- PrivAI Cloud - Privacy-Preserving ML
Production framework for privacy-preserving machine learning in cloud environments. Implements federated learning (client-server architecture), differential privacy (ε-δ guarantees), and secure aggregation for distributed model training without centralizing sensitive data.
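The interaction between clipping, averaging, and calibrated noise (DP-FedAvg style) can be illustrated in a few lines. This is a simplified sketch with assumed names and parameters (`dp_federated_average`, `clip_norm`, `noise_mult`) — the actual ε-δ accounting and secure aggregation in the framework are much more involved:

```python
import numpy as np

def dp_federated_average(client_updates, clip_norm=1.0, noise_mult=1.1, rng=None):
    """Average clipped client updates with Gaussian noise (DP-FedAvg style sketch).

    Each update is clipped to L2 norm <= clip_norm (bounding any one client's
    influence), averaged, and Gaussian noise scaled to that sensitivity is added.
    """
    if rng is None:
        rng = np.random.default_rng()
    clipped = []
    for u in client_updates:
        norm = np.linalg.norm(u)
        clipped.append(u * min(1.0, clip_norm / max(norm, 1e-12)))
    mean = np.mean(clipped, axis=0)
    sigma = noise_mult * clip_norm / len(client_updates)   # noise std per coordinate
    return mean + rng.normal(0.0, sigma, size=mean.shape)
```

Clipping is what makes the noise meaningful: it caps the sensitivity of the average to any single client, which is the quantity the Gaussian mechanism's guarantee is stated against.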
- Attention Mechanisms from Scratch
Comprehensive implementation of attention mechanisms from first principles: scaled dot-product attention, multi-head attention, cross-attention, and self-attention. Includes visualizations of attention weights and exploration of key/query/value transformations—foundational building blocks for understanding transformer architectures.
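The first of those building blocks, scaled dot-product attention, fits in a dozen lines of NumPy (a minimal sketch of the standard formula, not the repo's code):

```python
import numpy as np

def softmax(x, axis=-1):
    z = np.exp(x - x.max(axis=axis, keepdims=True))   # shift for numerical stability
    return z / z.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(Q, K, V, mask=None):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.swapaxes(-1, -2) / np.sqrt(d_k)    # (..., n_q, n_k)
    if mask is not None:
        scores = np.where(mask, scores, -1e9)         # large negative => ~zero weight
    weights = softmax(scores, axis=-1)                # each query's weights sum to 1
    return weights @ V, weights
```

Self-attention is the case Q = K = V (after per-role linear projections); cross-attention uses queries from one sequence against keys/values from another; a causal (lower-triangular) mask gives the autoregressive variant.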
- Variational Autoencoder (VAE)
From-scratch implementation of Variational Autoencoders for generative modeling. Explores latent space representation learning, the reparameterization trick, KL-divergence regularization, and probabilistic generation. Includes latent space interpolation and disentanglement experiments.
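The two VAE-specific ingredients — the reparameterization trick and the closed-form KL term against a standard normal prior — reduce to a few lines (a NumPy sketch of the standard formulas, with assumed function names):

```python
import numpy as np

def reparameterize(mu, log_var, rng):
    """z = mu + sigma * eps: moves the randomness into eps so that z stays
    differentiable with respect to the encoder outputs mu and log_var."""
    eps = rng.normal(size=mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, sigma^2) || N(0, I) ), summed over latent dims, averaged over batch."""
    return np.mean(-0.5 * np.sum(1 + log_var - mu**2 - np.exp(log_var), axis=-1))
```

The KL term is zero exactly when the posterior matches the prior (mu = 0, log_var = 0) and grows as the encoder drifts away — the regularizer that keeps the latent space usable for sampling and interpolation.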
- TextFusion - Advanced Text Generation
Text generation and fusion system that combines multiple NLP techniques to produce coherent long-form content.
- ML Optimization Algorithms
Collection of optimization algorithms implemented from scratch, including Adam, SGD with momentum, RMSprop, and advanced techniques for neural network training.
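As an example of the from-scratch style, here is a single Adam update in NumPy (the textbook Kingma & Ba formulation; `adam_step` and the `state` dict are assumed names, not the collection's API):

```python
import numpy as np

def adam_step(param, grad, state, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update; state holds moment estimates m, v and step count t."""
    state["t"] += 1
    state["m"] = beta1 * state["m"] + (1 - beta1) * grad       # first moment (momentum)
    state["v"] = beta2 * state["v"] + (1 - beta2) * grad**2    # second moment (RMS scaling)
    m_hat = state["m"] / (1 - beta1 ** state["t"])             # bias correction: early
    v_hat = state["v"] / (1 - beta2 ** state["t"])             # estimates start at zero
    return param - lr * m_hat / (np.sqrt(v_hat) + eps)
```

Adam is essentially SGD-with-momentum (the m term) combined with RMSprop's per-parameter scaling (the v term), plus bias correction — which is why those three appear together in the collection.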
- Langevin Dynamics Sampling
Implementation of Langevin dynamics for sampling from complex distributions. Applications in Bayesian inference and generative modeling.
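The unadjusted Langevin update — gradient ascent on log-density plus injected Gaussian noise — is compact enough to show whole (a minimal sketch; `langevin_sample` is an assumed name, and a real sampler would add Metropolis correction or step-size schedules):

```python
import numpy as np

def langevin_sample(grad_log_p, x0, step=0.01, n_steps=1000, rng=None):
    """Unadjusted Langevin: x_{t+1} = x_t + step * grad_log_p(x_t) + sqrt(2*step) * noise.

    The drift pulls samples toward high-density regions; the noise keeps the
    chain exploring, so its stationary distribution approximates p."""
    if rng is None:
        rng = np.random.default_rng()
    x = np.array(x0, dtype=float)
    samples = []
    for _ in range(n_steps):
        x = x + step * grad_log_p(x) + np.sqrt(2 * step) * rng.normal(size=x.shape)
        samples.append(x.copy())
    return np.array(samples)
```

Only the score (gradient of log-density) is needed, never the normalizing constant — which is what makes the method usable for Bayesian posteriors and score-based generative models.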
- FlashFeat - Fast Feature Extraction
High-performance feature extraction pipeline for ML preprocessing. Optimized for speed and scalability in production environments.
- AI-Driven Course Generation Platform
🏆 Hackathon Winner: Auto-generates comprehensive training courses for enterprises within seconds using fine-tuned LLM system. Reduced content creation time by 90% (hours → minutes). Architected serverless backend using AWS Lambda, S3, and API Gateway handling real-time course assembly and delivery at scale.
- Automated Multi-Cloud Deployment
🏆 Hackathon Winner: Intelligent multi-tier, multi-cloud deployment automation using Terraform and AWS Lambda. Achieved 30% faster release cycles through infrastructure-as-code templating and automated cost analysis that informs spending decisions across AWS/GCP/Azure environments.
- Multi-Cloud Observability Tool
Production-grade serverless monitoring solution for multi-cloud infrastructure. Achieved 65% reduction in operational costs and 30% increase in system availability through event-driven serverless architecture. Aggregates metrics, logs, and traces across AWS/GCP environments in real-time.
- nanoChat - Lightweight LLM Chat
Minimal chat interface powered by small language models. Focus on efficiency and low-latency inference for production deployment.
- LSTM Networks from Scratch
Educational implementation of Long Short-Term Memory networks. Explores sequential modeling and gradient flow in recurrent architectures.
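A single LSTM step makes the gradient-flow point concrete: the cell state is updated additively through the forget/input gates rather than repeatedly squashed. A NumPy sketch (assumed names and a stacked-gate weight layout, not the project's exact code):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_cell(x, h_prev, c_prev, W, b):
    """One LSTM step. W: (d_in + d_hidden, 4 * d_hidden), gates stacked [i, f, o, g]."""
    d = h_prev.shape[-1]
    z = np.concatenate([x, h_prev], axis=-1) @ W + b
    i = sigmoid(z[..., :d])          # input gate: how much new content to write
    f = sigmoid(z[..., d:2*d])       # forget gate: how much old state to keep
    o = sigmoid(z[..., 2*d:3*d])     # output gate: how much state to expose
    g = np.tanh(z[..., 3*d:])        # candidate cell content
    c = f * c_prev + i * g           # additive update => gradients flow along c
    h = o * np.tanh(c)
    return h, c
```

The `c = f * c_prev + i * g` line is the whole story for vanishing gradients: when f is near 1, gradients pass through the cell state largely unattenuated across time steps.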
- makemore - Character-Level LM
Character-level language model for generating text. Builds understanding of autoregressive modeling and neural language generation from first principles.
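The simplest autoregressive character model is a bigram table: count character transitions, normalize into conditional probabilities, and sample one character at a time. A NumPy sketch in that spirit (counts-based, with assumed names — makemore itself progresses from this to neural variants):

```python
import numpy as np

def fit_bigram(text):
    """Character-level bigram model: P(next | current) from transition counts."""
    chars = sorted(set(text))
    stoi = {c: i for i, c in enumerate(chars)}
    counts = np.ones((len(chars), len(chars)))         # add-one smoothing
    for a, b in zip(text, text[1:]):
        counts[stoi[a], stoi[b]] += 1
    probs = counts / counts.sum(axis=1, keepdims=True) # each row is a distribution
    return probs, stoi, chars

def sample(probs, stoi, chars, start, n, rng):
    """Autoregressive generation: each character conditions only on the previous one."""
    out = [start]
    for _ in range(n):
        out.append(rng.choice(chars, p=probs[stoi[out[-1]]]))
    return "".join(out)
```

Every neural language model in this list is a refinement of the same loop — model P(next token | context), then sample from it repeatedly.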
- Grawl - Data Collection Framework
Web crawling and data extraction framework for ML training data collection. Scalable architecture for gathering and processing large-scale datasets.
- KAUTILYA - ML Strategy Framework
Strategic framework for ML system design and deployment. Named after the ancient Indian strategist, focuses on principled approaches to AI engineering.