Projects
- TextFusion - Diffusion Models for Text and Code
Research implementation of discrete and continuous diffusion models for text and code generation. Builds D3PM, DDPM, and flow-matching variants with time-conditioned transformers, custom noise schedules, code-focused datasets, and evaluation pipelines.
- FlashFeat - Real-Time Feature Infrastructure
Low-latency, tamper-resistant data path for features that feed real-time AI systems. Designed around single-digit-millisecond access patterns and production-style performance constraints rather than offline preprocessing.
- Mixture of Experts Small Language Model
From-scratch Mixture of Experts language model with sparse routing, gating, and conditional computation. Explores expert specialization and the efficiency trade-offs of scaling small language models with MoE layers.
- Grawl - Distributed LLM Data Crawler
Distributed web crawler for LLM data collection with concurrent workers, RedisBloom-based deduplication, and infrastructure automation for scalable crawling and dataset quality control.
- Model-Agnostic Meta-Learning (MAML)
Research implementation of MAML for few-shot learning with inner- and outer-loop optimization. Focuses on second-order gradient updates, task sampling, and rapid adaptation to new tasks with minimal examples.
- Terraforge - Smart DevOps Assistant
Hackathon-winning deployment assistant that generates Terraform for predefined stacks and custom infrastructure. Pairs a conversational interface with in-house execution and self-testing for transparent DevOps workflows.
- nanoGPT - Decoder-Only Transformer in PyTorch
Minimal, self-contained decoder-only Transformer written from scratch in PyTorch. Reimplements GPT training and inference in a small codebase for studying attention, tokenization, and language-model training dynamics.
- CheeseCake - AI Course Generation Platform
Hackathon-winning platform for auto-generating workplace learning courses and gamifying training flows. Product-focused build that pairs generative course creation with a TypeScript front end for fast iteration.