Work
Projects
A selection of production systems and open-source projects focused on applied machine learning and data analytics.
AI Judge — SCOTUS Verdict Backtester
Can AI predict Supreme Court rulings? Retrieves 200 landmark SCOTUS decisions from CourtListener and runs a RAG pipeline with temporal leakage prevention — Claude 3.5 Haiku on AWS Bedrock, FAISS vector search, MLflow experiment tracking, and a Streamlit dashboard. 45.7% backtested accuracy.
Fate/RAG — RAG Chatbot (Archived)
Production-grade RAG chatbot for the Fate Series universe on AWS. Full serverless stack: FastAPI on Lambda via Mangum, API Gateway, OpenSearch Serverless k-NN vector search, and AWS Bedrock (Claude Sonnet + Titan Embeddings). GitHub Actions CI/CD pipeline runs pytest on every PR and auto-deploys to AWS via CDK on merge to main. Docker Compose for local dev.
Pokémon Red — RL Agent
Modular reinforcement learning codebase training a PPO agent to play Pokémon Red from pixel observations. CNN policy network (~1.7M params), parallel environment vectorisation, TensorBoard monitoring, and checkpoint management — all running locally.
Super Bowl LX Prediction
XGBoost binary classifier trained on 25 years of NFL data to predict Super Bowl outcomes. Key finding: regular season wins contribute 0% feature importance — defense (37%) and offense (35%) dominate. Honest evaluation with GridSearchCV and 5-fold CV.
More projects in progress. Want to collaborate?
Get in Touch