Work
Projects
AI agent systems, multi-agent simulations, and production ML pipelines.
Polybot — Autonomous Polymarket Trading Bot
AWS Lambda bot that reads live Polymarket tech markets every 15 minutes, forms independent probability estimates with a Haiku → Sonnet Claude cascade, and places small bets when it sees edge — with a homegrown insider-trading detector as a defensive filter. Risk guardrails cap exposure ($10/trade, $100/day), a bug-detection circuit breaker halts on slippage spikes or API errors, and every decision is audit-logged to DynamoDB and S3. Runs for under $1/day on AWS.
AI Judge — SCOTUS Verdict Backtester
A RAG pipeline that predicts Supreme Court rulings. Ingests 200 landmark SCOTUS decisions from CourtListener, embeds them with Amazon Titan into a FAISS index, then uses Claude on AWS Bedrock to render verdicts with top-3 precedent retrieval. Backtests accuracy against real historical outcomes with temporal integrity enforced — the model never sees future rulings. Final accuracy: 45.7% across 46 cases with ground-truth verdicts; fixing temporal leakage dropped it from 47.8% (that 2.1% gap was the model "cheating" by seeing the future).
Calgary Housing Intelligence
Fully automated intelligence pipeline that scrapes 21 years of housing, oil, and economic data; trains three XGBoost models (annual, monthly, daily); scrapes live news via RSS; and auto-generates a market intelligence report with a live Housing Pressure Score. Runs biweekly on the 1st and 15th.
More projects in progress. Want to collaborate?
Get in Touch