Projects

Here are some projects I carried out for uni / fun.

Current research

  • Turn-taking prediction in duplex models in progress msc dissertation 2026 MSc dissertation work on whether sparse autoencoders can recover timing-related features in duplex speech-model representations.
  • Circuit Atlas prototype interpretability interface 2026 Prototype interface for exploring SAE feature clusters and their interactions across transformer layers.
  • Moshi turn-taking data pipeline working prototype speech data tooling May 2026 Supporting data pipeline for conversational audio: discovery, diarisation, transcription, Mimi encoding, and turn-yield labels.

Coursework and degree work

  • Reward hacking under GRPO completed atnlp coursework 2026 ATNLP coursework on GRPO for mathematical reasoning, focused on failure cases where the reward signal encourages shallow shortcuts.
  • WFST-based ASR decoding completed asr coursework 2025 ASR coursework implementing a WFST decoder with beam pruning, silence modelling, and KenLM language-model scoring.
  • Histopathology image segmentation completed computer vision coursework 2026 Computer vision coursework: UNet nucleus segmentation, then supervised and SimCLR-based classification.
  • Biological knowledge graph construction completed undergraduate dissertation 2023-2024 Undergraduate dissertation using transformer-assisted triple extraction on biological abstracts, then turning the output into graph structure.

Other projects / experiments

  • AI tutor with knowledge tracing prototype learning tool 2025-2026 A personal learning-tool prototype around concept graphs and learner state; still more design exploration than product.
  • Emotion vectors for Gemma exploratory prototype interpretability Apr 2026 Exploratory interpretability pipeline for extracting emotion-related activation directions from Gemma activations.
  • Obsidian and MCP automation infrastructure ongoing personal infrastructure 2025-2026 Local note-search and automation tooling around Obsidian, MCP, BetterTouchTool, and personal knowledge workflows.
  • Snake AI early project, revisited game AI 2020 / 2026 Small Snake AI experiments, from older deep Q-learning code to newer checkpoint and evaluation work.
  • Pathfinder early experiment Swift / spatial data Aug-Sep 2023 Swift pathfinding and spatial-data-structure experiments, including quadtree and floorplan/SVG work.
  • Speaker diarisation and claim verifier rough utility speech tooling Dec 2024-Sep 2025 Rough speech tooling for diarising and transcribing long audio, then experimenting with claim extraction over the result.