Roles, builds, and the toolbox behind them.
The CV with marginalia. Each entry has a footnote. Available as PDF for the more conventional version.
shubz@orion~/home%git log --since=2019 --reverse | tac# 2019 → present
§02Trajectory
May 2026 —
BlueDot AGI Research cohort
Pre-reading + weekly seminars on AGI strategy, alignment, and the policy landscape around frontier AI.
BlueDot Impact
Feb 2026
Elected MInstP · Institute of Physics
Member of the UK's professional body for physics.
London
2025 —
Mentor · Stemmettes, InnovateHer, I'm a… Programme
Three orgs running workshops, panels, and one-on-ones for girls, non-binary kids, and under-represented young people moving toward engineering, maths, and CS. DBS-checked.
STEM-ed outreach
Apr 2024 —
Founding Engineer · Orion
Building geospatial intelligence infrastructure from scratch — data pipelines for high-volume multi-source streams, ML anomaly detection over H3 spatial indexing, backend services in Python and Golang. Joined as Data Engineer (Apr 2024 → Nov 2024), then Data & ML Engineer (Nov 2024 → Aug 2025), then Founding Engineer.
London
Oct 2023 —
Student Ambassador · Oxford Mathematical Institute
Working with prospective applicants at the Mathematical Institute and Lady Margaret Hall — answering questions, sitting on Q&A panels, supporting the application process.
Oxford · LMH
Aug 2025
Best Paper · IEEE ICCUBEA-2025
"Decoding Flipkart-Walmart Merger" — theme extraction and sentiment mining across four years of news coverage. 1,240 submissions, 220 accepted, one Best Paper. Co-authored with Prof. Chakraborty.
co-author
Sep 2023 – Jul 2024
MFoCS · Oxford · Lady Margaret Hall
Thesis: automated immune biomarker detection on multiplex immunofluorescence data from tumour samples. Modules included Geometric Deep Learning, Computational Game Theory, Quantum Information, and Category Theory.
Math + CS
Sep 2022 – May 2023
BSc Honours Thesis · GNSS atmospheric water vapour
Estimated tropospheric water-vapour content from GNSS signals using regression-based ML models, outperforming traditional linear baselines on accuracy and robustness to atmospheric variability.
FLAME · Prof. Gopalan
Jun – Sep 2023
ML Engineering Intern · Natter
Content moderation models for bullying, mental-distress, and spam detection on social text, plus a user-matching recommender.
London
May 2023
BSc (Hons) Computer Science · Dean's Roll of Honour · FLAME University
Graduated ranked 3rd in cohort (from 41st in year one).
CGPA 8.53
May – Jul 2022
Backend Developer Intern · JobsForHer
Built user-facing flows in Flask, queried PostgreSQL, configured and deployed scaleable services on AWS (EC2, S3, load balancers, CloudWatch, SNS).
Bangalore
Jan – Aug 2022
Research Intern · Prof. Jayaraman Valadi, FLAME
Improved the JAYA stochastic optimizer — size-penalised fitness function and elitism — for protein function identification on three biological datasets. Poster PP-28 at INBIX'22, VFSTR.
Bioinformatics
2022 – 2023
Founder + President · Dotslash, The Technology Club
Founded and led a 10+ person team running technology events — interactive workshops, quizzes, talks — pulling 100+ attendees per session.
FLAME University
2022 – 2023
Vice-Captain · Student Council
Automated the manual process for allocating six annual student awards using Python and the Google Sheets API. Sat on the council that ran the year's events.
FLAME University
Oct 2021 – Apr 2022
Data Analyst & Researcher · Centre for Knowledge Alternatives
Cultural mapping of Kolhapur district for the Discover India Project — government datasets, Tableau, sector reports. Led a five-person team analysing law and order data.
Pune
Aug 2021 – Jan 2022
Research Intern · Prof. Kaushik Gopalan, FLAME
Novel technique to derive cloud information from INSAT-3D satellite data — pre-processed, thresholded, and adjusted radiation data to separate clear-sky and cloudy pixels in half-hourly satellite images using NumPy, SciPy, PyHDF and NetCDF. Visualised 20+ cloud-incidence results across India.
Atmospheric ML
Jun 2020 – Jun 2021
Co-founder · UnisphereCo
Online platform for the undergraduate-application process: mentors from 50+ universities, 200+ attendees per session.
Hyderabad
Jul 2019 – Jul 2021
Founder · PrintedCraft
Personalised printing business. Built a Python connector to a cloud product database for real-time promotions. Integrated Google + Facebook Ads to land on Google's first page. 100+ product designs, suppliers across the country.
Bangalore
shubz@orion~/home%ls -la ./builds/# selected · open · founding
§03Selected work
01founding-eng2024 —02deployed · solo2026 —03deployed · solo2025 —04open202605Oxford · 2024202406academic · Best Paper2023–2507open · research202208honours thesis202309coursework202210open2026
Orion intelligence platform
Geospatial intelligence over H3 spatial indexing — data infrastructure for high-volume multi-source streams, ML anomaly detection and confidence-weighted signal fusion, backend services in Python and Golang. Joined as one of two founding engineers (first hires after the founders); building the data stack from scratch and currently rewriting the LLM-driven search system end-to-end — tool schemas, decision-tree system prompts, few-shot disambiguation, output evaluation.
fashion-web ↗
Predicts which fashion micro-trends will peak 30 days out. Five live sources (Google Trends, Bluesky firehose, Pinterest, Reddit, YouTube) merge into a Holt-smoothed composite signal tracking 250+ terms across silhouettes, colours, and aesthetics. A logarithmic prediction market (LMSR, b=100) lets traders compete against the house model with virtual StyleCoins. Gemini 2.5 Flash breaks down outfit photos and matches them against runway collections from eight major houses. House predictions are Brier-scored.
written up: ↗ Pricing the next scarf
platypus-learn ↗
Turns PDFs and YouTube videos into structured courses. Drop in a paper or a lecture; get back a coherent learning path with assessments, a weekly goals dashboard, and an email digest. Built with Next.js, Supabase, Claude API, and Resend.
Taylor Swift recommendation engine ↗
Six engines running in parallel — Sentence-BERT on lyrics, a variational autoencoder compressing Spotify audio features to 16 dimensions, graph node2vec embeddings, neural collaborative filtering, contrastive self-supervised learning, and a weighted ensemble with a consensus boost for cross-engine agreement. Runs over Taylor's full discography plus hand-curated editorial bridges to related artists. Rebuilt from an R/Shiny original in TypeScript + FastAPI; recommendations in under 200ms.
written up: ↗ Six engines for one songbook
MFoCS thesis · immune biomarkers
Two problems in automated biomarker detection on multiplex immunofluorescence data from real tumour samples. First: four automated thresholding methods (Otsu, IsoData, custom GMM, minimum cross-entropy) competing to replicate a pathologist's gate — CTLA-4 resisted all of them, because the biology is genuinely ambiguous. Second: spectral clustering with biological co-expression constraints encoded directly in the similarity matrix, beating five of seven standard methods on plausibility. Thesis available in Oxford repository.
written up: ↗ Positive by how much
Flipkart–Walmart merger NLP
Theme extraction and sentiment mining across four years of news coverage of the $16B acquisition. Identified dominant narrative clusters and tracked sentiment shifts through deal stages using SpaCy, NLTK, Gensim, and TextBlob. Best Paper at IEEE ICCUBEA-2025 from 1,240 submissions.
JAYA protein feature selection ↗
Modified a parameter-free stochastic optimizer with a size-penalty fitness function and elitism to improve feature selection for protein function identification. Tested on three biological datasets; consistently produced smaller feature sets without sacrificing classification accuracy. Poster PP-28 at INBIX'22.
written up: ↗ JAYA, improved
GNSS water-vapour ML
Estimated tropospheric water-vapour content from GNSS signals using regression models, outperforming traditional linear baselines on accuracy and robustness to atmospheric variability. BSc honours thesis, FLAME University.
Nand2Tetris — 16-bit computer
Built a working general-purpose computer from NAND gates up: all 13 projects from HDL gate design through machine language, VM translator, compiler, and a minimal operating system.
this site ↗
Started as a single-file HTML + CSS + inline React app; migrated to Astro during the astro-migration branch. Essays, animated diagrams, custom typography.
shubz@orion~/home%cat ./toolbox.md# what built what
§04Toolbox
AI safety.
AI alignment research↗ a unit on bending the curve↗ bluedot AGI strategy cohort · /nowAI ethics & governance↗ bluedot AGI strategy cohort · /now
AI/ML.
Python↗ orion backend services · /work↗ six engines (2026)R↗ taylor rec (original R/Shiny) · CV↗ FLAME peer tutoring · CVNumPy · SciPy · Pandas↗ positive by how much (MFoCS, 2024)↗ six engines (2026)matplotlib↗ jaya plotsPyTorch · scikit-learn↗ positive by how much↗ cells that can't existSpaCy · NLTK · Gensim · TextBlob↗ flipkart-walmart merger NLP · CVggplot · dplyr · tidyr · shiny · purrr↗ taylor rec (R/Shiny) · CVClaude API · Anthropic SDK↗ platypus-learn (PDFs/videos → courses)↗ used to redesign fashion-web · /nowGemini API (2.5 Flash)↗ fashion-web outfit recognitionPrompt engineering · tool schemas↗ orion search system · /nowFew-shot disambiguation↗ orion search · /nowDecision-tree system prompts↗ orion search system · /nowLLM evaluation · output testing↗ orion search rewrites · /nowFeature selection (JAYA-style)↗ jaya, improvedGenetic algorithms · crossover/mutation↗ jaya, improvedRegression modelling↗ honours thesis · GNSS water vapour · CVSentiment + theme extraction↗ flipkart-walmart merger · CVWeb scraping↗ merger NLP · CV↗ natter data collection · CV
Mathematics.
Computational Game Theory↗ Oxford MFoCS module · CVGeometric Deep Learning↗ Oxford MFoCS module · CVGraph Theory↗ Oxford MFoCS module · CVCategory Theory↗ Oxford MFoCS module · CVAlgorithmic Foundations of Collective Decision Making↗ Oxford MFoCS module · CVComputational Complexity↗ Oxford MFoCS module · CVCryptography · number theory↗ cipher program (5-layer) · CVLinear algebra · transforms↗ GNSS regression · CV↗ cipher program · CV
Physics.
Quantum Information↗ Oxford MFoCS module · CV↗ two colours and a HadamardZX-rewriting↗ two colours and a HadamardGNSS atmospheric modelling↗ honours thesis · water vapour · CVSatellite remote sensing↗ INSAT-3D cloud detection · CVPyHDF · NetCDF↗ INSAT-3D cloud detection · CV
Biotech.
Bioinformatics↗ jaya · proteins↗ MFoCS thesis · biomarkersConstrained · spectral clustering↗ cells that can't existAutomated thresholding↗ positive by how muchImage processing (multiplex IF)↗ MFoCS thesis · biomarkersExplainable AI (WOE binning)↗ protein sequence decoding · CVLotka–Volterra dynamics↗ predator and prey
Geospatial.
Rust↗ orion (founding) · /workGolang↗ orion (founding) · /workGeospatial ML · H3 indexing↗ orion · /workAnomaly detection↗ orion · ML over H3 · /workConfidence-weighted signal fusion↗ orion · multi-source fusion · /workETL · data pipelines↗ orion · multi-source ingestion · /work
fintech.
LMSR prediction markets↗ pricing the next scarfTime-series forecasting↗ fashion-web (Holt-smoothed composite)statsmodels · Holt-Winters↗ fashion-web composite signalCalibration · Brier scoring↗ fashion-web (house predictions)Recommendation systems↗ six engines↗ natter user matching · CVEnsemble methods↗ six engines · consensus boost
Hardware.
C++↗ IoT weather bot · ThingSpeak · CVIoT↗ IoT weather bot · CVThingSpeak · microcontrollers↗ IoT weather bot · CVCompiler · VM design↗ Nand2Tetris (all 13 projects) · /workHDL · gate-level design↗ Nand2Tetris (NAND → computer) · /work
Infrastructure.
TypeScript↗ this site repo↗ six engines (TS + FastAPI rebuild)SQL · PostgreSQL↗ jobsforher backend · CVAstro · React islands↗ this site repoNext.js · Supabase · Resend↗ platypus-learn (deployed)Vercel↗ fashion-web↗ platypus-learnAWS · EC2 · S3 · CloudWatch · SNS↗ jobsforher deploy + monitor · CVTableau↗ kolhapur cultural mapping · CVFlask↗ jobsforher web app · CVGoogle APIs · Sheets↗ student council awards automation · CVGit · CI/CD↗ all engineering work · /workD3 · hand-rolled SVG↗ all essay diagrams↗ jaya figuresFastAPI↗ six engines backendUI / UX design↗ this site↗ PrintedCraft CMS (100+ designs) · CV
Outreach.
Team leadership↗ Dotslash tech club (10+ people) · CV↗ Kolhapur project (5 people) · CVMentoring under-represented young people↗ Stemmettes + InnovateHer · /work↗ Oxford Mathematical Institute Ambassador · CVTeaching↗ FLAME Quantitative Centre peer tutor (R + Python) · CVSTEM-ed outreach↗ Stemmettes · InnovateHer · I'm a… Programme · /workDBS-checked (UK)↗ cleared for under-18s work · /workResearch writing · academic papers↗ IEEE ICCUBEA-2025 Best Paper · /work↗ MFoCS thesis · biomarkers (Oxford, 2024)Conference presentations↗ INBIX'22 (VFSTR) · CV↗ IEEE ICCUBEA-2025 · /workEvent organising · community building↗ UnisphereCo (200+ attendees / session) · CV↗ Dotslash (100+ attendees / event) · CVCross-cultural collaboration↗ India ↔ UK working contexts · /workTechnical documentation↗ Natter — docs + stakeholder presentations · CVStakeholder communication↗ Natter — docs + cross-team presentations · CVFounding / building from zero↗ orion (one of two founding engineers) · /work↗ UnisphereCo (200+ attendees) · CVDigital marketing · SEO · Google/Facebook Ads↗ PrintedCraft (Google front-page) · CVGraphic · product design↗ PrintedCraft (100+ designs) · CVQualitative · cultural research↗ kolhapur · Discover India Project · CV
shubz@orion~/home%cat ./speaking.md
§05Speaking & STEM outreach
StemmettesMentoring girls and non-binary young people into engineering, maths, and computer science2025 —
InnovateHerSTEM Ambassador — running sessions for under-represented students in tech2025 —
"I'm a… Programme"Science communicator, CS & Mathematics2025 —