Who I’m I?
I’m Tobi, A data engineer and data scientist, with experience with Python, Palantir Foundry Large Language Models.
I turn messy, fragmented data into reliable pipelines and actionable insights. Faster.
Whether it’s extracting your Excel worksheets, using Machine learning to understand who buys your products. Using AI to summarise large bodies of text.
Happy to help. Using your tools helps you get the outcome you want.
Want to discuss your data project? Let’s see if you're the right fit.
-
200+ hours/month saved Built an LLM redaction pipeline that anonymized clinical notes and gave clinicians their lives back.
Healthcare
-
£200k annual savings. With powerBI Dashboard for SharePoint Migration
Finance + Operations
-
95% reduction in ETL prep time Engineered pipelines processing 1M+ transactions across 5 years of data using PySpark & Palantir Foundry.
Sports + Retail
Featured Projects
Healthcare NLP
Problem: Clinicians were manually reviewing and redacting sensitive patient data from hundreds of notes — a slow, error-prone process that ate into time that should've been spent on actual patients.
Built: An LLM-powered redaction pipeline that automatically anonymized clinical notes at scale.
Result: Saved clinicians 200+ hours/month. Cut insurance eligibility response time from 10–20 minutes down to 30 seconds.
Sports Fan ML Segmentation
Problem: A sports analytics startup was sitting on a goldmine of fan data. Demographics, purchase history, ticketing. But had no way to turn it into targeted marketing.
Built: An ML clustering model that segmented fans by behaviour and demographics, giving the marketing team actual signal instead of spray-and-pray campaigns.
Result: 40% increase in fan engagement.
Large-Scale Sports Data ETL
Problem: Retail, ticketing, and merchandise data was scattered across 10+ sources with no unified pipeline — making any kind of analysis a manual nightmare.
Built: A Palantir Foundry + Apache Spark pipeline ingesting 5TB+ of data daily, integrating 1M+ transactions across 5 years of historical data.
Result: 95% reduction in ETL prep time.
Services / Outcomes
Stop flying blind on ops
Data pipelines that actually run reliably → Data Eng
Dashboards your team uses
KPIs in one place → Dashboards
Automate the repetitive stuff
LLMs handling docs, reports, Make the most use of your data using AI.
Book a call
30 mins. Let's see if it's a fit.
What’s next?
Send an email
Prefer async? Feel free to reach out!