Who am I?

I’m Tobi, a data engineer and data scientist with experience in Python, Palantir Foundry, and large language models.

I turn messy, fragmented data into reliable pipelines and actionable insights. Faster.

Whether it’s extracting data from your Excel worksheets, using machine learning to understand who buys your products, or using AI to summarise large bodies of text, I’m happy to help.

I work with the tools you already use, so you get the outcome you want.

Want to discuss your data project? Let’s see if we’re a good fit.

  • 200+ hours/month saved. Built an LLM redaction pipeline that anonymized clinical notes and gave clinicians their lives back.

    Healthcare

  • £200k annual savings. Delivered with a Power BI dashboard for a SharePoint migration.

    Finance + Operations

  • 95% reduction in ETL prep time. Engineered pipelines processing 1M+ transactions across 5 years of data using PySpark & Palantir Foundry.

    Sports + Retail

Featured Projects

Healthcare NLP

Problem: Clinicians were manually reviewing and redacting sensitive patient data from hundreds of notes — a slow, error-prone process that ate into time that should've been spent on actual patients.

Built: An LLM-powered redaction pipeline that automatically anonymized clinical notes at scale.

Result: Saved clinicians 200+ hours/month. Cut insurance eligibility response time from 10–20 minutes down to 30 seconds.
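
To give a flavour of the redaction step, here is a minimal sketch. It is illustrative only: the real pipeline used an LLM to find sensitive details, whereas this stand-in uses a few regex patterns. The patterns, labels, and the `redact` function are all hypothetical names made up for the example.

```python
import re

# Hypothetical patterns standing in for the LLM's detection of identifiers.
PATTERNS = {
    "ID_NUMBER": re.compile(r"\b\d{10}\b"),
    "DATE": re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b"),
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
}


def redact(note: str) -> str:
    """Replace every matched identifier with a bracketed placeholder."""
    for label, pattern in PATTERNS.items():
        note = pattern.sub(f"[{label}]", note)
    return note


example = "Seen on 03/02/2024, ID 9434765919, contact jo@example.com"
redacted = redact(example)
```

Rules like these only catch identifiers with a fixed shape. Free-text details such as names and addresses are where a language model earns its place.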

Sports Fan ML Segmentation

Problem: A sports analytics startup was sitting on a goldmine of fan data (demographics, purchase history, ticketing) but had no way to turn it into targeted marketing.

Built: An ML clustering model that segmented fans by behaviour and demographics, giving the marketing team actual signal instead of spray-and-pray campaigns.

Result: 40% increase in fan engagement.
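
For the curious, segmentation of this kind can be sketched in a few lines. This is a bare-bones k-means written for illustration, not the model from the project: the choice of algorithm, the two features, and the sample fans are all assumptions.

```python
from math import dist


def kmeans(points, k, iterations=10):
    """Plain k-means: assign each point to its nearest centroid, then re-average."""
    centroids = [list(p) for p in points[:k]]  # naive init: the first k points
    labels = [0] * len(points)
    for _ in range(iterations):
        labels = [min(range(k), key=lambda c: dist(p, centroids[c])) for p in points]
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical fans: (matches attended per season, merchandise spend in £100s)
fans = [(2, 0.5), (20, 6.0), (3, 0.8), (18, 5.5), (1, 0.2), (22, 7.0)]
labels, centroids = kmeans(fans, k=2)
```

With this toy data the occasional attendees land in one segment and the regulars in the other. On real data the features would normally be scaled first, since k-means is sensitive to the units each feature is measured in.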

Large-Scale Sports Data ETL

Problem: Retail, ticketing, and merchandise data was scattered across 10+ sources with no unified pipeline — making any kind of analysis a manual nightmare.

Built: A Palantir Foundry + Apache Spark pipeline ingesting 5TB+ of data daily, integrating 1M+ transactions across 5 years of historical data.

Result: 95% reduction in ETL prep time.
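
The core idea of unifying scattered sources can be sketched as below. Plain Python stands in for Spark and Foundry here, and the source names, field names, and sample records are hypothetical.

```python
# Hypothetical per-source field mappings onto one shared schema.
FIELD_MAPS = {
    "ticketing": {"txn_id": "id", "amount": "price_gbp", "date": "sold_on"},
    "retail": {"txn_id": "order_no", "amount": "total", "date": "order_date"},
}


def unify(source: str, record: dict) -> dict:
    """Rename a raw record's fields to the shared schema and tag its source."""
    mapping = FIELD_MAPS[source]
    row = {target: record[raw] for target, raw in mapping.items()}
    row["amount"] = float(row["amount"])  # amounts may arrive as strings
    row["source"] = source
    return row


raw = [
    ("ticketing", {"id": "T1", "price_gbp": "45.00", "sold_on": "2023-08-12"}),
    ("retail", {"order_no": "R9", "total": 19.99, "order_date": "2023-08-13"}),
]
unified = [unify(source, record) for source, record in raw]
```

Once every source lands in the same shape, downstream analysis no longer needs to know where a transaction came from.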

Services / Outcomes

Stop flying blind on ops

Data pipelines that actually run reliably → Data Eng

Dashboards your team uses

KPIs in one place → Dashboards

Automate the repetitive stuff

LLMs handling docs and reports, so you make the most of your data using AI.

Book a call

30 mins. Let's see if it's a fit.

What’s next?

Send an email

Prefer async? Feel free to reach out!