Data Scientist & Analytics Specialist

Turning
data into
decisions.

I specialise in predictive modelling, operational intelligence, and decision-support analytics — bridging technical depth with business context across housing, healthcare, and e-commerce.

£200m+
Annual capital investment informed by data models
25%
Revenue growth driven through behavioural analytics
45%
Waste reduction via SQL-driven performance monitoring
15%
Conversion rate uplift through A/B testing & ML
Manchester, United Kingdom  ·  Open to opportunities
Data.

The tools
of the trade.

My toolkit spans the full analytics pipeline — from raw data engineering and modelling through to executive-ready dashboards and automation. I choose tools for the problem, not the other way around.

Programming & Analytics
PythonSQLRDAXPower Query (M)
Machine Learning
Random ForestClassificationClusteringTime SeriesNLP / SentimentFeature Engineering
BI & Visualisation
Power BITableauHolisticsGoogle AnalyticsLooker Studio
Data Engineering
ETL DesignAzure MLSQL ServerPower AutomatePower AppsSharePoint

Where I've made
an impact.

2025 — PresentLondon & Quadrant (L&Q)
Asset Data Analyst
  • Lead asset data governance for stock condition and lifecycle datasets within Keystone/NEC, underpinning Decent Homes compliance and long-term capital planning.
  • Design advanced Power BI models (DAX + Power Query) identifying investment hotspots, underperforming stock, and emerging component risks. £200m+ capex informed
  • Built enterprise dashboards (Decency Trackers, Asset Trackers, Combined Property Search), reducing reporting turnaround and improving operational visibility.
  • Analytical partner to Pre-Construction, Fire, Energy, and regional teams — embedding governance standards and serving as asset intelligence super-user.
2024Royal College of Nursing (RCN)
Data Scientist
  • Led advanced analytics on multi-channel member communication data using EDA, time-series analysis, and behavioural segmentation via machine learning.
  • Applied classification and clustering models to identify high-value and at-risk member segments, enabling precise, data-informed engagement strategies.
  • Delivered executive-ready Power BI dashboards supporting senior leadership in shifting toward structured, data-driven performance monitoring.
2023 — 2024Startech Digital
Primary Data Analyst
  • Drove revenue growth through behavioural analytics, market basket analysis (Apriori), and pricing insight across multiple e-commerce platforms. ~25% revenue growth
  • Integrated Shopify, Google Analytics, and Holistics into automated pipelines, improving data reliability and eliminating manual reporting effort.
  • Built Decision Tree and Random Forest models alongside A/B testing frameworks. ~15% conversion uplift
2019 — 2022Transport Services Ltd
Data & Facilities Operations Officer
  • Designed SQL-driven data collection systems and operational dashboards. ~45% waste reduction
  • Led digitisation of facilities workflows via a Facilities ERP built with PowerApps and Power Automate.
  • Managed ETL migration of legacy operational data, ensuring integrity and improved analytical usability.

Projects &
case studies.

Classification · Python · Azure ML
Predicting Income with Classification Algorithms

Applied Random Forest, KNN, and Decision Tree models to the UCI Adult Dataset to predict income levels. Random Forest achieved ~85.7% accuracy with strong ROC-AUC performance, also implemented in Azure ML Designer.

PythonRandom ForestAzure MLEDA
Time Series · Forecasting
Web Traffic Time Series Forecasting

Forecasted daily Wikipedia article views using ARIMA, LSTM, XGBoost, and Prophet. LSTM and Prophet demonstrated robust performance capturing complex seasonality and trends.

PythonLSTMProphetARIMAXGBoost
Clustering · Segmentation
Customer Segmentation & Behaviour Analysis

Applied K-Means and Hierarchical Clustering to UCI Online Retail data. Used WCSS Elbow and Silhouette Scoring to identify four distinct purchasing behaviour profiles.

PythonK-MeansHierarchicalSilhouette
NLP · Sentiment Analysis
Netflix Streaming Sentiments & Text Mining

Analysed Netflix content using VADER sentiment scoring, N-gram analysis, topic modelling, and word clouds to uncover emotional trends across genres and top actors.

PythonNLTKVADERNLP
Statistical Analysis · R
Global Health Dynamics: Demographics & Fiscal Impacts

Analysed global health systems 2010–2021 using R, examining the interplay between demographics, fiscal allocation, mortality, and equity across diverse socioeconomic contexts.

RStatisticsEDAVisualisation
EDA · Churn Prediction
Customer Churn Prediction — Energy Sector

Diagnosed SME churn drivers for a European energy utility, framing price sensitivity as a data science problem and designing a predictive model to target a 20% discount strategy.

PythonEDAChurn Modelling
SQL · Data Engineering
Data Cleaning with SQL — 56K Row Housing Dataset

End-to-end data cleaning pipeline using SQL: addressed incomplete data, duplicate rows, address splitting, and format standardisation to maximise downstream analytical accuracy.

SQLETLData Quality
Dashboard · Visualisation
D&S Health Tech Sales Dashboard — Looker Studio

Interactive sales dashboard built in Looker Studio surfacing revenue, product performance, and coupon insights from Google Sheets data for a mental health tech organisation.

Looker StudioGoogle SheetsDashboard

Let's work
together.

Whether you're looking for a data scientist to join your team, a consultant to drive an analytics project, or just want to talk data — I'd love to hear from you.