DataStudy
Founder & Data Science Engineer — transforming raw data into actionable intelligence.
Founder & Owner
Co-founded and run DataStudy — a data consulting company helping organisations turn raw information into data-driven strategies.
Published Research
Polymarket arbitrage, YouTube algorithm analysis, LLM-powered mock data generation, and more.
Data Science & AI
Building tools and pipelines across data engineering, machine learning, advanced analytics, and business optimisation.
What is DataStudy?
DataStudy is a data consulting company focused on empowering businesses with data-driven strategies for efficiency and innovation. The core mission: transform raw information into actionable intelligence.
The company covers four main service areas:
- Data Engineering — building robust data pipelines and infrastructure.
- AI & Machine Learning — developing intelligent systems powered by LLMs, statistical models, and automation.
- Advanced Analytics — extracting insights from complex datasets with rigorous methodology.
- Business Optimisation — translating data findings into operational improvements and strategic decisions.
Founded alongside Coen Tertoogen, who brings financial and business expertise, DataStudy combines deep technical capability with practical business acumen.
Published Research
DataStudy publishes in-depth research articles covering a range of quantitative and engineering topics:
Arbitrage on Polymarket
Theory, practice, and code for detecting risk-free profit in prediction markets. Covers single-market and cross-market arbitrage, Bayesian reasoning, Kelly criterion bet sizing, and automated trading bots.
Read on DataStudy →Mock Data Generator
An LLM-powered system that analyzes a dataset's statistical properties and generates Python code to create privacy-safe, statistically aligned mock data — with a Streamlit frontend.
Read on DataStudy →YouTube Shorts Algorithm
Deep dive into how YouTube's Shorts recommendation algorithm works — analyzing the path from zero to a million subscribers through data.
Read on DataStudy →Projects & Open Source
DataStudy maintains a collection of internal projects and open-source tools:
- Formula 1 Visuals — creating race analytics and performance visualisations using FastF1 and Python.
- DataStudy Stellingen — a dedicated sub-site for data-driven propositions and theses.
- Open-source tools — published on GitHub for the community to use and contribute to.
The Stack
Languages & Frameworks
- Python (pandas, NumPy, scikit-learn)
- Streamlit
- Django / Wagtail CMS
- JavaScript / HTML / CSS
AI & Data
- Large Language Models (LLMs)
- Machine Learning pipelines
- Statistical modelling
- Data visualisation (Matplotlib, Plotly)