London, UK  ·  Data Engineer & Data Scientist

Adil Osman

Data Engineer   ·   Analytics Engineer   ·   Data Scientist

Welcome! I'm a London-based data professional who loves turning raw data into something meaningful. I design end-to-end data pipelines, lakehouse architectures, and analytics-ready warehouses — working across real-time ingestion, CDC, dbt, and machine learning to build systems you can genuinely rely on.

London, UK
First Class BSc · Birkbeck, University of London
Python · SQL · dbt · Airflow · Spark

Building pipelines that hold up.

I'm a London-based data professional with a First Class BSc in Data Science & Computing from Birkbeck, University of London. My work focuses on data engineering and analytics engineering — designing pipelines that are reliable, observable, and deliver data that downstream consumers can actually trust.

I work with real-time ingestion, CDC architectures, local-first lakehouses, and dbt-driven warehouses across domains including finance, sport, urban transport, and environmental data.

All 15 projects
Data Engineering · Real-time

TfL Real-Time Lakehouse  ⭐ 3

A lakehouse for real-time London transport data that runs entirely on a local laptop. Airflow ingests TfL arrivals and stores them as Parquet; dbt and DuckDB transform the data, Great Expectations validates it, and OpenLineage tracks data lineage. The result is a fully observable, production-pattern lakehouse with no cloud dependency.

Python · Apache Airflow · dbt · DuckDB · Parquet · Great Expectations · OpenLineage
Data Engineering · CDC

StreamShop CDC Analytics Stack

Production-style Change Data Capture pipeline: Postgres → Debezium → Redpanda (Kafka API) → ClickHouse. Includes a synthetic e-commerce data generator, Python CDC sink, and dbt models with tests for analytical queries.

Python · Debezium · Redpanda · ClickHouse · dbt · PostgreSQL
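
As a rough illustration of what a Python CDC sink in a stack like this has to do, here is a minimal sketch that replays Debezium-style change events into an in-memory table. It is not the project's code: `apply_change`, `replay`, and the sample `EVENTS` are hypothetical names and data, and the events are assumed to be already unwrapped to the envelope fields `op`, `before`, and `after` that Debezium documents. A real sink would read from Redpanda and write to ClickHouse instead of a dict.

```python
from typing import Any, Dict, Iterable

Row = Dict[str, Any]


def apply_change(table: Dict[Any, Row], event: Dict[str, Any], key: str = "id") -> None:
    """Apply one change event to `table`, which maps primary key -> latest row."""
    op = event["op"]
    if op in ("c", "r", "u"):
        # create, snapshot read, update: upsert the new row image
        row = event["after"]
        table[row[key]] = row
    elif op == "d":
        # delete: the key comes from the old row image
        table.pop(event["before"][key], None)
    else:
        raise ValueError(f"unsupported op: {op!r}")


def replay(events: Iterable[Dict[str, Any]], key: str = "id") -> Dict[Any, Row]:
    """Fold an ordered stream of change events into the current table state."""
    table: Dict[Any, Row] = {}
    for event in events:
        apply_change(table, event, key)
    return table


# Hypothetical sample stream for an orders table (illustrative data only).
EVENTS = [
    {"op": "c", "before": None, "after": {"id": 1, "status": "new"}},
    {"op": "c", "before": None, "after": {"id": 2, "status": "new"}},
    {"op": "u", "before": {"id": 2, "status": "new"}, "after": {"id": 2, "status": "paid"}},
    {"op": "d", "before": {"id": 1, "status": "new"}, "after": None},
]

if __name__ == "__main__":
    print(replay(EVENTS))
```

The sketch depends on events arriving in order per key, which is why CDC topics are usually partitioned by primary key.
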
Analytics Engineering

Premier League Analytics Warehouse

End-to-end football analytics warehouse on BigQuery using dbt and GitHub Actions. Covers Premier League seasons 2014–15 to 2024–25 from openfootball JSON, modelled into analytics-ready tables with full CI.

dbt · BigQuery · GitHub Actions · SQL · Python

The tools I trust.

Python · SQL · dbt Core · Apache Airflow · Apache Spark · DuckDB · PostgreSQL · BigQuery · ClickHouse · Kafka / Redpanda · Debezium · Great Expectations · GitHub Actions

Let's work together.

I build reliable pipelines and analytics-ready data systems. Let's connect.

Get in Touch · LinkedIn