Building enterprise-scale data platforms processing billions of events daily. Expertise in AWS/GCP, Airflow, DBT, Spark, and cutting-edge GenAI solutions with Bedrock, LangChain, and HuggingFace. 10+ certifications, 9+ years delivering measurable business impact at Meta, Amazon, Delivery Hero, and more.
def build_data_platform():
return {
'role': 'Lead Data Engineer',
'specialization': ['GenAI', 'ML Engineering'],
'languages': [
'Python', 'SQL', 'SPARQL',
'R', 'Scala'
],
'cloud': ['AWS', 'GCP', 'Serverless'],
'architecture': [
'Lakehouse', 'Delta Lake',
'Data Mesh', 'Event-Driven'
],
'data_stack': [
'Airflow', 'DBT', 'Databricks',
'Spark', 'Kafka', 'Iceberg'
],
'modern_tools': [
'Great Expectations', 'Dagster',
'Trino', 'dbt Cloud'
],
'genai_ml': [
'Bedrock', 'SageMaker',
'PyTorch', 'QuickSight'
],
'visualization': [
'QuickSight', 'Looker',
'Streamlit', 'Metabase'
],
'scale': 'billions of events/day',
'impact': 'measurable revenue growth'
}
Scroll Down
I'm a Lead Data Engineer and GenAI specialist based in Berlin, Germany, with 9+ years of experience architecting enterprise-scale data platforms and intelligent systems. Working across top-tier companies like Meta, Amazon, Delivery Hero, Goldman Sachs, and leading enterprises, I've built solutions processing billions of events daily, driving measurable revenue growth and operational excellence.
Data Engineering Excellence: I design and deploy metadata-driven data pipelines using Airflow, DBT, Spark, and Kafka, implementing end-to-end data governance with lineage tracking and automated lifecycle management. My expertise spans both AWS (Lambda, Glue, Redshift, S3, SageMaker, Bedrock) and GCP (BigQuery, Cloud Functions, Dataflow), with infrastructure as code using Terraform and orchestration via Kubernetes.
GenAI & Machine Learning: Leading the charge in GenAI adoption, I've integrated Amazon Bedrock, LangChain, and HuggingFace to deploy production-ready LLM-powered AI agents, chatbots, and RAG systems. I build production ML pipelines using TensorFlow, PyTorch, and SageMaker, delivering recommendation engines, predictive analytics, and real-time intelligent automation at scale.
Years Experience
Certifications
Major Companies
Events Processed
Airflow, DBT, Spark, Kafka/Kinesis • Processing billions of events daily • Metadata-driven pipelines with full governance
Amazon Bedrock, LangChain, HuggingFace • Production RAG systems • AI agents & chatbots for automation
AWS (Lambda, Glue, Redshift, Bedrock, SageMaker) • GCP (BigQuery, Dataflow, VertexAI) • Terraform IaC
TensorFlow, PyTorch, SageMaker • Real-time ML pipelines • Recommendation engines • Predictive analytics
Enterprise-scale data platforms and GenAI solutions delivering measurable business impact
Architected and deployed a fully serverless, scalable lakehouse data platform on AWS for a leading manufacturing enterprise, centralizing analytics across manufacturing, operations, and finance. Implemented metadata-driven pipelines with end-to-end governance, lineage tracking, and automated lifecycle management using modern serverless architecture.
Led GenAI adoption by integrating Amazon Bedrock, LangChain, and Streamlit to deploy production-ready LLM-powered AI agents and Q&A chatbots for internal automation and knowledge retrieval, revolutionizing business operations.
Architected multi-stream real-time data systems using Kafka and Kinesis for a leading e-commerce platform, processing billions of events daily to support customer experience optimization, predictive analytics, and dynamic metric dashboards across global markets.
Developed and optimized large-scale analytics solutions on GCP using BigQuery, Spark, and Cloud Functions for a global food delivery platform, processing billions of events daily across multiple international markets, driving millions in revenue through automated decision-making and real-time analytics.
Built and deployed production-grade ML pipelines and recommendation engines using SageMaker, TensorFlow, and PyTorch. Automated deep learning workflows for predictive analytics, customer segmentation, and targeted advertising strategies.
Led infrastructure modernization by introducing Terraform-based IaC across AWS and GCP, improving deployment consistency, reproducibility, and enabling automated multi-cloud infrastructure management at enterprise scale.
Comprehensive guide for Machine Learning and GenAI interviews with solutions, patterns, and best practices. Features dedicated GenAI questions section. Helping thousands of engineers prepare for ML and AI roles at top tech companies.
Full-stack German language learning platform helping learners master German through interactive lessons, practice exercises, and real-world content.
Gamified learning experience for mastering German noun genders (der, die, das) through interactive gameplay, spaced repetition, and adaptive difficulty.
13+ professional certifications across cloud platforms, data engineering, and ML/AI • View Credly Badges
Associate
2025Specialty
2021Associate
2020Foundational
2020ACG
2023ACG
2023Associate
2023Automation (ACG)
2023Fundamentals
2023Apache Spark
2023DAG Auth
2023Specialization
2019Specialization
2016Open source contributions, stats, and popular repositories
I'm always open to discussing new opportunities, collaborations, or just having a chat about data engineering and AI.
Berlin, Germany
Whether you're looking for a lead data engineer, need consultation on cloud architecture, or want to collaborate on innovative AI projects, I'd love to hear from you.