You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Snehasish DuttaSD

Snehasish Dutta

Consultant Data Engineer (Data Platform)

€800/day
Berlin, DE
8-15 years

Average response time: 1 hour

About Snehasish

Your data pipelines shouldn't break at 3 AM.
I'm a Data Engineer with 10+ years of experience designing bulletproof data platforms for companies processing billions of events daily. From Walmart Labs to Zalando to Lidl, I've built the infrastructure that powers real-time analytics, e-commerce, and customer-facing products at scale.
What I bring to your project:
→ Streaming Expertise – Kafka, Flink, Spark Streaming. I design event-driven architectures that handle traffic spikes without breaking a sweat.
→ Modern Data Stacks – Delta Lake, Databricks, dbt. Whether you're migrating to a lakehouse or optimizing what you have, I've done it.
→ Production Mindset – I don't just build pipelines; I build systems that your team can maintain, monitor, and trust.
Certified: Confluent Data Streaming | Databricks
I'm the right fit if you need:

A data platform built from scratch
Migration from batch to real-time processing
Schema design and data modeling for complex domains
Hands-on delivery, not just architecture diagrams

Based in Germany 🇩🇪 | Available for remote projects across the EU
  • English

    Native or bilingual

  • German

    Basic

  • Hindi

    Conversational

Remote only
Primarily works remotely

Experience

  • Lidl e-commerce,
    Consultant Data Engineer (Data Platform)
    E-COMMERCE
    July 2022 - Today (3 years and 11 months)
    Berlin, Germany
    • • Building and maintaining data pipelines for Lidl Plus and e-commerce in a cross-cloud (GCP, Azure), cross-functional team; CI/CD with DevOps, Airflow scheduling, Spark tuning
    • • Architecting event-driven pipelines consuming from GCP Pubsub, SAP, and Snowflake into Delta Lake medallion architecture; processing up to 6M records per event daily
    • • Built real-time stock notification system using Spark Streaming to alert customers when products are back in stock, generating €1.5M additional revenue
    • • Maintaining Spring Boot API ingestion apps; driving architecture simplification and cost optimization; exploring Apache Flink, Lakeflow, and Apache Arrow for platform evolution
    • • Driving innovation: evaluating AI/LLM tools (Claude Code, Gemini CLI) for CI/CD automation; Golang-based Franz for cost-efficient Kafka consumption; published technical blogs
    Apache Kafka Apache Spark Infra as Code Microsoft Azure
  • Zalando Payments,
    Data Engineer (Sole DE in Team)
    September 2020 - January 2022 (1 year and 4 months)
    Berlin, Germany
    • • Owned end-to-end data engineering: onboarded 160+ payment and order events, processing up to 1.5 TB daily on Databricks from Nakadi (Kafka), Snowflake, Redshift, and SAP HANA
    • • Maintained Redshift data warehouse; collaborated with analysts and scientists; monitored pipelines during high-traffic events like Cyber Friday
    Databricks Apache Kafka Data Cleaning and Preprocessing SQL Data Engineer
  • Walmart Labs,
    Software Engineer III (Data Platform
    November 2019 - September 2020 (10 months)
    Bengaluru, Karnataka, India
    • • Canada: Built real-time vehicle tracking system for product returns; worked with data scientists on profit/loss prediction for returns, generating $2M additional revenue annually
    • • UK: Built and maintained data pipeline for ASDA stores to calculate hourly discounts; optimized existing ETL components

Recommendations

Be the first to recommend Snehasish

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • Bachelor of Engineering
    VTU
    2014
    Bachelor of Engineering
  • Confluent Certified Data Streaming Engineer
    Confluent Certified Data Streaming Engineer

Skill set

Categories