Data Lake / Lakehouse Engineer Profiles

LinkedIn profile search — Spark + Iceberg + Cloud (AWS/GCP/Azure) + <10 yrs experience

Apache Iceberg Apache Spark EKS / Kubernetes AWS / GCP / Azure Data Lakehouse
19
Profiles Collected
6
FAANG / Big Tech
3
Cloud Providers
FAANG & Big Tech Companies
Amazon Amit Sharma
Staff Data Engineer • Austin, Texas

Architecting petabyte-scale data infrastructure. Built scalable data ingestion pipelines with Apache Iceberg, Kafka, Spark (Scala/Python), Kubernetes (EKS), Terraform/CDK, Snowflake, and Bedrock/ML Infrastructure. Strong EKS/Iceberg/Spark combination at scale.

Apache Iceberg Apache Spark Kubernetes (EKS) Kafka Terraform/CDK Snowflake AWS
Amazon logana Than palani
Senior Data Engineer • Amazon Web Services (AWS) • Seattle, WA

Technical Data leader for AWS Data Center org who built distributed, scalable data lake systems from the ground up. Architected and operationalized exabyte-scale infrastructure using Apache Iceberg, Trino, Glue, and Lake Formation. Strong lakehouse and data platform credentials.

AWS Apache Iceberg Trino Glue Lake Formation Data Lake
Amazon Joshua Dale
Senior Software Engineer • Amazon Web Services (AWS) • Denver, CO

The Insights team used Flink and Spark for moving data around and Apache Iceberg for the OLAP data lake. Also supported OLTP workloads via Postgres across kubernetes-based platforms. Combines real-time streaming with lakehouse storage.

Apache Iceberg Apache Spark Apache Flink Kubernetes PostgreSQL AWS
Amazon AlokKumar Roy
Sr. Data Engineer • Amazon Web Services (AWS) • Washington DC-Baltimore

Built a near-real-time data lake using Spark and big data processing at scale. Leveraged AWS Bedrock for automated data quality detection. Strong ETL platform background with DDMS (data catalog, lineage tracking, freshness monitoring). US patent-approved ETL platform.

AWS Apache Spark Data Lake AWS Bedrock ETL Data Catalog
Amazon Vivek Rajakumar Jadhav
Data Engineer II • Amazon • AWS Azure Certified

Apache Spark Expert and Apache Iceberg specialist. Engineered a Lakehouse architecture to process large-scale data daily with high reliability. Dual cloud certified (AWS + Azure). Manages Iceberg format for seamless tracking and analysis of customer data at Amazon scale.

Apache Spark Apache Iceberg AWS Azure Lakehouse
Meta Binoy Dutt
Staff Data Engineer • AI & Cloud Data Architect • Meta • Austin, TX

Staff Data Engineer at Meta with expertise in Kubernetes, Buildkite, Cloud Deployment, and Infrastructure Automation. GCP Certified. Architecting AI and cloud data infrastructure at Meta scale.

Kubernetes GCP Meta Infrastructure Automation Cloud Deployment
Meta Daniel Carey
Data Engineer • Meta • New York, NY

Senior software engineer with extensive experience at Meta. Orchestrated workloads in Kubernetes environments with domain-driven and behavioral-driven design. Strong data engineering foundations supporting Meta's data infrastructure.

Kubernetes Meta Data Engineering Domain-Driven Design
Airbnb Zach Wilson
Staff Data Engineer • Airbnb • TC $600k

Staff Data Engineer at Airbnb with $600k total compensation. Entity extraction and NLP with Apache Spark. One of the most followed data engineers on LinkedIn (518k followers) and founder of DataExpert.io. Strong Spark and distributed systems background.

Apache Spark NLP Airbnb Data Engineering
Technology & Consulting Companies
World Wide Technology Kunal Sharma
Senior Data Engineer • World Wide Technology • Gurugram, India

Lakehouse | Snowflake | Databricks | Apache Iceberg | Data Vault 2.0. Multi-cloud expertise across AWS, GCP, and Azure. Strong in Spark, Python, SQL. Certified CCA175 (Spark). Data engineering with broad lakehouse and Data Vault architecture.

Apache Iceberg Apache Spark AWS GCP Azure Snowflake Databricks Data Vault 2.0
Consulting Pooja N.
Lead Data Engineer • Irving, TX • 500+ connections

Lead Data Engineer certified in both AWS and Azure. CCA Spark and Hadoop Developer (CCA175). Strong PySpark and big data processing background across dual-cloud environments.

Apache Spark AWS Azure PySpark Hadoop
NBCUniversal Kiran Kumar Kalli
Senior Data Engineer • NBCUniversal • London, UK • 4.1K+ followers

Senior Data Engineer with 6+ years of experience. GCP and AWS certified. Apache Spark (PySpark), Kafka, Scala, Airflow specialist. Strong data modeling and multi-cloud platform experience at NBCUniversal.

Apache Spark PySpark GCP AWS Kafka Scala Airflow
Apexon Venkateswarlu Bingi
Data Engineer • Apexon • Hyderabad, India

Data Engineer with hands-on experience in Apache Spark, Apache Iceberg, Lakehouse Architecture, Azure, and AWS. Directly matches the lakehouse + Iceberg + Spark + cloud profile criteria.

Apache Spark Apache Iceberg Lakehouse Azure AWS
Alter Domus Kevin Veerasammy
Senior Data Engineer • Alter Domus • Queens, NY • 790+ followers

Senior Data Engineer with AWS & Snowflake and Apache Iceberg Lakehouse expertise. Spark (EMR), Airflow, dbt. Modern data platforms specialist combining Iceberg lakehouse with AWS.

Apache Iceberg Apache Spark AWS Snowflake Airflow dbt
Amazon (W2) Srinivasa Raju K.
Big Data Engineer (ETL) • Amazon Web Services (AWS) • Washington DC

Big Data / ETL Engineer at AWS. Expertise in Spark, SQL, Python, Kafka, Databricks, Snowflake, and Apache Iceberg with Unity Catalog. Led initiatives on Databricks with Iceberg integration for lakehouse governance.

Apache Spark Apache Iceberg AWS Databricks Unity Catalog Kafka
Precisely Sarthak Madan
Data Engineer • Precisely • Delhi, India • 2.3K+ followers

Data Engineer 2 with strong technical stack: Apache Spark, PySpark, Apache Airflow, AWS EMR, Amazon Redshift, Delta Lake, Apache Iceberg, Data Modeling, Data Quality, ETL/ELT. Good all-around data platform profile.

Apache Spark Apache Iceberg AWS EMR Delta Lake Airflow Redshift
Palantir Anvesh P
Senior Data Engineer • Snowflake | Palantir • 870+ followers

Designed lakehouse architectures with Delta Lake and Apache Iceberg while implementing governance through Unity Catalog and AWS Glue Data Catalog. Strong Spark and AWS Glue background. Good combination of lakehouse architecture and governance.

Apache Iceberg Delta Lake Apache Spark AWS Glue Unity Catalog Snowflake
C2S Technologies Rahul Thipparthi
Software Engineer • C2S Technologies, Inc. • Hyderabad, India

Databricks Certified Data Engineer Associate. Hands-on with Apache Iceberg, Kubernetes, Databricks, Linux. Agile Practitioner with strong Spark and distributed systems fundamentals.

Apache Iceberg Databricks Kubernetes Linux Agile
Microsoft Ruchitha Ganga
Senior Data Engineer • AWS, Azure, GCP • Microsoft Certified

Senior Data Engineer working across AWS, Azure, and GCP. Apache Iceberg and Kubernetes expertise with Microsoft Azure Data Engineer Associate certification. Strong multi-cloud data engineering background.

Apache Iceberg Kubernetes AWS Azure GCP Microsoft Certified
Tech / Scale-up Miao Wang
Senior Engineering Executive • 5.1K+ followers

Senior engineering leader who built the Lakehouse — the data foundation supporting $1B+ in product. Built scalable data ingestion with Apache Spark, Kafka, Azure Data Lake, and Apache Iceberg. Strong enterprise-scale data platform credentials.

Apache Iceberg Apache Spark Kafka Azure Data Lake Lakehouse
CITY Furniture Kellon Lewis
Principal Data Engineer / AI Platform Lead • CITY Furniture • Orlando, FL

Principal Data Engineer with expertise across AWS, GCP, Snowflake, Databricks, and Apache Spark. Designed a distributed AWS data lake improving ingestion and processing by 60%. Strong AI platform and lake architecture background.

Apache Spark AWS GCP Snowflake Databricks Data Lake

Profiles sourced via Google search (site:linkedin.com/in) with keywords: data lake / lakehouse + Apache Iceberg + Spark + cloud providers.
Experience levels self-reported on LinkedIn — verify years explicitly before outreach.
Report generated: April 13, 2026