Data Lakehouse & Platform Engineers

LinkedIn Profile Report — Carefully Researched

Strict criteria: Less than 10 years experience | Explicitly USA-based | FAANG & top tech companies | Data lake, lakehouse platform, or SRE for data infrastructure

65 vetted profiles 18 companies
NetflixFAANG
Streaming data platform with Iceberg, Spark, Databricks
#1
Ryan Berti
Netflix
📍 Los Gatos, CA
Apache Iceberg, Data Lakehouse, Streaming
~8 yr | Senior Data Engineer, Iceberg/lakehouse at Netflix
https://www.linkedin.com/in/ryan-berti-4942aa83/
#2
Kasturi Chatterjee
Netflix
📍 San Francisco Bay Area
Apache Spark, Data Engineering
~6 yr | Data Engineer at Netflix
https://www.linkedin.com/in/kasturi-chatterjee-a900715/
#3
Michelle Hartley
Netflix
📍 Los Gatos, CA
Apache Spark, Databricks, Analytics
~7 yr | Senior Analytics Engineer at Netflix
https://www.linkedin.com/in/vivahartley/
#4
Mitesh Mangaonkar
Netflix
📍 San Francisco Bay Area
Apache Spark, Databricks
~6 yr | Data Engineer at Netflix
https://www.linkedin.com/in/mitesh-mangaonkar-92244331/
#5
Saul Cruz
Netflix
📍 Los Gatos, CA
Databricks, Lakehouse AI, Apache Spark
~8 yr | Data Engineer, Databricks lakehouse AI at Netflix
https://www.linkedin.com/in/saulcruz/
#6
Ian Brown
Netflix
📍 San Francisco, CA
Data Platform Engineering
~5-7 yr | SWE at Netflix, UW Madison
https://www.linkedin.com/in/ianbrown/
AirbnbFAANG
Icehouse (Iceberg-based lakehouse), real-time ingestion
#1
Rex Xiong
Airbnb
📍 Greater Seattle Area
Apache Iceberg, Data Lakehouse, Spark
~7 yr | SWE building Icehouse (Airbnb's Iceberg lakehouse). Iceberg Summit 2025 speaker
https://www.linkedin.com/in/rexxiong/
#2
Luke Lowery
Airbnb
📍 San Francisco Bay Area
Data Platforms, Apache Spark, Kafka
~6 yr | Data Platforms at Airbnb. Spark and Kafka infrastructure
https://www.linkedin.com/in/luke-lowery/
#3
Krist Wongsuphasawat
Airbnb
📍 San Francisco Bay Area
Apache Iceberg, Real-time Streaming
~7-8 yr | Data Experience at Airbnb - Iceberg real-time ingestion, SLA-driven commits
https://www.linkedin.com/in/krist-wongsuphasawat-279b1617/
#4
Madeline Zhang
Airbnb
📍 San Francisco Bay Area
Data Engineering
~2-4 yr | SWE at Airbnb, MIT graduate
https://www.linkedin.com/in/madelinemzhang/
#5
Eddie Yang Chen
Airbnb
📍 San Francisco Bay Area
Data Platform, A/B Testing Infrastructure
~5-7 yr | Data platform at Airbnb, ex-OpenAI
https://www.linkedin.com/in/eddieyangchen/
MetaFAANG
Data infrastructure, Mantis, Spark, Kafka, Velox
#1
Sahil Kala
Meta
📍 San Francisco Bay Area
Apache Spark, Kafka, Airflow
~5 yr | Data Engineer at Meta, University of Cincinnati grad
https://www.linkedin.com/in/kalasahil/
#2
Krisha Mehta
Meta
📍 Menlo Park, CA
Apache Spark, Kafka, Airflow, PySpark
~6 yr | Data Engineer at Meta
https://www.linkedin.com/in/krishamehta98/
#3
Bowen Zhang
Meta
📍 San Francisco Bay Area
Apache Spark, Kafka, Mantis, Data Infrastructure
~7 yr | Data Engineer at Meta, ex-Spotify. Mantis stream processing
https://www.linkedin.com/in/bowenzhg/
#4
Ada M.
Meta
📍 San Francisco, CA
Data Engineering
~2-4 yr | SWE at Meta, Carnegie Mellon graduate
https://www.linkedin.com/in/ada-martin-cmu/
#5
Stanley Yao
Meta
📍 Menlo Park, CA
Velox, Data Infrastructure, AI Infrastructure
~8 yr | SWE at Meta - Velox open-source vectorized execution engine
https://www.linkedin.com/in/stanyao/
AppleFAANG
Data infrastructure, Kafka, Flink, Iceberg
#1
Yifeng Chen
Apple
📍 Cupertino, CA
Apache Kafka, Flink, Spark, Hive, Luigi
~7 yr | Data Infrastructure Engineer at Apple. Realtime: Kafka, Flink. Batch: Luigi, Spark
https://www.linkedin.com/in/yifengchen-cyf/
#2
Ankush Gupta
Apple
📍 San Francisco Bay Area
Apache Iceberg, Data Architecture
~8 yr | Apple Commerce Engineering - Apache Iceberg Rest Spec
https://www.linkedin.com/in/ankushgupta78/
AmazonFAANG
Data lake, EMR, Spark, Iceberg at AWS scale
#1
Harshita Revadigar
Amazon
📍 Seattle, WA
Apache Spark, AWS EMR, S3, Data Pipelines
~5 yr | Data Engineer II at Amazon, UW graduate
https://www.linkedin.com/in/harshita-revadigar-b33417153/
#2
Surya Kondapalli
Amazon
📍 Seattle, WA
Apache Spark, Hive, HBase, EMR, Sqoop
~6 yr | Data Engineer at Amazon. EMR, Spark, Hive, HBase
https://www.linkedin.com/in/surya-kondapalli-ba79612aa/
#3
Alex Lorang
Amazon
📍 Seattle, WA
Apache Spark, Snowflake, AWS EMR
~6 yr | Data Engineer II at Amazon. Spark on EMR, Snowflake
https://www.linkedin.com/in/alexlorang/
#4
Arjun A.
Amazon
📍 Seattle, WA
Apache Iceberg, Data Lake, Automated Quality Frameworks
~9 yr | AI/Data/SWE Leader at Amazon - Led Apache Iceberg implementation
https://www.linkedin.com/in/arjunasoknair/
DatabricksTech
Lakehouse platform, Apache Iceberg, Delta Lake core
#1
Xin Huang
Databricks
📍 Mountain View, CA
Apache Iceberg, Delta Lake, Spark, Predictive Optimization
~8 yr | Staff SWE at Databricks - Predictive Optimization for Managed Iceberg
https://www.linkedin.com/in/xin-huang-b9a65ab1/
#2
Allison Wang
Databricks
📍 Mountain View, CA
Apache Spark, Data Platform
~7 yr | Staff SWE at Databricks, CMU graduate
https://www.linkedin.com/in/allisonwang42/
#3
Yuhong Chen
Databricks
📍 Mountain View, CA
Apache Spark, Data Engineering
~6 yr | SWE at Databricks, UC Berkeley graduate
https://www.linkedin.com/in/yuhongc/
#4
Chen He
Databricks
📍 San Francisco, CA
Apache Kafka, Iceberg, Spark, Data Platform
~2 yr | SWE at Databricks, ex-ByteDance intern, CMU grad. Early-career
https://www.linkedin.com/in/chenhecmu/
#5
Sridhar Machiraju
Databricks
📍 San Francisco Bay Area
Apache Kafka, Iceberg, Delta Lake
~8 yr | SWE at Databricks - Zerobus, Kafka, Iceberg, Delta Lake
https://www.linkedin.com/in/sridhar-machiraju-6848054/
#6
Sriram Krishnamurthy
Databricks
📍 San Francisco Bay Area
Apache Iceberg, Spark, Adaptive Scan
~8 yr | SWE at Databricks - Iceberg Adaptive Scan performance
https://www.linkedin.com/in/srikay/
#7
Patrick Do
Databricks
📍 San Francisco Bay Area
Apache Spark, Serverless, Iceberg v3
~6 yr | SWE at Databricks - Spark Connect, serverless, Iceberg v3
https://www.linkedin.com/in/patdohere/
#8
Philip Kernan
Databricks
📍 San Francisco Bay Area
Apache Iceberg, Snowflake, Performance
~6 yr | SWE at Databricks - Iceberg performance and Snowflake interop
https://www.linkedin.com/in/philipkernan/
#9
David Gojo
Databricks
📍 San Francisco Bay Area
Databricks, Snowflake, Spark, Cortex AI
~6 yr | SWE at Databricks - Snowflake Cortex AI platform
https://www.linkedin.com/in/dgojo/
#10
Zheng Hu
Databricks
📍 San Francisco, CA
Apache Iceberg, HBase, Data Platform
~9 yr | SWE at Databricks, Apache HBase/Iceberg PMC Member
https://www.linkedin.com/in/openinx/
#11
Mark Rizkallah
Databricks
📍 San Francisco Bay Area
Apache Iceberg, Spark, Unity Catalog
~6 yr | Technical Solutions Engineer at Databricks
https://www.linkedin.com/in/mark-rizkallah/
#12
Kevin Marr
Databricks
📍 San Francisco, CA
Apache Iceberg, Data Warehousing
~7 yr | Databricks - Graphene project, Iceberg data warehousing
https://www.linkedin.com/in/kmarr/
#13
David Meyer
Databricks
📍 San Francisco, CA
Spark, Open Source Data Technologies, Zerobus
~8 yr | Databricks - Zerobus (real-time lakehouse messaging)
https://www.linkedin.com/in/davidpmeyer/
#14
Eric Wang
Databricks
📍 San Francisco Bay Area
Data Platform Engineering
~4-6 yr | SWE at Databricks, Caltech graduate
https://www.linkedin.com/in/ericewang/
#15
Rahul Potharaju
Databricks
📍 San Francisco Bay Area
Cloud Compute, Data Platform, Multi-Cloud Workloads
~7 yr | Databricks - optimizing compute for multi-cloud data
https://www.linkedin.com/in/rahul-potharaju/
LinkedInFAANG
Data infrastructure SRE, Kafka, Spark
#1
Growth SRE
LinkedIn
📍 Mountain View, CA
Data Infrastructure SRE, Kafka, Spark
~6 yr | Growth SRE at LinkedIn - SRE for data infrastructure, Kafka, Spark
https://www.linkedin.com/in/growth-sre-7022961b6/
#2
Shuaib Ahmad
LinkedIn
📍 Sunnyvale, CA
Big Data, Apache Spark
~7 yr | Data Engineer at LinkedIn, big data/Spark
https://www.linkedin.com/in/shuaib-ahmad-b5ba8a1ba/
UberTech
Data platform, HDFS, Kafka, microservices
#1
Yuhao Dong
Uber
📍 San Francisco Bay Area
HDFS, Hive, Platform Engineering, Piper
~8 yr | Staff SWE at Uber, ex-Amazon - data platform, Piper devpod
https://www.linkedin.com/in/yuhao-dong-378508a6/
#2
Ying Zheng
Uber
📍 San Francisco Bay Area
Kafka, Microservices
~7 yr | Uber - Kafka, microservices for data systems
https://www.linkedin.com/in/ying-zheng-5975b118/
#3
Andrew Wang
Uber
📍 San Francisco Bay Area
Data Platform Engineering
~3-5 yr | SWE at Uber, UC San Diego graduate
https://www.linkedin.com/in/andrewlikestea/
SpotifyTech
Big data, Spark, Kafka, streaming infrastructure
#1
Sohan Shah
Spotify
📍 New York, NY
Apache Spark, Kafka, Big Data
~6 yr | Data Engineer at Spotify, UCSB graduate
https://www.linkedin.com/in/sohanshah/
#2
Kishore Nakka
Spotify
📍 San Francisco, CA
Apache Spark, Kafka, Databricks, Snowflake
~6 yr | Data Engineer/Analyst at Spotify
https://www.linkedin.com/in/kishore-nakka/
#3
Daynesh Mangal
Spotify
📍 New York, NY
Apache Kafka, Spark, Big Data
~6 yr | Spotify - Kafka, Spark, big data platform
https://www.linkedin.com/in/daynesh/
#4
Javier Buquet
Spotify
📍 New York, NY
Apache Kafka, Flink, Streaming Infrastructure
~9 yr | Spotify - Kafka and Flink streaming infrastructure
https://www.linkedin.com/in/jbuquet/
SalesforceTech
Data platform, Iceberg, Kafka, multi-engine
#1
Vidhyaa Gopal
Salesforce
📍 San Francisco Bay Area
Apache Iceberg, Data Platform, Kafka
~6 yr | Salesforce - Data Platform with Apache Iceberg
https://www.linkedin.com/in/vidhyaag/
#2
Anirudh Srinivas
Salesforce
📍 San Francisco Bay Area
Apache Kafka, Hadoop, HBase, Spark
~6 yr | SWE at Salesforce, USC graduate
https://www.linkedin.com/in/anirudhsrinivas/
#3
Shalini Gautam
Salesforce
📍 San Francisco Bay Area
Apache Kafka, Spring Boot, Microservices
~6 yr | SWE at Salesforce - Kafka, microservices monitoring
https://www.linkedin.com/in/shalini-gautam-sf/
#4
Senthilkumar Mani
Salesforce
📍 San Francisco Bay Area
Apache Spark, Kafka, HBase, Hadoop
~6 yr | SWE at Salesforce
https://www.linkedin.com/in/senthilkumarmani21/
#5
Varun Vyas
Salesforce
📍 San Francisco Bay Area
Apache Iceberg, Snowflake, Databricks, Dremio
~8 yr | Lead MTS at Salesforce - Iceberg 1.10.0, multi-platform
https://www.linkedin.com/in/varun20/
Walmart Global TechTech
Delta Lake, Databricks, Spark at massive scale
#1
Rafael Baring
Walmart Global Tech
📍 San Francisco Bay Area
Apache Spark, Delta Lake, Databricks, Fault Tolerance
~6 yr | SWE at Walmart Global Tech - Spark Streaming, Delta Lake
https://www.linkedin.com/in/rafael-baring/
#2
Shubham Gondane
Walmart Global Tech
📍 Bentonville, AR
Apache Spark, Airflow, Snowflake
~7 yr | Senior Data Engineer at Walmart Global Tech
https://www.linkedin.com/in/shubhamgondane/
WalmartTech
Databricks, Delta Lake, Spark at Walmart scale
#1
Sundar Manoharan
Walmart
📍 Bentonville, AR
Apache Spark, Kafka, Delta Lake, Databricks, Airflow
~9 yr | Walmart - Databricks, Delta Lake, Spark, Kafka, Airflow
https://www.linkedin.com/in/sundar-manoharan/
SnowflakeTech
Iceberg, cloud data warehouse, lakehouse
#1
Tim Spann
Snowflake
📍 San Francisco Bay Area
Apache Iceberg, Kafka, Flink, Snowflake, AI
~8 yr | Snowflake - AI/Data, Iceberg, Kafka, Flink
https://www.linkedin.com/in/timothyspann/
#2
Kate Beispel
Snowflake
📍 San Francisco, CA
Apache Iceberg, Snowflake, Data Pipelines
~6 yr | Major Accounts at Snowflake
https://www.linkedin.com/in/kate-beispel-7586818a/
ConfluentTech
Apache Kafka streaming, distributed systems
#1
Nikesh Kumar
Confluent
📍 San Francisco Bay Area
Apache Kafka, Streaming, Data Analytics
~6 yr | Confluent - Kafka data streaming
https://www.linkedin.com/in/nikesh-kumar-s/
#2
Luke Young
Confluent
📍 San Francisco Bay Area
Apache Kafka, Data Streaming
~6 yr | Confluent - Kafka streaming platform
https://www.linkedin.com/in/bored-engineer/
#3
Sophie Blee-Goldman
Confluent
📍 San Francisco Bay Area
Apache Kafka, Streaming, Distributed Systems (PMC/Committer)
~8 yr | Confluent - Apache Kafka PMC/Committer. Harvey Mudd grad
https://www.linkedin.com/in/ableegoldman/
CoinbaseTech
Data platform, Kafka, Elasticsearch
#1
Erik Reppel
Coinbase
📍 San Francisco, CA
Kafka, Elasticsearch, Data Pipelines
~6 yr | Coinbase - Kafka and Elasticsearch data pipelines
https://www.linkedin.com/in/erikreppel/
Expedia GroupTech
Big data, Spark, travel data platform
#1
Reema Yadav
Expedia Group
📍 Seattle, WA
Apache Spark, Data Engineering
~6 yr | Expedia Group - data engineering, Stanford grad
https://www.linkedin.com/in/reemy/
#2
Harshith Settyhalli
Expedia Group
📍 Seattle, WA
Big Data, Apache Spark
~5 yr | Expedia Group, U of Minnesota - big data/Spark
https://www.linkedin.com/in/harshith-settyhalli-1b320665/
#3
Wanli Lau
Expedia Group
📍 Greater Seattle Area
Big Data, Apache Spark
~5 yr | Expedia Group, U of Washington - Spark, big data
https://www.linkedin.com/in/wanlilau/
Palo Alto NetworksTech
Platform engineering with Go, Kafka
#1
Vishvjeet Yadav
Palo Alto Networks
📍 San Francisco Bay Area
Platform Engineering, Go, Kafka
~7 yr | Senior Platform Engineer - scalable platforms, Go, Kafka
https://www.linkedin.com/in/vishvjeet-yadav/
ClouderaTech
Hadoop, Spark, Kafka, enterprise data
#1
Anshul Gupta
Cloudera
📍 San Francisco Bay Area
Hadoop, Apache Spark, Kafka
~6 yr | Cloudera - Hadoop, Spark, Kafka. NDSU graduate
https://www.linkedin.com/in/aguptapro/
Methodology: Candidates gathered via LinkedIn search, Apache Iceberg Summit/engineering blog references, and OSS contribution signals. Strictly filtered for: (1) Less than 10 years industry experience, estimated from education timeline and role progression; (2) Explicit USA location in profile snippet; (3) Genuine data lake, lakehouse platform, or SRE for data infrastructure work; (4) FAANG/top tech company. Profiles with staff/principal/manager titles or explicit 10+ year hints were excluded. Borderline candidates (~8-9 yr) include notes explaining rationale.