Data Engineering
Interview Questions

Curated Q&As from real interviews at top tech companies — organized by tool, concept, and difficulty level.

● Beginner● Intermediate● Advanced

By Tool & Platform

Jump straight to questions for the tool you are being interviewed on.

🔌
Airbyte
View Q&As →
🌀
Apache Airflow
View Q&As →
📨
Apache Kafka
View Q&As →
🔥
Apache Spark
View Q&As →
AWS Glue
View Q&As →
🔷
Azure Data Factory
View Q&As →
🧱
Databricks
View Q&As →
📐
dbt (data build tool)
View Q&As →
📡
Debezium
View Q&As →
🏗️
IBM DataStage
View Q&As →
🔴
Oracle Database / Oracle Data Integrator
View Q&As →
🐍
Python for Data Engineering
View Q&As →
❄️
Snowflake
View Q&As →
🗄️
SQL Server / SSIS
View Q&As →
🔗
Talend
View Q&As →

By Topic

Cross-tool concepts that appear in almost every data engineering interview.

📊
SQL & Query Optimization
🏛️
System Design
🔄
ETL Concepts
📐
Data Modeling
☁️
Cloud & Infrastructure
💬
Behavioral / Project-Based

💡 How to Use This Section

1. Filter by your interview tool
If you are interviewing for a Talend role, go directly to the Talend questions.
2. Start with Intermediate
Most data engineering interviews skip beginner questions and go straight to intermediate.
3. Always prepare a system design
Senior roles always ask you to design a full pipeline from scratch. Practice this.