MLlib
PySpark
Databricks
Apache Flink
Apache Airflow (pipeline orchestration)
dbt (Data Build Tool)
Delta Lake / Lakehouse Architecture
Parquet / Avro / ORC
SQL (surprisingly absent)
Presto / Trino
Elasticsearch
Data Lake Architecture
Apache Beam
GitHub Actions / CI/CD
Fivetran / Airbyte
MLlib
PySpark
Databricks
Apache Flink
Apache Airflow (pipeline orchestration)
dbt (Data Build Tool)
Delta Lake / Lakehouse Architecture
Parquet / Avro / ORC
SQL (surprisingly absent)
Presto / Trino
Elasticsearch
Data Lake Architecture
Apache Beam
GitHub Actions / CI/CD
Fivetran / Airbyte
MLlib
PySpark
Databricks
Apache Flink
Apache Airflow (pipeline orchestration)
dbt (Data Build Tool)
Delta Lake / Lakehouse Architecture
Parquet / Avro / ORC
SQL (surprisingly absent)
Presto / Trino
Elasticsearch
Data Lake Architecture
Apache Beam
GitHub Actions / CI/CD
Fivetran / Airbyte