DataSEA- Data Science, Engineering, Analytics

  # DataSEA: The Gamified Mobile App to Learn Data Engineering, SQL, and Analytics in 10 Minutes a Day                                                                                                                                                                                                             > **TL;DR** — DataSEA is a free Android app that turns Data Engineering, Analytics, and Data Science into bite-size, gamified lessons. 88+ modules,     500+ lessons,...

Data Engineering Basics to Advance: Phase-III- Advanced

                             Advanced


opics:

  1. Big Data Ecosystem

    • Apache Spark (core, DataFrames, PySpark)

    • Hive/Presto/Athena

    • Hadoop (just architecture overview)

  2. Data Lakes & Lakehouses

    • Concepts: data lake, warehouse, lakehouse

    • Table formats: Apache IcebergDelta LakeHudi

    • Glue, Athena, Iceberg setup

  3. Streaming Systems

    • Kafka: pub/sub, brokers, partitions

    • Kafka Connect, schema registry

    • Apache Flink or Spark Structured Streaming (basics)

  4. Cloud Data Warehouses

    • BigQuery, Redshift, Snowflake: architecture & querying

    • Partitioning, clustering, optimization

  5. Monitoring & Observability

Comments

Popular posts from this blog

Bhakti-Aarti- Android app Privacy policy

DBT tool connect Athena from Local- AWS SSO

AWS Lake formation - AWS LF - Governance Security- Access control