DataSEA- Data Science, Engineering, Analytics

  # DataSEA: The Gamified Mobile App to Learn Data Engineering, SQL, and Analytics in 10 Minutes a Day                                                                                                                                                                                                             > **TL;DR** — DataSEA is a free Android app that turns Data Engineering, Analytics, and Data Science into bite-size, gamified lessons. 88+ modules,     500+ lessons,...

Data Engineering Basics to Advance: Phase-IV- Capstone Projects

                        Capstone Projects


Project Ideas:

  1. Batch Pipeline

    • Source: CSV on S3

    • Process: PySpark/dbt

    • Sink: Redshift/BigQuery

    • Orchestrate: Airflow

  2. Streaming Pipeline

    • Source: Kafka (clickstream or logs)

    • Process: Spark Streaming

    • Sink: ElasticSearch or S3

  3. Data Observability

    • Implement Great Expectations

    • Data profiling and alerting

  4. Cloud-native Data Lakehouse

Comments

Popular posts from this blog

Bhakti-Aarti- Android app Privacy policy

DBT tool connect Athena from Local- AWS SSO

AWS Lake formation - AWS LF - Governance Security- Access control