DataSEA- Data Science, Engineering, Analytics

  # DataSEA: The Gamified Mobile App to Learn Data Engineering, SQL, and Analytics in 10 Minutes a Day                                                                                                                                                                                                             > **TL;DR** — DataSEA is a free Android app that turns Data Engineering, Analytics, and Data Science into bite-size, gamified lessons. 88+ modules,     500+ lessons,...

AWS Lake formation - AWS LF - Governance Security- Access control

 

๐Ÿงญ Overview: What is AWS Lake Formation?

AWS Lake Formation is a service that simplifies building a secure data lake by:

  • Ingesting data from various sources

  • Organizing it in Amazon S3

  • Setting up data catalogs (via AWS Glue)

  • Defining security and access policies

  • Querying data with services like Athena, Redshift, and EMR


๐Ÿ› ️ Prerequisites

Before starting, ensure you have:

  • An AWS account

  • IAM permissions for Lake Formation, Glue, S3, and IAM

  • An existing S3 bucket (or create a new one)


๐Ÿงฑ Step 1: Set Up a Data Lake Location

  1. Go to the Lake Formation Console.

  2. In the left pane, choose "Data lake locations".

  3. Click "Register location".

  4. Choose your S3 bucket or a folder (e.g., s3://your-bucket/data/).

  5. Choose an IAM role that has permission to access this location.


๐Ÿ“‹ Step 2: Add a Data Catalog Table

  1. From the Lake Formation Console, go to "Databases".

  2. Click "Create database" (this is part of AWS Glue).

    • Example: Name: sales_db

  3. Now go to "Tables", and either:

    • Crawl your data with AWS Glue Crawler

    • Or manually define a table (not recommended for beginners)

Use Glue Crawler for simplicity:

  • Create a new Glue Crawler

  • Point it to the S3 path (e.g., s3://your-bucket/data/sales/)

  • Assign it to the sales_db database

  • Run the crawler to populate the catalog


๐Ÿ” Step 3: Grant Permissions with Lake Formation

  1. Go to "Permissions" > "Data lake permissions"

  2. Click "Grant":

    • Choose your IAM user or role

    • Select the database and tables

    • Choose permissions like SELECTALTER, etc.

This step replaces traditional IAM-based S3 permissions for data access


๐Ÿงช Step 4: Query Data with Athena (Optional)

  1. Open Amazon Athena

  2. Choose the Lake Formation-managed database (e.g., sales_db)

  3. Run a SQL query:

    SELECT * FROM sales_table LIMIT 10;

If permissions are properly configured, you’ll see results.

Comments

Popular posts from this blog

Bhakti-Aarti- Android app Privacy policy

DBT tool connect Athena from Local- AWS SSO