Month 14: Cloud Data Analytics and Big Data

Month 14: Cloud Data Analytics and Big Data

Week 1: Introduction to Big Data and Analytics Concepts

Day 1: Understanding Big Data: characteristics, sources, and challenges

Day 2: Introduction to data analytics and its importance

Day 3: Types of analytics: descriptive, diagnostic, predictive, and prescriptive

Day 4: Data warehousing and data lakes

Day 5: Hands-on activity: Exploring popular Big Data and analytics tools and libraries

Week 2: Data Processing Frameworks – Apache Hadoop and Spark

Day 1: Introduction to Apache Hadoop and its ecosystem

Day 2: Hands-on activity: Setting up and working with a Hadoop cluster

Day 3: Introduction to Apache Spark and its components

Day 4: Hands-on activity: Running Spark jobs for data processing

Day 5: Hadoop vs. Spark: similarities, differences, and use cases

Week 3: AWS Big Data and Analytics Services

Day 1: Introduction to AWS EMR (Elastic MapReduce)

Day 2: Hands-on activity: Running Hadoop and Spark jobs with EMR

Day 3: Introduction to AWS Kinesis

Day 4: Hands-on activity: Building real-time data processing with Kinesis

Day 5: Introduction to AWS Glue

Day 6: Hands-on activity: Creating ETL jobs with Glue

Week 4: Azure Big Data and Analytics Services

Day 1: Introduction to Azure HDInsight

Day 2: Hands-on activity: Running Hadoop and Spark jobs with HDInsight

Day 3: Introduction to Azure Stream Analytics

Day 4: Hands-on activity: Building real-time data processing with Stream Analytics

Day 5: Introduction to Azure Data Factory

Day 6: Hands-on activity: Creating ETL jobs with Data Factory

Week 5: GCP Big Data and Analytics Services

Day 1: Introduction to GCP DataProc

Day 2: Hands-on activity: Running Hadoop and Spark jobs with DataProc

Day 3: Introduction to GCP DataFlow

Day 4: Hands-on activity: Building real-time and batch data processing with DataFlow

Day 5: Introduction to GCP DataFusion

Day 6: Hands-on activity: Creating ETL jobs with DataFusion