Course Outline:
DURATION: 3 Days
Module 1: Introduction to Python:
Section 1:
- Why Spark.
- Advantages of Spark.
- What is Spark.
- Components of Spark.
- History of Spark.
- Spark Architecture.
- Spark Language APIs.
- Spark Session.
- DataFrames and Partitions.
- Transformations.
- Actions.
- Structured APIs.
- Schema.
- Spark Types.
- Structured API Execution.
Section 2:
- Read & Write API Structure.
- Reading and Writing Data.
- Reading and Writing Data – CSV, JSON, ORC.
- Reading and Writing Text Files.
Section 3:
- What are Low-level APIs and when to use them.
- What are RDDs?
- Creating RDDs.
- RDD Transformations and Actions.
Section 4:
- Structured Streaming Overview.
- Streaming vs Structured Streaming.
- Advantages of Structured Streaming.
- Stream Processing Examples.