Best Blackhat Forum

Full Version: [F4LT] Introduction to Python for Big Data Engineering with PySpark
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
Introduction to Python for Big Data Engineering with PySpark
Learn how to use Python and Spark 3.0 (PySpark) for Data Engineering and Data Analytics on Big Data Cloud Platforms
Rating: 0.0 out of 5
0.0
(0 ratings)
0 students
1hr 56min of on-demand video
Created by Thulani Mngadi
English
English [Auto]

Description
The key objectives of this course are as follows;

Learn Spark Architecture
Learn Spark Execution Concepts
Learn Spark Transformations and Actions using the Structured API
Learn Spark Transformations and Actions using the RDD (Resilient Distributed Datasets) API
Learn how to set up your own local PySpark Environment
Learn how to interpret the Spark Web UI
Learn how to interpret DAG (Directed Acyclic Graph) for Spark Execution

The Python Spark project that we are going to do together;

Sales Data

Create a Spark Session
Read a CSV file into a Spark Dataframe
Learn to Infer a Schema
Select data from the Spark Dataframe
Produce analytics that shows the topmost sales orders per Region and Country

https://www.udemy.com/course/introduction-to-python-for-big-data-engineering-with-pyspark/
Reference URL's