This auction has been won.
View other items offered by JayDuo194
Leading
MatSte8525118 1 × R1.00
27 Jun 09:23

Similar products

Cisco CCNA Interconnecting Cisco Networking Devices Part 2 Video Training Course
New
R250.00
88% OFF
Brendon Burchard - 10X WEALTH + BUSINESS [COMPLETE VIDEO COURSE + BONUSES | 8.4GB] [USB Drive]
New
R4,495.00 R38,922.00
A short course in modern organic chemistry.
Secondhand
R100.00
Acrylic Workbook a complete course in 10 lessons
Secondhand
R115.00
50 Hours of Big Data, PySpark, AWS, Scala, and Scraping - Video Course
Sold

50 Hours of Big Data, PySpark, AWS, Scala, and Scraping - Video Course

Digital product
New 1 was available
Indicative market price: R700.00
R40.00 minimum increment
R1.00
R499.00
29% off
Shipping
This is a digital product (eg. voucher, product license, service, etc.) and does not require shipping. The seller will be in contact to deliver this product to you electronically.
The seller has indicated that they will usually have this item ready to ship within 5 business days. Shipping time depends on your delivery address. The most accurate delivery time will be calculated at checkout, but in general, the following shipping times apply:
 
Standard Delivery
Main centres:  1-3 business days
Regional areas: 3-4 business days
Remote areas: 3-5 business days
Get it now, pay later

Product details

Condition
New
Location
South Africa
Bob Shop ID
647011643

Key benefits

Data scraping and data mining for beginners to pro with Python

Clear unfolding of concepts with examples in Python, Scrapy, Scala, PySpark, and MongoDB

Master Big Data with PySpark and AWS

Description

Part 1 is designed to reflect the most in-demand Scala skills. It provides an in-depth understanding of core Scala concepts. We will wrap up with a discussion on Map Reduce and ETL pipelines using Spark from AWS S3 to AWS RDS (includes six mini-projects and one Scala Spark project). Part 2 covers PySpark to perform data analysis. You will explore Spark RDDs, Dataframes, a bit of Spark SQL queries, transformations, and actions that can be performed on the data using Spark RDDs and dataframes, the ecosystem of Spark and Hadoop, and their underlying architecture. You will also learn how we can leverage AWS storage, databases, computations, and how Spark can communicate with different AWS services. Part 3 is all about data scraping and data mining. You will cover important concepts such as Internet Browser execution and communication with the server, synchronous and asynchronous, parsing data in response from the server, tools for data scraping, Python requests module, and more. In Part 4, you will be using MongoDB to develop an understanding of the NoSQL databases. You will explore the basic operations and explore the MongoDB query, project and update operators. We will wind up this section with two projects: Developing a CRUD-based application using Django and MongoDB and implementing an ETL pipeline using PySpark to dump the data in MongoDB. By the end of this course, you will be able to relate the concepts and practical aspects of learned technologies with real-world problems. 


This course is designed for absolute beginners who want to create intelligent solutions, study with actual data, and enjoy learning theory and then putting it into practice. Data scientists, machine learning experts, and drop shippers will all benefit from this training. A basic understanding of programming, HTML tags, Python, SQL, and Node JS is required. However, no prior knowledge of data scraping, and Scala is needed.

What you will learn


Build ETL pipeline from AWS S3 to AWS RDS using Spark

Explore Spark/Hadoop applications, ecosystem, and architecture

Learn collaborative filtering in PySpark

Recognize the distinction between synchronous and asynchronous requests

Understand MongoDB CRUD, query operators, projection operators, and update operators

Build APIs for CRUD operations in MongoDB through Django

Please note- This is a Video course, it will be emailed to you and you will need to download the lessons.



Recently viewed

See more
DVORAK String Quartets 10XCD BOX SET [Classical Box 1]
Secondhand
R270.00
Keyboard Shortcut Desk Mat Mouse Pad Non-Slip Office Desk Pad With Computer Commands 400x800x2mm(...
New
R256.83
Digital component tester with lcd graphic display lcr-t4
New
R435.87
Intel Core i5-14400 Processor - Up to 4.7 GHz, 10 Cores, 16 Threads, 20MB SmartCache, 65W TDP
New
R4,670.00