Big Data Analytics Using Spark Free Online Course by University of California, San Diego

The University of California, San Diego is offering a free online course on Big Data Analytics Using Spark. In data science, data is called “big” if it cannot fit into the memory of a single standard laptop or workstation.

In this ten-week course, applicants will learn how to analyze large data sets using Jupyter notebooks, MapReduce and Spark as a platform. The course starts on April 1, 2018.

Course At A Glance 

Length: 10 weeks
Effort: 10 hours per week
Subject: Data Analysis & Statistics
Institution: University of California, San Diego and edX
Languages: English
Price: Free
Certificate Available: Yes; a Verified Certificate can be added for $350
Session: Course Starts on April 1, 2018

Providers’ Details

The University of California, San Diego (UC San Diego) is a student-centered, research-focused, service-oriented public institution that provides opportunity for all. This young university has made its mark regionally, nationally and internationally.

About This Course

In data science, data is called “big” if it cannot fit into the memory of a single standard laptop or workstation.

The analysis of big datasets requires using a cluster of tens, hundreds or thousands of computers. Effectively using such clusters requires the use of distributed file systems, such as the Hadoop Distributed File System (HDFS), and corresponding computational models, such as Hadoop, MapReduce and Spark.

Why Take This Course?

You will learn how to perform supervised and unsupervised machine learning on massive datasets using the Machine Learning Library (MLlib).

In this course, as in the other ones in this MicroMasters program, you will gain hands-on experience using PySpark within the Jupyter notebooks environment.

Learning Outcomes

  • Programming Spark using PySpark
  • Identifying the computational tradeoffs in a Spark application
  • Performing data loading and cleaning using Spark and Parquet
  • Modeling data through statistical and machine learning methods

Instructors

Yoav Freund

Dr. Freund is a Professor of Computer Science and Engineering at the University of California, San Diego.

Requirements

The previous courses in the MicroMasters program: DSE200x, DSE210x and DSE220x

How To Join This Course

  • Go to the course website link
  • Create an edX account to sign up
  • Choose “Register Now” to get started.
  • EdX offers honor code certificates of achievement, verified certificates of achievement, and XSeries certificates of achievement. Currently, verified certificates are only available in some courses.
  • Once applicants have signed up for a course and activated their account, they can click the Log In button on the edx.org homepage and enter their email address and edX password. This takes them to the dashboard, with access to each of their active courses. (Before a course begins, it is listed on the dashboard but does not yet have a “view course” option.)

Apply Now
