Subcategories

Lynda - Scala Essential Training for Data Science

Posted By: U.N.Owen
Lynda - Scala Essential Training for Data Science

Lynda - Scala Essential Training for Data Science
Size: 192 MB | Duration: 1h 51m | Video: AVC (.mp4) 1280x720 15&30fps | Audio: AAC 48KHz 2ch
Genre: eLearning | Level: Intermediate | Language: English

Discover how to leverage Scala—the popular language that combines object-oriented design with functional programming—in your data science work. In this course, learn about the Scala features most useful to data scientists, including custom functions, parallel processing, and programming Spark with Scala. Dan Sullivan kicks off the course with an introduction for non-Scala programmers. Next, he describes how to use SQL from Scala—a particularly useful concept for data scientists, since they often have to extract data from relational databases. He then covers parallel processing constructs in Scala, sharing techniques that are useful for medium-sized data sets that can be analyzed on a single server with multiple cores. Dan also focuses on using Scala with Spark, a distributed processing platform. He first describes how to work with Resilient Distributed Datasets (RDDs)—a fundamental Spark data structure—and then explains how to use Scala with Spark DataFrames, a new class of data structure specially designed for analytic processing. He wraps up the course by providing a summary of advantages of using Scala for data science.

* The advantages of Scala for data science
* Scala data types
* Scala arrays, vectors, and ranges
* Parallel processing in Scala
* Mapping functions over parallel collections
* When and when not to use parallel collections
* Using SQL in Scala
* Scala and Spark RDDs
* Scala and Spark DataFrames
* Creating DataFrames


Lynda - Scala Essential Training for Data Science

No mirrors below please.