Tags
Language
Tags
December 2024
Su Mo Tu We Th Fr Sa
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30 31 1 2 3 4

Cleaning Bad Data in R

Posted By: IrGens
Cleaning Bad Data in R

Cleaning Bad Data in R
.MP4, AVC, 500 kbps, 1280x720 | English, AAC, 128 kbps, 2 Ch | 1h 54m | 265 MB
Instructor: Mike Chapple

Data integrity is the new focal point of the data science revolution. Now that everybody is onboard with the role of data in people's lives and business, it's not an unfair question to ask, "Can you prove that your data is accurate?" In this course, you can learn how to identify and address many of the data integrity issues facing modern data scientists, using R and the tidyverse. Discover how to handle missing values and duplicated data. Find out how to convert data between different units and tackle poorly formatted text. Plus, learn how to detect outliers, address structural issues, and identify red flags that indicate potential data quality issues.

Where possible, instructor Mike Chapple shows how to correct the issues using R, but the same principles can be applied to any statistical programing language.

Topics include:

Missing data
Duplicate rows and values
Converting data
Formatting data
Working with tidy data
Tidying data sets
Dealing with suspicious data


Cleaning Bad Data in R