The Hive - Learn, Help, Connect
Dirty Data Dojo - Grand Project

Excel

Python

R

Data cleaning is about being organised and having a plan of action to deal with every type of data error.

Having data doesn't mean having analysis-ready data - you'll still have to clean, prepare and validate your data before you can analyse it.

In the first 5 courses of the Dirty Data Dojo series you learnt how to do all of these things, and now it's time to put it all together and clean a large dataset, taking it from dirty to analysis-ready - and you'll do it all in a couple of hours!

The steps you’ll learn in this course are very simple to follow, but are extremely effective, so you’ll know that you’re getting to the true story of the data, saving you weeks of misery!


Video lessons

Articles

Downloadable resources

Certificate of completion

Interactive experience

Perfect for beginners


[wpc_button]

  • Description
  • Content
  • Outcomes
  • FORUM
  • CertificatE
  • 14D2C2

DIrty Data Dojo Recap

Data cleaning is about being organised and having a plan of action to deal with every type of data error.

In this course you'll recap everything you learned over the previous 5 courses in the Dirty Data Dojo series, where we covered:

1.

Data collection

2.

Data cleaning

3.

Data preparation

4.

Data validation

5.

DataKleenr

DIrty Data Dojo - Grand Project

Once you've had a quick recap, you'll be given a dirty dataset and clean it using all the techniques you've learnt, taking it from dirty to analysis-ready in just a couple of hours!

Your Curriculum

1

Dirty Data Dojo Recap

A recap of everything that you learnt in courses 1-5 of the Dirty Data Dojo series

Open Access

Premium

CHAPTER HIGHLIGHTS

Dirty Data Dojo Recap

2

Grand Project

In this chapter you'll learn how to remove unwanted text entries that contaminate your data

Premium

CHAPTER HIGHLIGHTS

Data Cleaning:

Removing unwanted spaces and non-printing characters, and case-controlling

Data Preparation:

Converting your data into the required formats

Data Validation:

Checking that your data are sensible and fit real-life rules.

Includes automatically removing outliers

DataKleenr:

Bypassing everything you've just learnt to get your data clean and analysis-ready in seconds!

Get Started

Ready to get started on the Dirty Data Dojo Grand Project?

Just click the link on the right to go to the first lesson...

How to analyse categorical survey data in Excel and in R
Remember Me
Success message!
Warning message!
Error message!