The Hive - Learn, Help, Connect
Dirty Data Dojo - Data Cleaning

Excel

Python

R

Data cleaning is a serious business – you’ll typically spend up to 80% of your time cleaning data!

In this course you’re going to learn how to clean your data - in Excel, Python and R - in hours rather than days or weeks!

I’ll teach you a method that took me several years to perfect, and as a result you’ll become much more productive, get your results faster and make your boss happy in the process.

The steps you’ll learn in this course are very simple to follow, but are extremely effective, so you’ll know that you’re getting the best start possible, saving you weeks of misery!


Video lessons

Articles

Downloadable resources

Certificate of completion

Interactive experience

Perfect for beginners



If you’ve ever spent days or weeks cleaning your data and wished there was a better, quicker way of doing it, then congratulations! You’re in the right place, because we’re going to introduce you to one of the most important aspects of any project that involves data, data cleaning, and teach you how to do it better!

Cleaning data isn’t difficult, but if you don’t observe a few basic rules then you’re storing up trouble for yourself when it comes to analysing it. If you’ve been there, then you know what I’m talking about.

In this course, we’re going to give you the best start possible, introducing you to some of the best tricks you’ll ever learn, saving you weeks of misery!

Here's what you'll learn:

1. You’ll learn the features of what makes a good dataset and – more importantly – a bad one!

2. You’ll learn how to remove all unwanted spaces from your entire dataset in one second!

3. You’ll learn how to standardise the case of all text entries in your dataset in one epic ninja move.

4. You’ll learn all the essential steps in cleaning numerical and text data.

5. You’ll learn the precise order in which you should apply all these techniques.
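To give a flavour of points 2 and 3 in Python, here is a minimal sketch that assumes your dataset is a list of dictionaries, one per row. The function names `clean_text` and `clean_rows` and the sample rows are made up for illustration; this is not the method taught in the course videos.

```python
import re

def clean_text(value, case="title"):
    """Trim and collapse spaces, then standardise case; non-text values pass through."""
    if not isinstance(value, str):
        return value
    # Collapse runs of whitespace to a single space and strip both ends
    # (similar in spirit to Excel's TRIM, but this also handles tabs and newlines).
    collapsed = re.sub(r"\s+", " ", value).strip()
    if case == "upper":
        return collapsed.upper()
    if case == "lower":
        return collapsed.lower()
    return collapsed.title()

def clean_rows(rows, case="title"):
    """Apply clean_text to every cell of every row."""
    return [{key: clean_text(val, case) for key, val in row.items()} for row in rows]

# Hypothetical sample data
rows = [
    {"name": "  alice   SMITH ", "city": "leeds  ", "age": 34},
    {"name": "BOB jones", "city": "  York", "age": 41},
]
cleaned = clean_rows(rows)
print(cleaned[0])  # {'name': 'Alice Smith', 'city': 'Leeds', 'age': 34}
```

Title case is only one possible choice: `str.title()` capitalises the letter after every apostrophe, so for some text columns upper or lower case may be the safer standard.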

Your Curriculum

1

Introduction to Data Cleaning

An introduction to this course and to the 14 Day Data Cleaning Challenge!

Open Access

CHAPTER HIGHLIGHTS

Introduction to Data Cleaning

2

Removing Unwanted Spaces

In this chapter you'll learn how to remove all unwanted spaces from your data in one amazing move!

Open Access

Free Plan

CHAPTER HIGHLIGHTS

Anatomy of a Good Workbook

Remove Trailing & Leading Spaces Part 1

Remove Trailing & Leading Spaces Part 2

Standardising the Case of Text Entries

3

Cleaning Text Data

The focus in this chapter is on how to clean your text data in Excel accurately and efficiently

Premium

CHAPTER HIGHLIGHTS

Cleaning Text Data Using Remove Duplicates and Find & Replace

Cleaning Text Data Using Remove Duplicates and VLOOKUP

Cleaning Text Data Using Filter and Find & Replace
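The chapter itself works in Excel, but the same idea (list the distinct entries, then replace the bad ones from a lookup table) can be sketched in Python. The helper names and the sample country values below are hypothetical, chosen only to illustrate the approach.

```python
def unique_values(values):
    """Return distinct values in first-seen order (the Remove Duplicates step)."""
    return list(dict.fromkeys(values))

def apply_lookup(values, lookup):
    """Replace each value via the lookup table; unknown values are left unchanged."""
    return [lookup.get(value, value) for value in values]

# Hypothetical text column with inconsistent spellings
countries = ["UK", "U.K.", "United Kingdom", "USA", "UK", "U.S.A."]

# Step 1: inspect the distinct entries to spot the variants
print(unique_values(countries))

# Step 2: map each variant to its standard form (the lookup step)
lookup = {"U.K.": "UK", "United Kingdom": "UK", "U.S.A.": "USA"}
standardised = apply_lookup(countries, lookup)
print(unique_values(standardised))  # ['UK', 'USA']
```

Leaving unknown values unchanged, rather than blanking them, means a second pass over the distinct entries will show anything the lookup table missed.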

4

Cleaning Numerical Data

The focus in this chapter is on how to clean your numerical data in Excel accurately and efficiently

Premium

CHAPTER HIGHLIGHTS

Identifying Text in Your Numerical Data
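Outside Excel, one simple way to identify text hiding in a numerical column is to try converting each entry to a number and collect the ones that fail. The sketch below assumes the column is a plain Python list; `find_non_numeric` and the sample values are invented for illustration.

```python
def find_non_numeric(values):
    """Return (position, value) pairs for entries that cannot be read as numbers."""
    problems = []
    for index, value in enumerate(values):
        try:
            float(value)  # accepts numbers and numeric strings, ignoring outer spaces
        except (TypeError, ValueError):
            problems.append((index, value))
    return problems

# Hypothetical column: note the letter O typed instead of a zero in "3O"
ages = [34, "41", " 29 ", "n/a", "3O", None, 52.5]
print(find_non_numeric(ages))  # [(3, 'n/a'), (4, '3O'), (5, None)]
```

Be aware that `float` also accepts strings such as "nan" and "inf", so those would need a separate check if they can appear in your data.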

5

Order, Order, We Must Have Order...

In this chapter you'll learn the precise order in which to apply all the techniques you've learned

Premium

CHAPTER HIGHLIGHTS

Order, Order, We Must Have Order…

6

Data Cleaning Recap

In this chapter you'll recap everything you've learnt and practise it all again!

Premium

CHAPTER HIGHLIGHTS

Data Cleaning Recap

Chi-Squared Innovations