Data cleaning is a serious business – you’ll typically spend up to 80% of your time cleaning data!
In this course you’re going to learn how to clean your data - in Excel, Python and R - in hours rather than days or weeks!
I’ll teach you a method that took me several years to perfect, and as a result you’ll become much more productive, get your results faster and make your boss happy in the process.
The steps you’ll learn in this course are very simple to follow, but are extremely effective, so you’ll know that you’re getting the best start possible, saving you weeks of misery!
Certificate of completion
Perfect for beginners
- Python & R
If you’ve ever spent days or weeks cleaning your data and wishing there was a better, quicker way of doing it, then congratulations! You are here at the right place because we are going to introduce you to one of the most important aspects of any project that involves data: data cleaning, and teach you how to do it better!
Cleaning data isn’t difficult, but if you don’t observe a few basic rules then you’re storing up trouble for yourself when it comes to analysing it. If you’ve been there, then you know what I’m talking about.
In this course, we’re going to give you the best start possible, introducing you to some of the best tricks you’ll ever learn, saving you weeks of misery!
Here's what you'll learn:
you’ll learn the features of what makes a good dataset and – more importantly – a bad one!
you’ll learn how to remove all unwanted spaces from your entire dataset in one second!
you’ll learn how to standardise the case of all text entries in your dataset in one epic ninja move
you’ll learn all the essential steps in cleaning numerical and text data
you’ll learn the precise order in which you should apply all these techniques
Suitable for beginners to cleaning data in Excel (and those that have a basic grounding in Python and/or R), in this course you’ll learn to ‘play’ with your data, learning what you can and – more importantly – can’t do with it.
Each lesson starts with a quick question to get your mental juices flowing, then you’ll dive straight into the data. There will be LOTS of playing and practising with data in Excel and your choice of Python or R, and you’ll learn by doing.
Then you’ll finish each lesson with a quiz – by now you should have learned EXACTLY what the correct responses are, and answer with confidence!
In this course you will get:
- 1Over 3 hours of video content
- 2Over 2 hours of practice exercises in each of Excel, Python and R
- 3Learn how to set up an Excel workbook for maximum effectiveness
- 4Learn how to remove all unwanted spaces from your entire dataset – in one step!
- 5Learn how to clean text data
- 6Learn how to clean numerical data
- 7Learn the strategy of which steps to perform – in the correct order
- 8Excel is used as a learning tool, and you will learn to transfer these skills to Python and/or R
- 9Data files are provided for the student to practice with
- 10Practical learning experience with real data
Is This a Course or Project?
This course is a mixture of a traditional course and a project, a 'croject', if you will...
Some lessons will be in the form of videos, and some text-based, and there will be a lot of practice exercises to build your experience.
All the lessons are done in Excel, where you will learn the concepts, as it is a non-threatening, familiar medium that everyone starts with when handling data.
There will be lots of practice exercises in Excel.
I won't formally teach the concepts in Python or R, but I will point you in the right direction with strong code hints.
I could just hand you all the code to use, but that wouldn't aid your learning path and would only hinder your progress in the long run.
The best way for you to make the transition from Excel to Python/R is for me to:
- 1teach you the concepts (which are all simple)
- 2point you in the right direction (code hints)
- 3encourage you to create your own code (with access to a forum where you can learn from others)
So for every exercise you will be encouraged to write your own code to perform precisely the same data cleaning tasks in Python/R as you do in Excel.
Don't worry - there are already pre-built functions in Python and R that we can use. It's all rather straight-forward!
Students completing the course will have the knowledge and confidence to be able to clean their data in Excel quickly and accurately.
Complete with HD videos, data, examples and practice exercises, you’ll be able to work alongside the instructor as you work through each concept, and will receive a certificate of completion upon finishing the course.
Oh, yes – and there are lots of little surprises for you along the way!
This course is Part 2 of our 14 Day Data Cleaning Challenge, where I’m going to teach you a method of data cleaning that I developed some years ago. It’s been tried and tested by hundreds of people, and it’s been tweaked along the way, improving it so that it’s always up-to-date and always works!
It will help you get your data clean and analysis-ready in minutes rather than weeks.
Yes, you read that correctly – minutes!
The 14D2C2 is a 14 day course that will take you through 1 hour of video lessons and practice sessions every day. In other words, with a 1 hour investment daily for 14 days you will become an expert in data collection, cleaning and preparation!
If you’re ready to take the 14D2C2, email me and I’ll send you the details to get started.
Introduction to Data Cleaning
An introduction to this course and to the 14 Day Data Cleaning Challenge!
Introduction to Data Cleaning
Removing Unwanted Spaces
In this chapter you'll how to remove all unwanted spaces from your data in one amazing move!
Anatomy of a Good Workbook
Remove Trailing & Leading Spaces Part 1
Remove Trailing & Leading Spaces Part 2
Standardising the Case of Text Entries
Cleaning Text Data
The focus in this chapter is on how to clean your text data in Excel accurately and efficiently
Cleaning Text Data Using Remove Duplicates and Find & Replace
Cleaning Text Data Using Remove Duplicates and VLOOKUP
Cleaning Text Data Using Filter and Find & Replace
Cleaning Numerical Data
The focus in this chapter is on how to clean your numerical data in Excel accurately and efficiently
Identifying Text in Your Numerical Data
Order, Order, We Must Have Order...
In this chapter you'll learn the precise order in which to apply all the techniques you've learned
Order, Order, We Must Have Order…
Data Cleaning Recap
In this chapter you'll recap everything you've learnt and practice it all again!
Data Cleaning Recap