The Hive - Learn, Help, Connect

Data Cleaning Bootcamp - in Excel, Python and R

Chapter 2 - Removing Unwanted Spaces

Lesson 13 - Practice Session - Excel, Python & R

Practicing in Excel

Here's where you get chance to put into practice what you've just learnt.

Hot Tip

Data cleaning isn't an academic exercise - you have to do it and practice it to gain experience.


Don't skip these exercises!


This is your chance to really take your data cleaning skills to the next level!

Before you get started, download the following Excel files to your computer:

Exercise 1: Remove Unwanted Spaces

All the Excel code you need is contained within the Excel Practice File file.

Practice removing unwanted spaces fro large datasets using the steps outlined in the video lecture, in particular, note the advantages and disadvantages of using it.

Under what circumstances would you use it, and when would you use some other technique?

This is a critically important technique – practice it and master it!

Coding in Python and/or R

In the last lesson you learnt how to use the Excel formulae TRIM, CLEAN and SUBSTITUTE.

Investigate what alternatives exist in your choice of Python and/or R.

Exercise 2: Single Solution

Can you effectively combine your alternatives to TRIM(), CLEAN() and SUBSTITUTE() to create a viable alternative in Python and/or R?

Go ahead and do that and submit your solution it to the relevant forum page:

Remember Me
Success message!
Warning message!
Error message!