How to Find & Remove Duplicate Rows in Excel - Excel Untold


How to Find & Remove Duplicate Rows in Excel

 


A very common use of Excel is as a tool for cleaning up data. Cleaning up data means:

  • Getting raw data into a worksheet
  • Manipulating data so it conforms to various requirements

In summary, data is cleaned up so it can be properly analyzed.

Data can be cleaned up in many different ways, and we shall discuss those techniques one by one. In this post, we confine ourselves to removing duplicate rows.

Remove Duplicate Rows in Excel

Your data may contain duplicate rows for any number of reasons, and most of the time you will need to eliminate them. In the old days, removing duplicate data was largely a manual task, although it could also be done with advanced techniques. With the Remove Duplicates command, introduced in Excel 2007, removing duplicates is now an easy job.

Now, see the following image. You will find that the column contains some duplicate rows. We shall remove the duplicates.


Removing duplicate rows. Observe that the column contains some duplicate rows.

Select any cell in column A of your sample file. Then choose Data ➪ Data Tools ➪ Remove Duplicates. The Remove Duplicates dialog box will appear, as shown in the following image.

Remove Duplicates dialog box. Column A is automatically selected.

Note: If your data is in a table, you can also use the Table Tools ➪ Design ➪ Tools ➪ Remove Duplicates command to remove duplicate rows. The two commands work in exactly the same way.

The Remove Duplicates dialog box lists all the columns in your data range or table. Place check marks next to the columns that you want to include in the duplicate search. Generally, you will want to include all the columns, and that is the default. Click OK, and Excel removes the duplicate rows and shows a message telling you how many duplicate rows have been removed.

Click OK. A message tells you how many duplicate rows have been removed.

Excel does not give you a chance to change your mind before the rows are deleted. But if you want to undo the action, you can click the Undo button in the Quick Access toolbar or just press CTRL + Z.

When duplicate rows are found, the first occurrence is kept and the subsequent duplicate rows are deleted.
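The keep-first rule described above can be illustrated outside Excel with a short Python sketch. This is not how Excel implements the command; the function name `remove_duplicate_rows` and the `key_columns` parameter are hypothetical, chosen only to mirror the check boxes in the Remove Duplicates dialog box, and the sample rows are made up. The sketch compares values exactly, which is a simplification.

```python
def remove_duplicate_rows(rows, key_columns=None):
    """Return (kept_rows, removed_count), keeping the first row of each duplicate group.

    rows: list of tuples, one tuple per worksheet row (illustrative data model).
    key_columns: indexes of the columns to compare; None means compare all
    columns, which mirrors the dialog box default of every column being checked.
    """
    seen = set()
    kept = []
    for row in rows:
        if key_columns is None:
            key = tuple(row)
        else:
            key = tuple(row[i] for i in key_columns)
        if key in seen:
            continue  # a later duplicate of a row already kept: drop it
        seen.add(key)
        kept.append(row)
    return kept, len(rows) - len(kept)


# Made-up sample data: the third row repeats the first one.
rows = [
    ("Apple", 10),
    ("Banana", 20),
    ("Apple", 10),
    ("Apple", 30),
]

kept, removed = remove_duplicate_rows(rows)
print(kept)     # rows that survive, in their original order
print(removed)  # number of rows removed, like the message Excel shows
```

Passing `key_columns=[0]` compares only the first column, so `("Apple", 30)` is also dropped because an earlier row already has "Apple" in that column.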

Note: Excel identifies duplicates on the basis of what cells display, not what they contain. For example, 5/12/2013 and 5 Dec 2013 are the same value to Excel: in the following image, you will see that both produce the same value in the formula bar. (You get the second date format, 5 Dec 2013, if you select “Long Date” from the number format options.) Because the two cells display differently, however, Remove Duplicates does not treat them as duplicates. In the same way, $500.50 is treated as different from 500.50, though the first is just the formatted display of the value 500.50. If you want 5/12/2013 and 5 Dec 2013 to be counted as duplicates, apply the same format to the whole column first, then remove the duplicate rows.

Long date format used. The cell contains 5/12/2013 but displays 5 Dec 2013.


Currency format is used. The cell contains 500.5 but displays £500.50 because of the Currency format.
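The point of the note above, that duplicates are judged by displayed text rather than by the stored value, can be shown with a small Python sketch. It only models the idea and is not Excel's own logic; the helper names `short_date` and `long_date` are hypothetical stand-ins for two number formats, and the date is read as 5 December 2013, as in the example.

```python
from datetime import date

MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]


def short_date(d):
    """Stand-in for a short date format, e.g. 5/12/2013 (day/month/year)."""
    return f"{d.day}/{d.month}/{d.year}"


def long_date(d):
    """Stand-in for a long date format, e.g. 5 Dec 2013."""
    return f"{d.day} {MONTHS[d.month - 1]} {d.year}"


stored = date(2013, 12, 5)

# Two cells holding the same stored value but formatted differently.
cells = [(stored, short_date), (stored, long_date)]

displayed = [fmt(value) for value, fmt in cells]
distinct_by_display = len(set(displayed))               # compared as shown on screen
distinct_by_value = len({value for value, _ in cells})  # compared as stored

# Applying one format to the whole column makes the displayed text match.
uniform = [long_date(value) for value, _ in cells]
distinct_after_reformat = len(set(uniform))

print(displayed)
print(distinct_by_display, distinct_by_value, distinct_after_reformat)
```

Compared as displayed, the two cells count as two distinct entries; compared as stored, or after the same format is applied to both, they collapse to one. That is why formatting the whole column first lets the duplicates be found.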

Read More: How to Remove Duplicate Rows Based on One Column in Excel


Download Working File

Download the working file from the link below:

Duplicate-rows1.xlsx
