Lesson 4.5 – Removing Duplicates
Duplicate values can cause errors, incorrect calculations, and misleading analysis. Excel provides a simple and reliable tool to remove duplicates from your dataset in just a few clicks. In this lesson, you will learn how to identify and remove duplicate rows safely.
SEO Description
Learn how to remove duplicate values in Excel using the built‑in Remove Duplicates tool to clean data quickly and accurately.
Publication date: 19 March 2025
1. What Are Duplicates?
A duplicate occurs when one or more rows contain the same information. Duplicates often appear when data is imported, copied from other files, or collected from multiple sources.
Examples of duplicates:
- Two identical customer names
- Repeated product codes
- Duplicate email addresses
- Rows with the same values across all columns
2. How to Remove Duplicates
Steps:
- Select your dataset (or click inside an Excel Table).
- Go to Data → Remove Duplicates.
- Choose the columns you want Excel to check.
- Click OK.
Excel will remove duplicate rows and keep only the first occurrence.
3. Choosing Which Columns to Check
You can decide whether Excel should check:
- A single column (e.g., remove duplicate email addresses)
- Multiple columns (e.g., same Name + Date)
- All columns (entire row must match)
Example:
If you select only the “Email” column, Excel removes rows with repeated emails even if other values differ.
4. Removing Duplicates in an Excel Table
If your data is formatted as a Table, the process is even easier:
- Click anywhere inside the Table.
- Go to Table Design → Remove Duplicates.
- Select the columns to check.
Excel automatically updates the Table after removing duplicates.
5. Important Notes
- Removing duplicates is permanent — deleted rows cannot be restored unless you undo.
- Always check the correct columns before confirming.
- Consider making a backup copy of your data.
- Use filters first if you want to preview duplicates before removing them.
6. Alternative: Highlight Duplicates First
If you want to see duplicates before removing them, use Conditional Formatting:
Home → Conditional Formatting → Highlight Cell Rules → Duplicate Values
This highlights duplicates without deleting anything.
7. Practical Exercise
- Create a worksheet named Lesson_4_5_Practice.
- Enter a list of at least 20 items with intentional duplicates.
- Use Conditional Formatting to highlight duplicates.
- Remove duplicates using Data → Remove Duplicates.
- Repeat the process by selecting different columns.
- Try removing duplicates inside an Excel Table.
Comments
Post a Comment