How do you find duplicates in a table?

August 19, 2025


Finding duplicates in a table is a common SQL task: you look for rows that share the same values in one or more columns. Typical approaches are GROUP BY with HAVING, a window function such as ROW_NUMBER(), or an EXISTS subquery.

🔑 1. Using `GROUP BY` and `HAVING`

This method groups rows by the column(s) you want to check and filters groups having more than one record.

Example (find duplicate emails in the users table):


SELECT email, COUNT(*) AS occurrences
FROM users
GROUP BY email
HAVING COUNT(*) > 1;

✅ Returns only emails appearing more than once.
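To try the query end to end, here is a minimal sketch that runs it against an in-memory SQLite database from Python. The table and column names follow the example above; the sample rows and the `occurrences` alias are invented for illustration.

```python
import sqlite3

# Hypothetical sample data: only the table and column names come from the
# article's example; the rows are made up for this demo.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, email TEXT)")
conn.executemany(
    "INSERT INTO users (id, name, email) VALUES (?, ?, ?)",
    [
        (1, "Asha", "asha@example.com"),
        (2, "Ravi", "ravi@example.com"),
        (3, "Asha K", "asha@example.com"),
        (4, "Meera", "meera@example.com"),
        (5, "Ravi T", "ravi@example.com"),
        (6, "A. Kumar", "asha@example.com"),
    ],
)

# One output row per email that appears more than once, with its count.
duplicate_emails = conn.execute(
    """
    SELECT email, COUNT(*) AS occurrences
    FROM users
    GROUP BY email
    HAVING COUNT(*) > 1
    ORDER BY email
    """
).fetchall()

print(duplicate_emails)
# [('asha@example.com', 3), ('ravi@example.com', 2)]
```

Note that the result lists each duplicated value once; it does not tell you which individual rows carry that value.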

🔑 2. Using `ROW_NUMBER()` (Window Function)

Number the rows within each group of equal values (the partition). Any row numbered 2 or higher is a duplicate of the first row in its group.


SELECT id, name, email
FROM (
  SELECT id, name, email,
         ROW_NUMBER() OVER (PARTITION BY email ORDER BY id) AS row_num
  FROM users
) t
WHERE row_num > 1;

✅ Returns duplicate rows beyond the first occurrence.
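Because this query returns every copy except the first, it is a natural starting point for cleanup. The sketch below runs it in SQLite from Python and then deletes the extra rows; it assumes SQLite 3.25 or newer (the first version with window functions), and the sample rows are invented for illustration.

```python
import sqlite3

# Hypothetical sample data, same shape as the article's users table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, email TEXT)")
conn.executemany(
    "INSERT INTO users (id, name, email) VALUES (?, ?, ?)",
    [
        (1, "Asha", "asha@example.com"),
        (2, "Ravi", "ravi@example.com"),
        (3, "Asha K", "asha@example.com"),
        (4, "Meera", "meera@example.com"),
        (5, "Ravi T", "ravi@example.com"),
        (6, "A. Kumar", "asha@example.com"),
    ],
)

# Rows numbered 2+ within each email partition are the extra copies.
extra_rows = conn.execute(
    """
    SELECT id, name, email
    FROM (
      SELECT id, name, email,
             ROW_NUMBER() OVER (PARTITION BY email ORDER BY id) AS row_num
      FROM users
    ) t
    WHERE row_num > 1
    ORDER BY id
    """
).fetchall()
print([row[0] for row in extra_rows])  # [3, 5, 6]

# Keep the lowest id per email and delete the rest.
conn.execute(
    """
    DELETE FROM users
    WHERE id IN (
      SELECT id
      FROM (
        SELECT id,
               ROW_NUMBER() OVER (PARTITION BY email ORDER BY id) AS row_num
        FROM users
      ) t
      WHERE row_num > 1
    )
    """
)
remaining_ids = [row[0] for row in conn.execute("SELECT id FROM users ORDER BY id")]
print(remaining_ids)  # [1, 2, 4]
```

The `ORDER BY id` inside `OVER (...)` decides which row counts as the "first" one; ordering by a timestamp column instead would keep the earliest or latest record.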

🔑 3. Using `EXISTS` (Correlated Subquery)


SELECT u1.*
FROM users u1
WHERE EXISTS (
  SELECT 1
  FROM users u2
  WHERE u1.email = u2.email
  AND u1.id <> u2.id
);

✅ Returns all rows that have at least one duplicate.
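Unlike the previous two queries, this one returns every member of a duplicate group, including the first occurrence. A minimal sketch that shows this in SQLite from Python, again with invented sample rows:

```python
import sqlite3

# Hypothetical sample data, same shape as the article's users table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, email TEXT)")
conn.executemany(
    "INSERT INTO users (id, name, email) VALUES (?, ?, ?)",
    [
        (1, "Asha", "asha@example.com"),
        (2, "Ravi", "ravi@example.com"),
        (3, "Asha K", "asha@example.com"),
        (4, "Meera", "meera@example.com"),
        (5, "Ravi T", "ravi@example.com"),
        (6, "A. Kumar", "asha@example.com"),
    ],
)

# A row qualifies if some other row (different id) has the same email.
rows_with_duplicates = conn.execute(
    """
    SELECT u1.*
    FROM users u1
    WHERE EXISTS (
      SELECT 1
      FROM users u2
      WHERE u1.email = u2.email
        AND u1.id <> u2.id
    )
    ORDER BY u1.id
    """
).fetchall()

ids_with_duplicates = [row[0] for row in rows_with_duplicates]
print(ids_with_duplicates)  # [1, 2, 3, 5, 6]
```

Only the row with id 4 is missing from the result, because its email is unique.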

⚡ Key Notes

Use GROUP BY + HAVING when you only need duplicate values.
Use ROW_NUMBER() / RANK() when you need the individual extra rows, for example to delete every copy except the first.
Use EXISTS / SELF JOIN for flexibility in complex queries.
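The self-join mentioned in the last note is not shown above, so here is a minimal sketch of one way to write it, run in SQLite from Python with invented sample rows. One thing to watch: a row with several duplicates matches several partners, so the join needs `DISTINCT` to return each row once.

```python
import sqlite3

# Hypothetical sample data, same shape as the article's users table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, email TEXT)")
conn.executemany(
    "INSERT INTO users (id, name, email) VALUES (?, ?, ?)",
    [
        (1, "Asha", "asha@example.com"),
        (2, "Ravi", "ravi@example.com"),
        (3, "Asha K", "asha@example.com"),
        (4, "Meera", "meera@example.com"),
        (5, "Ravi T", "ravi@example.com"),
        (6, "A. Kumar", "asha@example.com"),
    ],
)

# Without DISTINCT, each row appears once per matching partner row.
raw_join_ids = [
    row[0]
    for row in conn.execute(
        """
        SELECT u1.id
        FROM users u1
        JOIN users u2
          ON u1.email = u2.email
         AND u1.id <> u2.id
        ORDER BY u1.id
        """
    )
]
print(len(raw_join_ids))  # 8

# With DISTINCT, the result matches the EXISTS query: one row per user.
self_join_ids = [
    row[0]
    for row in conn.execute(
        """
        SELECT DISTINCT u1.id
        FROM users u1
        JOIN users u2
          ON u1.email = u2.email
         AND u1.id <> u2.id
        ORDER BY u1.id
        """
    )
]
print(self_join_ids)  # [1, 2, 3, 5, 6]
```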

👉 Summary:
To find duplicates, group records with GROUP BY + HAVING COUNT(*) > 1, or use window functions to get the individual duplicate rows. The best method depends on whether you need just the duplicated values or the rows themselves.
