Search code examples
sqlsql-serverdistinctinner-joinduplicates

Eliminating duplicate values based on only one column of the table


My query:

SELECT sites.siteName, sites.siteIP, history.date
FROM sites INNER JOIN
     history ON sites.siteName = history.siteName
ORDER BY siteName,date

First part of the output:

enter image description here

How can I remove the duplicates in siteName column? I want to leave only the updated one based on date column.

In the example output above, I need the rows 1, 3, 6, 10


Solution

  • This is where the window function row_number() comes in handy:

    SELECT s.siteName, s.siteIP, h.date
    FROM sites s INNER JOIN
         (select h.*, row_number() over (partition by siteName order by date desc) as seqnum
          from history h
         ) h
        ON s.siteName = h.siteName and seqnum = 1
    ORDER BY s.siteName, h.date