Search code examples
mysqlgroup-byleft-joingroup-concat

MySQL group by with left join


I am trying to do a very complex query (at least extremely complex for me not for YOU :) )

I have users and comments table.

SQL Fiddle: http://sqlfiddle.com/#!9/b1f845/2

select user_id, status_id from comments where user_id in (2,3);
+---------+-----------+
| user_id | status_id |
+---------+-----------+
|       2 |        10 |
|       2 |        10 |
|       2 |        10 |
|       2 |         7 |
|       2 |         7 |
|       2 |        10 |
|       3 |         9 |
|       2 |         9 |
|       2 |         6 |
+---------+-----------+

If I use

select user_id, status_id from comments where user_id in (2,3)

It returns a lot of duplicate values.

What I want to get if possible.

If you see status_id = 10 has user_id= 2,3 and 4 and 2 multiple times. So from here I want to get maximum of latest user_id (unique) so for example,

it will be user_id = 4 and 2 now the main complex part. I now want to get users information of user_id= 4 and 2 in one column so that at the end I can get something like this

status_id |  userOneUserName | userTwoUserName
 10            sadek4             iamsadek2
 ---------------------------------------------
 7        |    iamsadek2     |      null
 ---------------------------------------------
 9 .      |    iamsadek2     |      sadek2
 ---------------------------------------------
 6        |    iamsadek2     |       null

How can I achieve such a complex things. Currently I have to do it using application logic.

Thank you for your time.


Solution

  • I think this might be what you literally want here:

    SELECT DISTINCT
        status_id,
        (SELECT MAX(user_id) FROM comments c2 WHERE c1.status_id = c2.status_id) user_1,
        (SELECT user_id FROM comments c2 WHERE c1.status_id = c2.status_id
         ORDER BY user_id LIMIT 1 OFFSET 1) user_2
    FROM comments c1
    WHERE user_id IN (2,3);
    

    enter image description here

    Demo

    (your update Fiddle)

    We can use correlated subqueries to find the max user_id and second-to-max user_id for each status_id, and then spin each of those out as two separate columns. Using a GROUP_CONCAT approach might be preferable here, since it would also allow you to easily accommodate any numbers of users as a CSV list.

    Also, if you were using MySQL 8+ or greater, then we could take advantage of the rank analytic functions, which would also be easier.