Why does GROUP BY give me too high a row count in BigQuery

I believe I am missing something (probably quite simple) in the use of GROUP BY in BigQuery, and I am hoping someone can set me straight.

Comparing these two queries I get different numbers of users

SELECT SUM(users) FROM (
    SELECT
        DATE,
        COUNT(DISTINCT user_id) AS users,
      FROM
        `mytable`
      WHERE
        DATE BETWEEN ('2022-05-01') AND ('2022-05-31')        
      GROUP BY
        DATE
)

value for users approx: 140000

SELECT
   COUNT(DISTINCT user_id) AS users,
    FROM
      `mytable`
    WHERE
      DATE BETWEEN ('2022-05-01') AND ('2022-05-31')

value for users approx: 120000

Solution

In the second query you're counting the distinct user_id values in the entire date range. In the first query you're counting the distinct user_id values for each day in the range, then summing those. There are probably duplicate users being counted on different days in the first query.

Why 5.0 / 2 returns 2.5000000000000000 (scale of 16) instead of 2.5 (scale of 1)?
SQL - How to SUM and use WHERE clause on this SUM using 4 tables?
The best way of storing many to many hierarchical data in sql
How to group and number SQL results?
How to prevent PostgreSQL from altering my nicely formatted SQL definitions
How to perform division in SQL Server?
Table Primary Key and Foreign Key Relationship Design for Invoices with Summary and Detailed Sections
How to put the select result (column) in an array in PL/SQL
SQL Query to use a Case Statement within and Aggregate Function
How to get date difference from a column?
Prioritization of data in SQL
Operand type clash: date is incompatible with smallint error in sql server
Are GUIDs timely ordered ? If ORDER BY used with a GUID variable type, will records created lately come late?
Find SQL Server Job Related to Existing SSIS Package in Visual Studio
Normalize Google Sheet table (1st step)
How to get column between two values as per user input
Problems selecting multiple column
SQL convert nvarchar to float
MySQL Workbench edit data
Combine overlapping date ranges
How to use values from CTE in ON CONFLICT UPDATE
Expand the date range between two dates
pyspark statistical window function keeps calculating NULL values
sqlDecimal to decimal clr stored procedure Unable to cast object of type 'System.Data.SqlTypes.SqlDecimal' to type 'System.IConvertible'
Sum a record with its previous not-null ordered by timestamp
Prevent redundant results in SQL query
SQL Query in Snowflake - Calculate Days of Supply
How can I find out what FOREIGN KEY constraint references a table in SQL Server?
How to extract column names from SQL query using Python
Select on self-referencing table where all chains meet a condition