How do I find rows that don't exist in a table using a Hive SQL query?

My data is like this:

ID	Stage_1	Stage_2
1	A	F
1	B	G
1	C	H
2	A	F
2	B	G
2	C	H
3	A	F
3	B	G
4	A	F
4	B	G

I want to find the number of unique IDs for which a row with Stage_1 = A exists but no row with Stage_2 = H exists. Here, ID = 3 and ID = 4 both have A in Stage_1, but neither has H in Stage_2.

So the expected result here would be 2.

I don't think an outer join applies here, because all the data comes from a single table that I query through the Hive SQL terminal.

Solution

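You can still use an outer join with a single table: left join the table to itself on ID, keeping only the Stage_2 = 'H' rows on the right side, and count the IDs for which no match was found.

SELECT COUNT(DISTINCT t1.ID)
FROM your_table t1
LEFT JOIN your_table t2
  ON t2.ID = t1.ID            -- match rows belonging to the same ID
    AND t2.Stage_2 = 'H'      -- but only those where Stage_2 is H
WHERE t1.Stage_1 = 'A'
AND t2.ID IS NULL             -- no H row was found for this ID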
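
If you would rather avoid the self-join, conditional aggregation is another option. The query below is only a sketch: it assumes the same placeholder table name your_table and the column names from the question, and the aliases ids and id_count are made up for illustration. It groups by ID, keeps the groups that have at least one Stage_1 = 'A' row and no Stage_2 = 'H' row, and then counts those groups.

```sql
-- Sketch, assuming the table is called your_table (placeholder name).
SELECT COUNT(*) AS id_count
FROM (
  SELECT ID
  FROM your_table
  GROUP BY ID
  -- at least one row with Stage_1 = 'A' for this ID
  HAVING SUM(CASE WHEN Stage_1 = 'A' THEN 1 ELSE 0 END) > 0
  -- and no row with Stage_2 = 'H' for this ID
     AND SUM(CASE WHEN Stage_2 = 'H' THEN 1 ELSE 0 END) = 0
) ids
```

On the sample data in the question, only ID = 3 and ID = 4 pass both conditions, so this should return 2. It reads the table once, which may matter on large tables.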
