Hive select a column based on a second column where the second column values are different

Let's say we have a hive table with 4 different columns and I want to select from it for those values in the first column while make sure the values are different in the second column. Any help or guidance to how to do it?

  ---------------------
  | C1 | C2 | C3 | C4 |
  ---------------------
  | a     1    g.   h |
  | a     1    f.   l |   
  | a     3    t.   p |  
  | b     1    r.   o |  
  | b     1    e.   q |
  | c     1    w.   w |
  | c     2    z.   p |
   -------------------

In the above example, I want to hive select return a and c because their values at C2 are different.

Solution

As I understand your question, you want c1s that have more than one distinct value in c2.

You can group by c1, and use a having clause with count(distinct) to implement the filtering:

select c1 
from mytable
group by c1
having count(distinct c2) > 1;

pyspark statistical window function keeps calculating NULL values
sqlDecimal to decimal clr stored procedure Unable to cast object of type 'System.Data.SqlTypes.SqlDecimal' to type 'System.IConvertible'
Sum a record with its previous not-null ordered by timestamp
Prevent redundant results in SQL query
SQL Query in Snowflake - Calculate Days of Supply
How can I find out what FOREIGN KEY constraint references a table in SQL Server?
How to extract column names from SQL query using Python
Select on self-referencing table where all chains meet a condition
SQL query to check if a name begins and ends with a vowel
In a join, fields with null values are not getting shown but in normal query they are
MySQL JSON_OBJECT() datetime format
How to find overlapping records and select the latest record?
SQL query for returning rows with only certain values in list but not others?
Why does my query fail to convert a varchar value to int?
How do I insert a record into a Postgres DB based off the MAX of one column?
How to auto insert Current DATE in SQL with Java / Hibernate
IsDate Function in SQL evaluates invalid dates as valid
SQL Server - Value passes ISDATE() but fails to CAST as DATE or DATETIME
Snowflake Regex to Camel case file name
SQL Server DATEDIFF round up the YEAR difference. How to round it down?
Path table to HTML treeview in SQL Server
SUM and Decimal precision
Choosing simple products and variable products separately in WooCommerce tables
collation conflict between "Hebrew_CI_AS" and "SQL_Latin1_General_CP1_CI_AS"
Separating fields in the same column
SQL filter for same values in different columns
Add multiple conditional values to one column
How do I repeat a SQL query over a set of records?
SET CONSTRAINTS ALL DEFERRED not working as expected
SQL YEAR(GETDATE())