sql apache-spark-sql databricks azure-databricks

Delete duplicates using Spark SQL

The following query works fine in Databricks Spark SQL:

with CTE1 as
(
  select *,
         row_number() over (partition by ID order by Name) as r
  from Emp
)
select * from CTE1 where r > 1

But for the DELETE statement:

with CTE1 as
(
  select *,
         row_number() over (partition by ID order by Name) as r
  from Emp
)
delete from CTE1 where r > 1

there is an error in the SQL statement:

Analysis exception: Table Not found Emp

Solution

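Deleting through a CTE like this is a SQL Server feature; Spark SQL does not support it. Assuming that Name is unique within each ID and never NULL, you can instead use a correlated subquery that keeps the row with the smallest Name for each ID and deletes the rest: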

delete from emp
    where name > (select min(emp2.name)
                  from emp emp2
                  where emp2.id = emp.id
                 );

Otherwise, use the table's primary key for the comparison.
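
As a minimal sketch of that fallback, assume the table has a unique, non-NULL key column. It is called emp_key here purely for illustration and does not appear in the question. The sketch also assumes emp is stored in a format that supports DELETE (on Databricks, typically a Delta table) and that your runtime accepts a correlated subquery in the DELETE predicate. The second query is only a sanity check that no id occurs more than once afterwards.

```sql
-- emp_key is a hypothetical unique, non-NULL key column (an assumption,
-- not part of the original table definition).
-- For each id, keep the row with the smallest emp_key and delete the rest.
delete from emp
    where emp_key > (select min(emp2.emp_key)
                     from emp emp2
                     where emp2.id = emp.id
                    );

-- Check: lists any id that still has more than one row.
-- An empty result means the duplicates are gone.
select id, count(*) as cnt
from emp
group by id
having count(*) > 1;
```

Comparing on a key that is unique across the whole table avoids the tie problem: with Name, two rows that share both the same id and the same smallest name would both survive the delete.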
