Setup:
Situation:
Azure VM --> LS_CSV queries, all work well.
Azure VM --> LS_Delta: Queries work well, unless they involve a join on STRING column, then they'll hit a point and just hang (presumably forever. 5 second query still churning 2 hours later). It only takes one such join, to 'ruin' the query. These queries run fine if ran from within that SQL endpoint database - just not over the linked server
More Info:
What I've tried:
Mostly got it (via MS Support)
The External Tables, pointed at the Delta tables, were using varchar() for the string columns. Switching the columns to nvarchar() seems to fix it.
Why this is the case, is not clear though.
It also goes against the documentation [per this link on Best Practices, from MS][1]
Use the varchar type with some UTF8 collation if you're reading data from Parquet, Azure Cosmos DB, Delta Lake, or CSV with UTF-8 encoding.
But [shrug].
[1]: https://learn.microsoft.com/en-us/azure/synapse-analytics/sql/best-practices-serverless-sql-pool