To be clear this is not a requirement to identify and redirect or dedupe rows in the data flow task leveraging the Sort Transformation. Instead, the requirement is to identify the duplicate values in a column then handle affected rows differently. Using the attached mock-up table, ProductID 1002 will quality as duplicates, therefore rows 3 and 4 should flow down a different path. Likewise, ProductCode A01 resulting in rows 2 and 6 being redirected.
Any help is appreciated. Thanks!
I tried splitting the dataset then performing joins but that didn't work as intended.
If this is a large dataset, the sorts will cause performance issues. However, if you must do this in SSIS instead of the database, this is what it could look like:
(*)
as Count All
(DT_I4)[Count all] > 1