Search code examples
pentahopentaho-spoonpentaho-data-integration

Merge Rows (diff) is comparing row by row, not one row to entire rows of other table


I am comparing two sheets using Merge Rows (diff).

1st excel sheet:

enter image description here

2nd excel sheet:

enter image description here

and my pentaho transaction:

enter image description here

in preview data showing, that id 2.0 at 2nd row is add new row and at 4 row its showing same data is delete, its suppose to identical, so how it can be achieve.


Solution

  • Merge rows (diff) requires both input streams to be sorted by the merge keys (there's a warning about it when you edit the step's properties).

    Put a sort rows step in each stream ahead of the Merge Rows (diff) step.