Search code examples
sqldatabasepostgresqlpentahopentaho-data-integration

Pentaho Input Text Set Field Null to Data Before


It is hard to explain this in word, so sorry if the title is mismatch.

So I have a file text like this:

Header A     Header B     Header C    Header D
Apple        Red          A           100
                          B           200
                          B           300
             Green        A           400
                          B           500
Eggplant     Purple       A           600
                          B           700
             White        a           800

And I want it to be read like this:

Header A     Header B     Header C    Header D
Apple        Red          A           100
Apple        Red          B           200
Apple        Red          B           300
Apple        Green        A           400
Apple        Green        B           500
Eggplant     Purple       A           600
Eggplant     Purple       B           700
Eggplant     White        a           800

But I confuse what transformation that I have to use. So how to get this result in Pentaho? Or is there video tutorial I have to watch? Thanks Before.

I have tried transformation "If Value is null" and "Filter Rows", but it doesn't seems the solution.


Solution

  • You can do it easily in Pentaho Data Integration with the text file input step. On the definition of fields just set the "Repeat" property to "Y".

    If the field is null it will repeat the last non-null value and you get the results you're after.