Search code examples
excelschematransformationtalend

Talend import, transform a raw file and export to CSV


My data consists of 6 million rows. There is no file extension. Would like to export it to a csv by adding a column of "UPC" which is a combination of 4 fields. input format:

the raw file

REQUIRED OUTPUT FORMAT in CSV

transformed raw file

UPC = SY + GE + VEND + ITEM

Opened it with Excel but because it is more than 6 million rows, it gets truncated at 1 million rows. I have an issue with talend's schema since the there is no delimiter.


Solution

  • First of all create a metadata columns manually (if you import the file, since it is a huge file talend gets hanged). So create the column names in a metadata (verify first snap shot). Once done, drag the metadata as a tFileInputDelimited and make sure you provide delimiter as space (verify second snap shot). In tmap create new port and concatinate the columns as required SY + GE + VEND + ITEM and pass it to a tFileOutputDelimited where you should provide delimeter as , . Let me know if you face any issues. enter image description here

    enter image description here

    Thank you