My data consists of 6 million rows, in a file with no extension. I would like to export it to a CSV with an added "UPC" column, which is a combination of 4 fields. Input format:
Required output format in CSV:
UPC = SY + GE + VEND + ITEM
I opened it in Excel, but because the file has more than 6 million rows, it gets truncated at 1 million rows. I also have an issue with Talend's schema, since there is no delimiter.
First of all, create the metadata columns manually (if you import the file, Talend will hang because the file is huge). So define the column names in a metadata entry (see the first screenshot).
Once done, drag the metadata onto the job as a tFileInputDelimited and make sure you set the delimiter to a space (see the second screenshot). In the tMap, create a new output with a column that concatenates the fields as required (SY + GE + VEND + ITEM), and pass it to a tFileOutputDelimited, where you should set the delimiter to a comma (,).
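For anyone who wants to check the transformation outside Talend, here is a minimal Python sketch of the same logic: read space-delimited rows, build UPC from the four fields, and write comma-delimited output. It is an illustration, not the Talend job itself. The function name `add_upc` is hypothetical, and the column list is an assumption (the actual input layout is not shown in the question), as is the choice to write a header row and to put UPC first.

```python
import csv
import io

# ASSUMPTION: the real column order is not shown in the question.
# Replace this list with the actual columns of the file, in order.
COLUMNS = ["SY", "GE", "VEND", "ITEM"]

# The four fields that make up the UPC, as stated in the question.
UPC_PARTS = ["SY", "GE", "VEND", "ITEM"]


def add_upc(src, dst, columns=COLUMNS):
    """Stream whitespace-delimited rows from src to dst as CSV, adding a UPC column."""
    writer = csv.writer(dst, lineterminator="\n")
    writer.writerow(["UPC"] + list(columns))  # header row (an assumption)
    for line in src:
        fields = line.split()  # splits on any run of whitespace
        if not fields:
            continue  # skip blank lines
        row = dict(zip(columns, fields))
        # Values stay strings, so leading zeros are preserved.
        upc = "".join(row[name] for name in UPC_PARTS)
        writer.writerow([upc] + fields)


if __name__ == "__main__":
    # Tiny in-memory demo with made-up values.
    sample = io.StringIO("01 23 4567 89012\n")
    out = io.StringIO()
    add_upc(sample, out)
    print(out.getvalue(), end="")
```

Because the rows are processed one at a time, memory use stays flat regardless of file size. For a real file, pass objects from `open(path, newline="")` for both input and output instead of the in-memory buffers.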