Search code examples
javascriptregexpentahopentaho-spoonpentaho-data-integration

Pentaho Text Input multiple languages English and Japanese Characters Loaded into MSSQL data table separate columns


I am trying to load a SQL table with a text file. I am using the Text File Input Step. All but one column may contain Japanese characters(Comment). How can I pass all English text and Japanese text as readable characters? Is there a regex or Java Script code that will pass all of these characters or some other step that Pentaho offers? I prefer not to alter the server if possible.

Sample Data

I am at a loss.

Thanks in advance.


Solution

  • I found that it is a simple as setting the encoding to Shift_JS

    Pentaho soln.