Search code examples
google-cloud-platformgoogle-cloud-data-fusion

Reading Excel with edit on read in DataFusion


I am reading an excel file with google DataFusion Wrangler plugin. In the excel the first row needs to be discarded, as headers and data start from second row.
Problem is when Wrangler reads and parse-as-excel a file, it gives default option of choosing the first row as header.
Need some help to isolate such that first row is skipped and header is 2nd row with the data following.
Thanks for the help!


Solution

  • This behavior is currently not supported by the Wrangler plugin. As you are already aware, Wrangler will only take a look at the first column to decode headers.

    In this case, pre-processing the file to remove the first row is the easiest solution.