I have a CSV file in GCS with hundreds of columns, and every field is enclosed in double quotes, like below:
"John","Doe","5/15/2021 7:18:26 PM"
I need to load this into BigQuery using Data Fusion, and I have created a pipeline for it. My question is
Please guide me. Any good reading on these kinds of operations would also be helpful.
For testing purposes I used your sample data and added a few more entries.
If your data looks like this and your objective is to just remove the quotes from your data, what you can do is:
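As a rough illustration of what this step does to each record, here is a minimal Python sketch (plain Python, not Wrangler syntax). In Wrangler, the corresponding directive is, as far as I know, `find-and-replace` applied to the `body` column. The second sample row and the helper name `strip_quotes` are made up for the example.

```python
# Raw lines as they would appear in the "body" column.
# The second row is invented purely for illustration.
rows = [
    '"John","Doe","5/15/2021 7:18:26 PM"',
    '"Jane","Roe","6/1/2021 9:05:00 AM"',
]


def strip_quotes(line: str) -> str:
    """Remove every double-quote character from a raw CSV line."""
    return line.replace('"', "")


cleaned = [strip_quotes(row) for row in rows]
for line in cleaned:
    print(line)
```

Note that blindly removing quotes is only safe when no field contains a comma or a quote character of its own. If some fields do, a quote-aware CSV parser is the safer route.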
Your output will look like this:
You can then convert your CSV to columns:
This tells Wrangler to read the data as CSV and split it into columns at each comma. The original data, however, remains in the body column.
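The effect of this step can be sketched in plain Python as follows. This is only an emulation, assuming the Wrangler directive in question is `parse-as-csv`, and assuming the new columns are named `body_1`, `body_2`, and so on when no header row is used. The helper name `parse_as_csv` is hypothetical.

```python
import csv


def parse_as_csv(record: dict, column: str = "body") -> dict:
    """Split one CSV-formatted column into numbered columns, keeping the original."""
    # csv.reader takes an iterable of lines; we feed it a single line.
    fields = next(csv.reader([record[column]]))
    result = dict(record)  # copy, so the source column is kept
    for index, value in enumerate(fields, start=1):
        result[f"{column}_{index}"] = value
    return result


record = {"body": "John,Doe,5/15/2021 7:18:26 PM"}
parsed = parse_as_csv(record)
print(parsed)
```

The record still carries `body` alongside the new columns, which is why a separate drop step is needed afterwards.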
To delete the body column:
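In Wrangler this is, to my knowledge, done with the `drop` directive on the `body` column. A minimal Python sketch of the same idea, with a hypothetical helper `drop_column` and an example record whose `body_N` column names are assumed:

```python
def drop_column(record: dict, name: str) -> dict:
    """Return a copy of the record without the named column."""
    return {key: value for key, value in record.items() if key != name}


# Example record after the CSV split; column names are assumed.
parsed = {
    "body": "John,Doe,5/15/2021 7:18:26 PM",
    "body_1": "John",
    "body_2": "Doe",
    "body_3": "5/15/2021 7:18:26 PM",
}

final = drop_column(parsed, "body")
print(final)
```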
Your data should now look like this: