UnicodeDecodeError trying to merge all these csv files

Hi, I am having trouble merging .csv files in Jupyter Notebook. The csv files contain hex values, so when I load one of these files on its own I need to use encoding='latin'. I do not know how to do this when I am using the glob module. Here is the code that I am using. ByDistrict79 is the common prefix of all the csv file names, and the files will load; the issue is with pd.concat, where I am getting the error below. The structure of the files is the same in every case. Thank you.

My Code

import pandas as pd
from glob import glob

stock_files = sorted(glob('ByDistrict79_*.csv'))

pd.concat((pd.read_csv(file).assign(filename = file)
          for file in stock_files),ignore_index =True)

Error Code

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa0 in position 14: invalid start byte



  • The pandas read_csv() function accepts an optional encoding parameter. The revised function call would be: pd.read_csv(file, encoding='latin-1')

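    Also, the UnicodeDecodeError is raised by read_csv() itself; it only surfaces at pd.concat() because the generator is evaluated there. Byte 0xa0 is a non-breaking space in Latin-1 and Windows-1252, which suggests the files were saved in one of those encodings rather than UTF-8. A byte-order mark (BOM) is unlikely to be the cause, since a BOM sits at position 0 of a file, not position 14.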
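
    Putting that together with the glob loop from the question, a minimal sketch could look like the following. It assumes every file uses the same single-byte encoding; merge_csv_files is a hypothetical helper name, not part of pandas.

```python
import pandas as pd
from glob import glob


def merge_csv_files(pattern, encoding="latin-1"):
    """Read every CSV matching `pattern` and stack them into one DataFrame.

    `merge_csv_files` is a hypothetical helper; the default encoding is an
    assumption based on the 0xa0 byte reported in the error message.
    """
    files = sorted(glob(pattern))
    frames = (
        # Decoding happens inside read_csv, so the encoding goes here,
        # not on pd.concat. The source path is kept in a 'filename' column.
        pd.read_csv(path, encoding=encoding).assign(filename=path)
        for path in files
    )
    # Raises ValueError if the pattern matched no files.
    return pd.concat(frames, ignore_index=True)
```

    It would be called as merge_csv_files('ByDistrict79_*.csv'). Note that 'latin-1' maps every possible byte to a character, so it never raises a decode error; if the files are really Windows-1252, a few symbols (curly quotes, the euro sign) may come out wrong, and passing encoding='cp1252' may be the better choice.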