I want to extract some data based on date in an imported excel file in python. I want to be able to give the start date and end date and get the data for that specific period. I tried different ways to install pandas_datareader to use the following code, but I could not.
data = web.DataReader(dataset,start='', end='')
So, here is my code.
import pandas as pd
import datetime
data = pd.read_excel('file.xlsx')
start = datetime.datetime(2009,1,1)
end = datetime.datetime(2018,1,1)
#reshape based on date
set_index = data.set_index('date')
How can I fix this problem? :(
import pandas as pd
import datetime
"""
ticker,date,closeunadj
ZF,2018-11-28,9.22
ZF,2018-11-27,9.16
ZF,2018-11-26,9.23
"""
df = pd.read_clipboard(sep=",", parse_dates=["date"]).set_index("date")
df
Output:
ticker closeunadj
date
2018-11-28 ZF 9.22
2018-11-27 ZF 9.16
2018-11-26 ZF 9.23
You can then use .loc
to subset the index to the date range you like.
df.loc[(df.index < datetime.datetime(2018, 11, 28)) & (df.index > datetime.datetime(2018, 11, 26))]
Output:
ticker closeunadj
date
2018-11-27 ZF 9.16