Dataframe df_implied_full
has several columns, one of them is called 'USDZARV1Y Curncy'
, and it has only floats
.
This code works:
mad = lambda x: np.median(np.fabs(x - np.median(x)))
df_implied_full['madtest'] = df_implied_full['USDZARV1Y Curncy'].rolling(window=60).apply(mad)
This code doesn't work:
test = lambda x: (x - np.median(x))
df_implied_full['rolltest2'] = df_implied_full['USDZARV1Y Curncy'].rolling(window=60).apply(test)
The error shown is:
File "pandas\algos.pyx", line 1831, in pandas.algos.roll_generic (pandas\algos.c:51581)
TypeError: only length-1 arrays can be converted to Python scalars
I'm using Pandas 0.18.1 and python 2.7.12
What is wrong with my code?
There is problem output of x
in lambda x: (x ...
is numpy array
, so if use only test = lambda x: x
numpy array cannot be converted to scalar values per each row. I think you need return scalar value only e.g. use x[0]
or np.median(x)
. The best is use custom function and test it.
Sample with window=2
:
import pandas as pd
import numpy as np
df_implied_full = pd.DataFrame({'USDZARV1Y Curncy': [1.2,4.6,7.3,4.9,1.5]})
print (df_implied_full)
def test (x):
print (x)
#[ 1.2 4.6]
#[ 4.6 7.3]
#[ 7.3 4.9]
#[ 4.9 1.5]
print (type(x))
#<class 'numpy.ndarray'>
#<class 'numpy.ndarray'>
#<class 'numpy.ndarray'>
#<class 'numpy.ndarray'>
#Return only first value of list
return x[0]
mad = lambda x: np.median(np.fabs(x - np.median(x)))
df_implied_full['madtest'] = df_implied_full['USDZARV1Y Curncy'].rolling(window=2).apply(test)
print (df_implied_full)
USDZARV1Y Curncy madtest
0 1.2 NaN
1 4.6 1.2
2 7.3 4.6
3 4.9 7.3
4 1.5 4.9
def test (x):
def test (x):
print (x)
#[ 1.2 4.6]
#[ 4.6 7.3]
#[ 7.3 4.9]
#[ 4.9 1.5]
#Return median as scalar
return np.median(x)
mad = lambda x: np.median(np.fabs(x - np.median(x)))
df_implied_full['madtest'] = df_implied_full['USDZARV1Y Curncy'].rolling(window=2).apply(test)
print (df_implied_full)
USDZARV1Y Curncy madtest
0 1.2 NaN
1 4.6 2.90
2 7.3 5.95
3 4.9 6.10
4 1.5 3.20