Why does np.percentile return NaN for high percentiles?

This code:

print len(my_series)
print np.percentile(my_series, 98)
print np.percentile(my_series, 99)

gives:

14221  # This is the series length
1644.2  # 98th percentile
nan  # 99th percentile?

Why does 98 work fine but 99 gives nan?

Solution

np.percentile treats nan's as very high numbers. So the high percentiles will be in the range where you will end up with a nan. In your case, between 1 and 2 percent of your data will be nan's (98th percentile will return you a number (which is not actually the 98th percentile of all the valid values) and the 99th will return you a nan).

To calculate the percentile without the nan's, you can use np.nanpercentile()

So:

print(np.nanpercentile(my_series, 98))
print(np.nanpercentile(my_series, 99))

Edit: In new Numpy version, np.percentile will return nan if nan's are present, so making this problem directly apparent. np.nanpercentile still works the same. `

Webscraping Roblox
Remove the mandatory field label 'This field is required.' and fix the bug with 'clean_email'
How to plot a Probability Density Function in Python?
How large is a fresh install of Python?
Appending new elements into an empty list
Simple way to measure cell execution time in ipython notebook
PyAudio working, but spits out error messages each time
Reportlab show page number and page count IF there is more than one page in a document
How to read SharePoint Online (Office365) Excel files into Python specifically pandas with Work or School Account?
How to set a column which suffix name is based on a value in another column
Debugging Python C++ extension from Visual Studio Code on Linux
How can I get all users on Google admin_sdk?
csv.Error: iterator should return strings, not bytes
How to check if an object has an attribute?
How to use selenium with proxy auth in headless mode?
Is there a way to exit a pytest test and continue to the next one?
Returning the lowest index for the first non whitespace character in a string in Python
Formatting exceptions as Python does
Prime factorization using list comprehension in Python
Why does the power spectrum E(k) of my velocity field follow 𝑘 ^(−(n−1)) instead of 𝑘^(−n)?
How to merge dataframes over multiple columns and split rows?
How to create a Sympy IndexedBase using a custom subclass of Symbol?
Removing dynamically an element from a list
Returning boolean if set is empty
Can variables be decorated?
Fast(est) exponentiation of numpy 3D matrix
Removing an element from a list based on a condition
Printing elements of dictionary line by line
Matplotlib does not display the hatch of a patch in a legend
Python win32com - Class not registered error