Tags: numpy, scikit-learn, nan, idioms, mean-square-error

Clean a NumPy array of NaNs while deleting the corresponding entries in another array


I have two numpy arrays, one of which contains about 1% NaNs.

import numpy as np

a = np.array([-2, 5, np.nan, 6])
b = np.array([2, 3, 1, 0])

I'd like to compute the mean squared error of a and b using sklearn's mean_squared_error.

So my question is: what's the Pythonic way to remove all NaNs from a while deleting the corresponding entries from b, as efficiently as possible?


Solution

  • You can simply use vanilla NumPy's np.nanmean for this purpose:

    In [136]: np.nanmean((a-b)**2)
    Out[136]: 18.666666666666668
    

    If this didn't exist, or you really wanted to use the sklearn method, you could build a boolean mask that filters out the NaNs and apply it to both arrays:

    In [148]: mask = ~np.isnan(a)
    
    In [149]: mean_squared_error(a[mask], b[mask])
    Out[149]: 18.666666666666668
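    For reference, here is a minimal self-contained sketch that puts both approaches side by side, assuming NumPy and scikit-learn are installed and using the arrays from the question:

        import numpy as np
        from sklearn.metrics import mean_squared_error

        a = np.array([-2, 5, np.nan, 6], dtype=float)
        b = np.array([2, 3, 1, 0], dtype=float)

        # Approach 1: plain NumPy, ignoring NaNs in the squared differences
        mse_numpy = np.nanmean((a - b) ** 2)

        # Approach 2: drop the NaN positions from both arrays, then use sklearn
        mask = ~np.isnan(a)  # True where a holds a valid (non-NaN) value
        mse_sklearn = mean_squared_error(a[mask], b[mask])

        print(mse_numpy, mse_sklearn)  # both print 18.666666666666668

    The second approach is what the question literally asks for: the same boolean mask indexes both arrays, so the corresponding entries of b are dropped along with the NaNs in a.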