Pandas: nan->None

pandas.DataFrame.to_dict converts nan to nan and null to None. As explained in Python comparison ignoring nan this is sometimes suboptimal.

Is there a way to convert all nans to None? (either in pandas or later on in Python)

E.g.,

>>> df = pd.DataFrame({"a":[1,None],"b":[None,"foo"]})
>>> df
     a     b
0  1.0  None
1  NaN   foo
>>> df.to_dict()
{'a': {0: 1.0, 1: nan}, 'b': {0: None, 1: 'foo'}}

I want

{'a': {0: 1.0, 1: None}, 'b': {0: None, 1: 'foo'}}

instead.

Solution

import pandas as pd

df = pd.DataFrame({"a":[1,None],"b":[None,"foo"]})
df.where((pd.notnull(df)), None)
Out[850]: 
      a     b
0     1  None
1  None   foo
df.where((pd.notnull(df)), None).to_dict()
Out[851]: {'a': {0: 1.0, 1: None}, 'b': {0: None, 1: 'foo'}}

Implementation of Okapi BM25 in python
Mathematical explanation of Leetcode question: Container With Most Water
AttributeError: _nanosecond when updating a datetime in transaction
How do I align gridlines for two y-axis scales?
''A wait of x seconds is required'' returns by telethon( python library for telegram) how can I get rid of it?
Can't install open3d libraries (Error:Could not find a version that satisfies the requirement open3d)
Writing multiple dataframes to multiple sheets in an Excel file
Is there any equivalent of SAS merging in python?
Generating low discrepancy quasi-random sequences in python/numpy/scipy?
I want to convert the categorical variable to numerical in Python
Python winreg is updating SOMETHING, but not the Windows registry
What does it mean to "downcast" a numeric type in pandas?
How to stream the interactive shell of a remote program to stdout of a running c++ program that launched the remote program (using BSUB -I)
Python Pandas: Counting the amount of subsequent value and assign a name if conditions are met
Behavior of object.__new__ Python dunder. What is happening under the hood?
How to execute raw SQL in Flask-SQLAlchemy app
Python Tkinter structured in a Class: Can methods be in an independent file?
How to extract some rows under specific condition in a dataframe (Python)?
Using scipy.signal.stft() vs scipy.signal.ShortTimeFFT.stft()
Calculate 6 months forward date from a dataframe of dates
Dtypes Data Frame Assigns nvarchar(MAX) by default
Using Postgres/Flask, how might I query for the next occurrence of a scheduled task when days/hours/minutes are stored as ints in their own columns?
Error: _tkinter.TclError: can't invoke "wm" command: application has been destroyed
pyodbc cursor.fetchall() is returning "strange" values
f2py - understanding how to pass an integer (and avoiding message Deprecated NumPy 1.25.)
" An operation was attempted on something that is not a socket" when I try to add SSL authentication
Web Api to extract information from website
Elegant way to remove fields from nested dictionaries
Summing up arrays without doubles
Pyspark - how to initialize common DataFrameReader options separately?