python python-3.x dataframe count python-polars

Compute the ratio of the number of rows where A=True to the number of rows where A=False

I have a Polars dataframe:

import numpy as np
import polars as pl

df = pl.DataFrame(
    {
        "nrs": [1, 2, 3, None, 5],
        "names": ["foo", "ham", "spam", "egg", None],
        "random": np.random.rand(5),
        "A": [True, True, False, False, False],
    }
)

How can I compute the ratio of the number of rows where A == True to the number of rows where A == False? Note that A is always True or False. I found a solution, but it seems a bit clunky:

ntrue = df.filter(pl.col('A')==1).shape[0]
ratio = ntrue/(df.shape[0]-ntrue)

Solution

You can use Polars' expression API as follows.

df.select(pl.col("A").sum() / pl.col("A").not_().sum()).item()

Summing works because A is a boolean column, so each True counts as 1. If your column is not boolean, replace pl.col("A") with a boolean expression that encodes the condition you care about.
