The bash file I used to launch the training looks like this:
CUDA_VISIBLE_DEVICES=3,4 python -m torch.distributed.launch \
--nproc_per_node=2 train.py \
--batch_size 6 \
--other_args
I found that the batch size of the tensors on each GPU is actually batch_size / num_of_gpus = 6 / 2 = 3.
When I initialize my network, I need to know the batch size on each GPU.
(P.S. at this stage I can't use input_tensor.shape to get the size of the batch dimension, since no data has been fed in yet.)
Somehow I could not find where PyTorch stores the parameter --nproc_per_node.
So how can I find out how many GPUs are being used, without passing it manually via --other_args?
I think you are looking for torch.distributed.get_world_size() - this will tell you how many processes were created. When you launch on a single node with torch.distributed.launch, that number equals --nproc_per_node.
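For reference, here is a minimal sketch of how this could look inside train.py to derive the per-GPU batch size before any data is fed to the network; the argument names (--batch_size, --local_rank) and the nccl backend are assumptions matching the launch script above, not code from the original post:

import argparse
import torch
import torch.distributed as dist

parser = argparse.ArgumentParser()
parser.add_argument("--batch_size", type=int, default=6)  # global batch size from the launch script
parser.add_argument("--local_rank", type=int, default=0)  # filled in by torch.distributed.launch
args = parser.parse_args()

# torch.distributed.launch sets MASTER_ADDR, MASTER_PORT, RANK and WORLD_SIZE,
# so the default env:// initialization works here.
dist.init_process_group(backend="nccl")
torch.cuda.set_device(args.local_rank)

world_size = dist.get_world_size()                  # number of processes, e.g. 2
per_gpu_batch_size = args.batch_size // world_size  # e.g. 6 // 2 = 3

# per_gpu_batch_size can now be used when building the network.

Note that get_world_size() counts processes across all nodes, so on a single machine it matches --nproc_per_node.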