round results in aggregate table results (pyspark)

Hello how would I round this content of table outputted by this code.

from pyspark.sql.functions import *
exprs = {x: "sum" for x in data2.columns[:4]}
data2.groupBy("Species").agg(exprs).show()

I've tried

round(data2.groupBy("Species").agg(exprs),2).show() #not ok

data2.groupBy("Species").agg(exprs).show().round(2) # not ok

Solution

round only works on one column. So you have to call it for each column, e.g.

agg_cols = data2.columns[:4]
exprs = [sum(col(x)).alias(x) for x in agg_cols]
aggregated_df = data2.groupBy("Species").agg(*exprs)
aggregated_df.select(col("Species"), *[round(c, 2) for c in agg_cols]).show()

Spark sending LIMIT to SQL Server on display function
Start of the week on Monday in Spark
Spark dataframe not adding columns with null values
Spark-delta not working when upgrade to spark 3.5.0 and delta 3.1.0
how to mask hash users into random values [email protected] in azure databricks
Passing dataframe column as an argument to a function inpyspark
Casting RDD to a different type (from float64 to double)
Removing NULL items from PySpark arrays
Spark Row object instantiated differently from overloaded prototypes?
How to copy and convert parquet files to csv
Unable to write Data from Kafka to Delta Live Table in Databricks
Spark/pyspark on same version but "py4j.Py4JException: Constructor org.apache.spark.api.python.PythonFunction does not exist"
How to resolve the following AWS Glue error while writing to Redshift using Spark: "ORA-01722: invalid number"?
Convert PySpark column from strings to lists
Spark getItem shortcut
Pyspark error when converting boolean column to pandas
Json & PySpark - read value from a struct that may be null
Not able to access zip/exe from ADLSv2 into synapse
Spark: fill spec value between flag values
Check whether boolean column contains only True values
Task stuck at "GET RESULT" from Join -> groupby in Spark (sedona)
Pyspark - how to initialize common DataFrameReader options separately?
mypy type checking shows error when a variable gets dynamically allocated
Open, High, Low, Close, Volume in PySpark using tick data
Dataframe.write() produces csv file on single node jobs cluster, but not on 2+1 nodes cluster
Making a series montonically decreasing in pyspark
Internals of worker/executor usage during coalesce/repartition
Why is my PySpark row_number column messed up when applying a schema?
Split a datafarme column based on another column - Column is not iterable
Issue with Multiple Spark Structured Streaming Jobs Consuming Same Kafka Topic