df %>%
  tbl_summary(by = Field)
It seems that when a variable has many repeated values, this package gives strange results. It reports n (%) for each value of the variable instead of the median (IQR) and p-values. Is there a way to get around this?
I tried everything possible.
The issue is that by default tbl_summary uses the categorical summary type for numerics with fewer than 10 unique levels. From the docs (?tbl_summary):
... numeric variables with fewer than 10 unique levels default to type categorical.
To fix that you have to explicitly set the type to be continuous:
To change a numeric variable to continuous that defaulted to categorical, use type = list(varname ~ "continuous")
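Before overriding the type, it can help to confirm that the column really falls under this rule. The following is a base R sketch, not gtsummary's internal logic; the vector x is a made-up stand-in for your own column.

```r
# Hypothetical numeric column drawn from the values 1 to 6
set.seed(123)
x <- sample(1:6, 100, replace = TRUE)

# Count the distinct non-missing values
n_unique <- length(unique(na.omit(x)))

# TRUE means the column has fewer than 10 distinct values, so per the
# documentation quoted above it would default to the categorical type
n_unique < 10
```

If this returns TRUE for a variable you consider continuous, that variable is a candidate for the explicit type override.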
Using some fake example data:
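library(gtsummary)

set.seed(123)

# Alphonso takes at most six distinct values, so it defaults to categorical
df <- data.frame(
  Alphonso = sample(1:6, 100, replace = TRUE),
  Field = sample(c("field1", "field2"), 100, replace = TRUE)
)

# Default behaviour: Alphonso is summarized as n (%) per level
df %>%
  tbl_summary(by = Field)

# Explicitly continuous: Alphonso is summarized by its median and interquartile range
df %>%
  tbl_summary(by = Field, type = list(Alphonso ~ "continuous"))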
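The question also asks about p-values, which tbl_summary does not report on its own. Here is a minimal sketch, assuming the same fake data as above and that the default test chosen by gtsummary's add_p() suits your data. Depending on the gtsummary version, add_p() may ask you to install extra packages. The object names tbl and tbl_p are only for illustration.

```r
library(gtsummary)

# Same fake example data as above
set.seed(123)
df <- data.frame(
  Alphonso = sample(1:6, 100, replace = TRUE),
  Field = sample(c("field1", "field2"), 100, replace = TRUE)
)

# Build the summary with Alphonso forced to the continuous type
tbl <- tbl_summary(df, by = Field, type = list(Alphonso ~ "continuous"))

# Add a p-value column comparing the groups defined by Field
tbl_p <- add_p(tbl)

tbl_p
```

The test used can be changed through the test argument of add_p() if the default is not appropriate.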