Search code examples
rdplyrtidyversemagrittr

Printing intermediate results without breaking pipeline in tidyverse


Is there a command to add to tidyverse pipelines that does not break the flow, but produces some side effect, like printing something out. The usecase I have in mind is something like this. In case of a pipeline

data %>%
  mutate(new_var = <some time consuming operation>) %>%
  mutate(new_var2 = <some other time consuming operation>) %>%
  ...

I would like to add some command to the pipeline that would not modify the end result, but would print out some progress or the state of things. Maybe something like this:

data %>%
  mutate(new_var = <some time consuming operation>) %>%
  command_x(print("first operation done")) %>%
  mutate(new_var2 = <some other time consuming operation>) %>%
  ...

Does there exist such command_x already?


Solution

  • You could easily write your own function

    pass_through <- function(data, fun) {fun(data); data}
    

    And use it like

    mtcars %>% pass_through(. %>% ncol %>% print) %>% nrow
    

    Here we use the . %>% syntax to create an anonymous function. You could also write your own more explicitly with

    mtcars %>% pass_through(function(x) print(ncol(x))) %>% nrow