How do I use mutate on DataFrame in SparkR?

I am trying to use this method to explode a field in a dataframe using SparkR. My code is:

Sys.setenv(SPARK_HOME="/usr/hdp/2.6.0.3-8/spark")
.libPaths(c(file.path(Sys.getenv("SPARK_HOME"), "R", "lib"), .libPaths()))
library(SparkR)
sc <- sparkR.init(master="local")
hc <- sparkRHive.init(sc)
df <- sql(hc, "SELECT * FROM tweetsorc5")
library(tidyverse)
dat <- df %>%   mutate(a=explode(df$user)) %>%  select("created_at", "a.utc_offset")

but i get the error:

Error in UseMethod("mutate_") : 
  no applicable method for 'mutate_' applied to an object of class "DataFrame"

I cannot find any help for this.

Solution

It is not possible. SparkDataFrame doesn't implement the same interface as data.frame.

If you want to use dplyr with Spark you should use sparklyr, not SparkR.

With SparkR use SparkR::withColumn:

withColumn(df, "a", explode(df$user))

or SparkR::mutate.

Rcpp Rf_warningcall compiler warnings
Modify the name of factor variables in lm function(summary function)
Extreme value analysis and quantile estimation using log Pearson type 3 (Pearson III) distribution - R vs Python
How to hide NAs when using xlsx::saveWorkbook?
How do I retrieve a simple numeric value from a named numeric vector in R?
Matching pair-wise columns from left to right across rows in one dataframe to another dataframe and adding new columns with matching values
Income to outcome flow chart in Sankey plotly R
color mapping in geom_conn_bundle not showing correctly
Print R package startup message AFTER automatic package conflict messages instead of before
Summing a set of R dataframe rows (column-wise), while retaining the first n columns
Added variable / partial regression plots for groups in an interaction?
how to make a topoplot in R with coordinates variable distribution
List of all functions in base R?
Plotting multiple plots for different initial conditions in one graph
Printing repetitively on the same line in R
Generating UI/Server based on initial selection
Subset dataframe based on pickerInput
How to let user pick the data in R-shiny?
Couldn't show my simple bar charts separately on Shiny R dashboardBody
How to programmatically filter contents of a second shiny app displayed via iframe
How to select specific interesting groups for the boxplot in R Shiny app?
Crosstable and Plot grouping with reactive values
Is there a way to make multiple Shiny picker inputs where the selections must be disjoint?
Delay/avoid duplication of shiny server side functions until after credentials
Predictions only returns value "1"
How to display a busy indicator in a shiny app?
Append doesn't work when writing to CSV in R
Changing the start date of a gantt chart in DiagrammeR
Check for installed packages before running install.packages()
Compare two columns element-wise