Tags: r, tidyverse, sparklyr, databricks

Convert sql data table to sparklyr dataframe


I uploaded data.csv with Microsoft Azure Storage Explorer, then copied the URL and created a table in Databricks:

%sql 
DROP TABLE IF EXISTS data; 
CREATE TABLE IF NOT EXISTS data 
USING CSV 
OPTIONS (header "true", inferSchema "true") 
LOCATION "url/data.csv" 

Now I want to use sparklyr to manipulate "data".

How should I convert the above table to a sparklyr dataframe so I can use the full potential of sparklyr?


Solution

  • First you must initialise your sparklyr session as follows:

sc <- spark_connect(method = 'databricks')
    

    you can then read directly from your SQL tables using:

sdf_sql(sc, 'SELECT * FROM ...')
    

    and then perform all of the usual sparklyr/dplyr work as normal.
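    Putting the steps above together, a minimal sketch (assuming the table created in the question is named `data`; the column names used in your own pipeline would differ):

    ```r
    library(sparklyr)
    library(dplyr)

    # Connect to the Spark session of the current Databricks cluster
    sc <- spark_connect(method = "databricks")

    # Materialise the SQL table as a sparklyr (tbl_spark) data frame
    data_sdf <- sdf_sql(sc, "SELECT * FROM data")

    # Alternatively, reference the registered table lazily without a query:
    # data_sdf <- tbl(sc, "data")

    # dplyr verbs are translated to Spark SQL and executed on the cluster
    data_sdf %>%
      count()
    ```

    `sdf_sql()` runs the query eagerly and returns a Spark data frame, while `tbl(sc, "data")` builds a lazy reference that is only evaluated when you collect or compute; either works as input to the usual dplyr pipeline.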

    Note that Databricks clusters do not come pre-loaded with sparklyr, as Databricks steers you towards the SparkR API for interacting with your data instead. If you wish to use the sparklyr API, you must install and load sparklyr each time you start the cluster.
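
    For example, in a notebook cell run once after each cluster start (or move these lines into a cluster init script so they run automatically):

    ```r
    # sparklyr is not pre-installed on the cluster, so install it first
    install.packages("sparklyr")

    # Then load it and connect to the cluster's Spark session
    library(sparklyr)
    sc <- spark_connect(method = "databricks")
    ```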