Removing retweets from data frame in R based on text column

I pulled tweets from twitter using the academictwitter package. I would now like to remove all retweets = tweets starting with "RT" in the first column "text" (e.g. third row). You can download a similar data frame from github including tweets from Trump: https://github.com/cbail/cbail.github.io/blob/master/Trump_Tweets.Rdata

Except my data frame has no column called "is_retweet", which makes it more difficult.

The output from my data frame looks like this (I have removed some redundant columns to make it clearer):

Thank you in advance for any suggestions

Solution

You can use regular expressions to figure out which rows start with 'RT'. If your data is in a data frame called tweets, maybe something like this?

tweets[grepl("^(?!RT)", tweets$text, perl = TRUE),]

Or if you're using tidyverse:

tweets %>% 
  filter(grepl("^(?!RT)", text, perl = TRUE))

Calculate mean of matrices having different dimensions
check if two columns have a one-to-one relationship in R
How to extract Std.Dev from VarCorr glmmTMB
Determine level of nesting in R?
How do you print to stderr in R?
How to plot China map with South China Sea in base R
Get column and row position of nth element in a matrix
Is there any authoritative documentation on R release nicknames?
R Glassdoor Web Scraping
Issue with graticule across 180° for several country/territory EEZs
Separating grouped layers in a raster stack in terra
How can I use group_by and mutate to perform a subtraction calculation with specific groupings? Time 0 minus Time X for all groups
How to directly open .R data containing data frame code in R?
Way to web-scrape a popular eSport website using R?
Variance calculation warning: longer object length is not a multiple
gratia::draw(): "'length.out' must be a non-negative number"
Using Swift as custom engine in knitr and including all previous content
convert source target value dataframe into a correlation matrix
ggplot2 plotting a 100% stacked area chart
Use string as formula for ipwtm function?
interpolarization within groups with NA
Multi-row x-axis labels in ggplot line chart
How to do a SOAP request for EUR-Lex API with R?
Make an alluvial plot
Parameters for the ggplot theme function about legend.axis.line
Error handling for tidyr hoist in API call dplyr pipe when column type changes between calls
calculate distance between regression line and datapoint
Colour picker input not updating output in R Shiny
Order() in R - argument is missing, with no default
How to plot geom_bar without showing multiple lines