r, regex, data-manipulation

How do I get every character after a segment in my column in R?


A value in the column looks like this:

Fake City, TX Court House

I would like to return: Court House

This is what I have from another post. I used it to extract the city piece:

gsub(",.*$", "", COLUMN$Entity)

My gut tells me that the ",.*$" pattern will have to change, but I'm not sure what to change it to.

I'm new to regular expressions. Thank you, guys.


Solution

  • Please try

    # Input string
    input_string <- 'Fake City, TX Court House'
    
    # Remove everything through the comma, the space after it,
    # the state code (\\w+), and the space that follows
    extracted_text <- gsub('.*\\,\\s\\w+\\s', '', input_string)
    
    # Print the result
    extracted_text
    
    [1] "Court House"
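
If you want to run this over the whole column rather than a single string, something like the sketch below may help. It assumes every value has the form "City, ST Rest of name" with a one-word state code. The sample rows and the new column name Place are made up for illustration; COLUMN and Entity are taken from the question. It uses sub() with a pattern anchored at the start of the string, which is a slight variation on the pattern above, not the only way to do it.

```r
# Hypothetical sample data; only the first row comes from the question
COLUMN <- data.frame(
  Entity = c("Fake City, TX Court House", "Other Town, CA Public Library"),
  stringsAsFactors = FALSE
)

# Pattern, piece by piece:
#   ^[^,]*,   everything up to and including the first comma (the city)
#   \\s*      optional whitespace after the comma
#   \\w+      the state code (assumed to be a single word)
#   \\s+      the whitespace before the part we want to keep
pattern <- "^[^,]*,\\s*\\w+\\s+"

# sub() replaces only the first match in each element;
# values that do not match the pattern are returned unchanged
COLUMN$Place <- sub(pattern, "", COLUMN$Entity)

COLUMN$Place
```

Because sub() and gsub() are vectorized, no loop is needed. Rows that do not follow the assumed layout come back unchanged, so it is worth checking for those afterwards.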