Search code examples
regexrdataframeword-boundary

detecting word boundary with regex in data frame in R


I have a data.frame named all that has a column of factors, these factors include "word","nonword" and some others. My goal is to select only the rows that have the factor value "word".

My solution grep("\bword\b",all[,5]) returns nothing.

How come word boundaries are not recognized?


Solution

  • In R, you need two times \:

    grep("\\bword\\b", all[5])
    

    Alternative solutions:

    grep("^word$", all[5])
    
    which(all[5] == "word")