pattern to extract linkedin username from text

I am trying to extract linkedin url that is written in this format,

text = "patra 12 EXPERIENCE in / in/sambhu-patra-49b4759/ 2020 - Now O Skin Curate Research Pvt Ltd Embedded System Developer, WB 0 /bindasssambhul O SKILLS LANGUAGES Arduino English Raspberry Pi Movidius Hindi Bengali ICS Intel Compute Stick PCB Design Python UI Design using Tkinter HOBBIES HTML iti CSS G JavaScript JQuery IOT\n"


pattern = \/?in\/.+\/?\s+

I need to extract this in/sambhu-patra-49b255129/ from the any noisy text like the one above,

It's a linkedin url written in short form.

My pattern is not working

Solution

You can use

m = re.search(r'\bin\s*/\s*(\S+)', text)
if m:
  print(m.group(1))

See the regex demo.

Details:

\b - word boundary
in - a preposition in
\s* - zero or more whitespaces
/ - a / char
\s* - zero or more whitespaces
(\S+) - Capturing group 1: any one or more whitespaces.

Find the first row in a data frame that satisfies a condition and delete everything above?
Any other options besides the traditional CLD bar graph?
R data.table update join by reference the, but updating the RIGHT table
Problems with installation R packages
R correlation: I'm getting inconsistent correlation results with cor() function
Convert a matrix in R into a upper triangular/lower triangular matrix with those corresponding entries
Printing text in ggplot
Making Replicable Layout Matrices for R Plots
Reading in a data file with staggered column names into R
Start a PowerShell script in R via system2()
Issues integrating shinychat into a modular R Shiny app
Web scraping on tipti page that requires login
barplot multiple aggregation
plot from sankeyNetwork in networkD3 does not show output (issue is not number of unique nodes)
"Target position can only be set for new windows" in chromote in R
Extract the correct data type in a PDF table
Time conversion in R
Comparing the values of a certain number previous rows with the current row
Run a single test function in R's testthat
rpart package installation in R
An efficient way to assign value based on a min-max range and category
Change output of the `purrr::map` function
osmdata_sf returns failed to perform HTTP request curl::curl_fetch_memory() error in R?
Comparing nls() to nls2() - what am I doing wrong
How to add "variables grid" below ggplot
How can I use predefined code snippets outside of code chunks in Quarto within RStudio/Posit?
Wrap text for collapse rows in KableExtra for a long table in R
Implementation of Breusch-Pagan test for random effects in plm with unbalanced panels
Finding a value of a dataset in different ones
Replicate matrix