Search code examples
pythonregexsplitregex-group

Regex match group of words and parenthesis after comma


I've got an string like this:

aaaa (aa), bbbb, b2b2b2 (bbbb), ccc, c33 (ccc), ddd, d222, d33 (ddd)

And I want to split it to extract every group after the comma followed by the parentheses:

aaaa (aa)
bbbb, b2b2b2 (bbbb)
ccc, c33 (ccc)
ddd, d222, d33 (ddd)

I tried several regex expressions, even the Parse library, but without success

Any idea would be welcomed!!

Thank you in advance!!


Solution

  • Using Lookbehind

    Ex:

    import re
    
    s = "aaaa (aa), bbbb, b2b2b2 (bbbb), ccc, c33 (ccc), ddd, d222, d33 (ddd)"
    print(re.split(r"(?<=\)),", s))
    

    Output:

    ['aaaa (aa)', ' bbbb, b2b2b2 (bbbb)', ' ccc, c33 (ccc)', ' ddd, d222, d33 (ddd)']