c:[["$","$Ld",null,{"page":"page"}],["$","$Le",null,{}],["$","div",null,{"className":"container shadow-4 rounded mt-3 mb-3 p-3 border border-primary","children":[[["$","$Lf","0",{"href":"../apache-spark","className":"badge badge-primary m-1","children":"apache-spark"}],["$","$Lf","1",{"href":"../pyspark","className":"badge badge-primary m-1","children":"pyspark"}],["$","$Lf","2",{"href":"../apache-spark-sql","className":"badge badge-primary m-1","children":"apache-spark-sql"}],["$","$Lf","3",{"href":"../nlp","className":"badge badge-primary m-1","children":"nlp"}],["$","$Lf","4",{"href":"../punctuation","className":"badge badge-primary m-1","children":"punctuation"}]],["$","h1",null,{"className":"h1","dangerouslySetInnerHTML":{"__html":"How to remove punctuation from a text?"}}],["$","hr",null,{}],["$","div",null,{"dangerouslySetInnerHTML":{"__html":"

I have a very big data set . I am wondering How I can remove all punctuation from a big dataset in pyspark? For example , . & \\ | - _

\n"}}],["$","hr",null,{}],["$","div",null,{"className":"h3","children":["Solution ",["$","li",null,{"className":"h3 fa fa-arrow-down"}]]}],["$","hr",null,{}],["$","div",null,{"dangerouslySetInnerHTML":{"__html":"

You can use regexp_replace to remove the punctuations you specified using a regex expression:

import pyspark.sql.functions as F\n\ndf2 = df.select(\n    [F.regexp_replace(col, r',|\\.|&|\\\\|\\||-|_', '').alias(col) for col in df.columns]\n)\n

\n"}}],["$","br",null,{}],["$","ul",null,{"className":"list-group","children":[]}],["$","br",null,{}],["$","$L10",null,{}],["$","br",null,{}]]}],["$","$L11",null,{}],["$","$L12",null,{}],["$","$L13",null,{}],["$","div",null,{"className":"container ftr1","children":[["$","hr",null,{}],["$","footer",null,{"className":"bg-body-tertiary text-center text-lg-start","children":["$","div",null,{"className":"text-center p-3","children":["Content Source :",["$","a",null,{"className":"text-body","href":"https://stackoverflow.com","rel":"nofollow noreferrer noopener","id":"ftlk","style":{"color":"black"},"children":"Stackoverflow"}]," | ",["$","$Lf",null,{"href":"/privacy-policy","style":{"color":"blue !important"},"children":"Privacy Policy"}]," | ",["$","$Lf",null,{"href":"/terms-and-condition","style":{"color":"blue !important"},"children":"Terms and Condition"}]," | ",["$","$Lf",null,{"href":"/contact-us","style":{"color":"blue !important"},"children":"Contact Us"}]]}]}]]}],["$","$L14",null,{}],["$","$L15",null,{}]]