python regex pandas string text-processing

Locate all non-number elements in a pandas.Series

For a pd.Series with mixed strings and numbers (integers and floats), I need to identify all non-number elements. For example

data = pd.Series(['1','wrong value','2.5','-3000','>=50','not applicable', '<40.5'])

I want it to return the following elements:

wrong value
>=50
not applicable
<40.5

What I'm currently doing is:

data[~data.str.replace(r'[\.\-]','').str.isnumeric()]

That is, because .str.isnumeric() will give False to decimal points and negative signs, I had to mask "." and "-" first and then find out the non-numeric fields.

Is there a better way of doing this? Or is there any potential problem/warning with my current method? Thanks!!

Solution

Use pd.to_numeric to flag them

data[pd.to_numeric(data, errors='coerce').isna()]

Out[1159]:
1       wrong value
4              >=50
5    not applicable
6             <40.5
dtype: object

How to pick just one item from a generator?
Python: Get unbound class method
global frame vs. stack frame
How to generate a snapshot of a field in a time step with VTK and Python
How to read the first letter from the last line in a txt file in python
How to control scientific notation in matplotlib?
Streamlit multiselect, if I don't select anything, doesn't show data frame
How to extend a class in python?
Is there a standard location to store function cache files in Python?
C++ function (Vectors) wrapped with Cython being around 4 times slower than equivalent Cython function (NumPy Arrays MemoryViews), with large arrays
Error in anyjson setup command: use_2to3 is invalid
Send paid media aiogram 3.10
Is there a workaround for adding Microsoft Word footnotes dynamically in Python?
Training a Keras model to identify leap years
Overload a method based on init variables
How do I create a constant in Python?
What is gettext_lazy on django for?
Pydantic - parse a list of objects from YAML configuration file
How to print stdout excerpt in IPython
What is the difference between Spyder and Jupyter?
How do I create a multiline plot using seaborn?
How to read the request body using orjson library in FastAPI?
Does iPython have built-in support for viewing a variable in pager?
cropping the image by removing the white spaces
Verbose level with argparse and multiple -v options
How to return data in JSON format using FastAPI?
Rounding a rational number to the nearest integer, with half-up
Python inspector ignores property return hint when using TypeVar
How to highlight values per column in Polars
Create arbitrary multidimensional zeros array