python jquery css-selectors html-parsing

jquery-like HTML parsing in Python?

Is there any way in Python that would allow me to parse an HTML document similar to what jQuery does?

i.e. I'd like to be able to use CSS selectors syntax to grab an arbitrary set of nodes from the document, read their content/attributes, etc.

Solution

If you are fluent with BeautifulSoup, you could just add soupselect to your libs.
Soupselect is a CSS selector extension for BeautifulSoup.

Usage:

from bs4 import BeautifulSoup as Soup
from soupselect import select
import urllib
soup = Soup(urllib.urlopen('http://slashdot.org/'))
select(soup, 'div.title h3')

    [<h3><span><a href='//science.slashdot.org/'>Science</a>:</span></h3>,
     <h3><a href='//slashdot.org/articles/07/02/28/0120220.shtml'>Star Trek</h3>,
    ..]

How to pick just one item from a generator?
Python: Get unbound class method
global frame vs. stack frame
How to generate a snapshot of a field in a time step with VTK and Python
How to read the first letter from the last line in a txt file in python
How to control scientific notation in matplotlib?
Streamlit multiselect, if I don't select anything, doesn't show data frame
How to extend a class in python?
Is there a standard location to store function cache files in Python?
C++ function (Vectors) wrapped with Cython being around 4 times slower than equivalent Cython function (NumPy Arrays MemoryViews), with large arrays
Error in anyjson setup command: use_2to3 is invalid
Send paid media aiogram 3.10
Is there a workaround for adding Microsoft Word footnotes dynamically in Python?
Training a Keras model to identify leap years
Overload a method based on init variables
How do I create a constant in Python?
What is gettext_lazy on django for?
Pydantic - parse a list of objects from YAML configuration file
How to print stdout excerpt in IPython
What is the difference between Spyder and Jupyter?
How do I create a multiline plot using seaborn?
How to read the request body using orjson library in FastAPI?
Does iPython have built-in support for viewing a variable in pager?
cropping the image by removing the white spaces
Verbose level with argparse and multiple -v options
How to return data in JSON format using FastAPI?
Rounding a rational number to the nearest integer, with half-up
Python inspector ignores property return hint when using TypeVar
How to highlight values per column in Polars
Create arbitrary multidimensional zeros array