Piculet is a module for extracting data from XML or HTML documents using XPath queries. It consists of a single source file with no dependencies other than the standard library, which makes it very easy to integrate into applications. It also provides a command line interface.
Piculet has been tested with Python 3.5+ and compatible versions of PyPy.
You can install the latest version using
pip install piculet
Installing Piculet creates a script named
piculet which can be used
to invoke the command line interface:
$ piculet -h usage: piculet [-h] [--version] [--html] (-s SPEC | --h2x)
$ cat shining.html | piculet -s movie.json
The documentation is available on: https://piculet.tekir.org/
The source code can be obtained from: https://github.com/uyar/piculet
- Data extraction
- Lower-level functions