BeautifulSoup4 packaged into a command line tool.

For now, this tool just parses HTML tag soup with BeautifulSoup4 and writes out the result. This helps, for example, to recover the structure of a Netscape bookmarks file, which omits many closing tags.

Installation

From the Python Package Index (PyPI):

(sudo) pip install beautifulsoup4-slurp

or from GitHub:

git clone https://github.com/peterhil/slurp.git
cd slurp
(sudo) python setup.py install

Usage

Show help:

slurp -h

Parse with html5lib and pretty-print to stdout:

slurp -i bookmarks.html -p 'html5lib' -y

Parse with lxml and pretty-print to stdout:

slurp -i bookmarks.html -p 'lxml' -y

Write pretty-printed output to a file:

slurp -y -i bookmarks.html -o bookmarks_soup.html

Pipe into slurp:

echo '<title>Slurp!</title><p><a href="https://github.com/peterhil/slurp/">Github</a>' | slurp -y
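
For comparison, the pipe example above can be approximated directly in Python. This is only a minimal sketch: it assumes the beautifulsoup4 package is installed, it uses the standard library's html.parser backend rather than html5lib or lxml, and the helper name prettify_html is hypothetical. slurp's own implementation may differ.

```python
# Assumption: the third-party package is installed (pip install beautifulsoup4).
from bs4 import BeautifulSoup


def prettify_html(markup: str, parser: str = "html.parser") -> str:
    """Parse tag soup and return an indented rendering with closing tags added.

    The function name and the default parser are illustrative choices,
    not part of slurp itself.
    """
    soup = BeautifulSoup(markup, parser)
    return soup.prettify()


if __name__ == "__main__":
    # Same markup as in the pipe example; note the missing </p>.
    html = '<title>Slurp!</title><p><a href="https://github.com/peterhil/slurp/">Github</a>'
    print(prettify_html(html))
```

The printed result should contain the closing tag for the paragraph that the input left open, which is the kind of repair that makes bookmark files easier to process. Passing 'html5lib' or 'lxml' as the parser argument requires installing those packages separately, and each parser may repair broken markup slightly differently.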

License

For the full copyright and license information, please view the LICENSE file that was distributed with this source code.
