forked from html5lib/html5lib-python
Standards-compliant library for parsing and serializing HTML documents and fragments in Python
hroncok/html5lib-python
html5lib is a pure-Python library for parsing HTML. It is designed to conform to the HTML 5 specification, which has formalized the error handling algorithms of popular web browsers.

= Installation =

html5lib is packaged with distutils. To install it, use:

    $ python setup.py install

= Tests =

You may wish to check that your installation succeeded by running the test suite. All the tests can be run by invoking runtests.py in the tests/ directory.

= Usage =

Simple usage follows this pattern:

    import html5lib
    f = open("mydocument.html")
    parser = html5lib.HTMLParser()
    document = parser.parse(f)

More documentation is available in the docstrings or from http://code.google.com/p/html5lib/wiki/UserDocumentation

= Bugs =

Please report any bugs on the issue tracker: http://code.google.com/p/html5lib/issues/list

= Get Involved =

Contributions to code or documentation are actively encouraged. Submit patches to the issue tracker or discuss changes on IRC in the #whatwg channel on freenode.net
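The pattern in the Usage section can be wrapped in a small helper that also closes the file once parsing is done. This is only a sketch: the function name parse_html_file is hypothetical, and opening the file in binary mode (so the parser sees raw bytes) is an assumption rather than something the README prescribes. Only html5lib.HTMLParser() and parser.parse() come from the README itself.

```python
import html5lib  # third-party; install it as described under Installation


def parse_html_file(path):
    """Parse the HTML file at `path` and return the resulting document tree.

    `parse_html_file` is a hypothetical helper name, not part of html5lib.
    """
    # Assumption: open in binary mode and let the parser work from raw bytes.
    # The `with` block closes the file even if parsing raises.
    with open(path, "rb") as f:
        parser = html5lib.HTMLParser()
        return parser.parse(f)
```

Calling parse_html_file("mydocument.html") returns the same kind of document object as the snippet in the Usage section; the type of that object depends on the tree builder the installed html5lib version uses by default.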