htmllib

htmllib — A parser for HTML documents

Deprecated since version 2.6: The htmllib module has been removed in Python 3. Use HTMLParser instead in Python 2, and the equivalent, html.parser, in Python 3.

This module defines a class which can serve as a base for parsing text files formatted in the HyperText Mark-up Language (HTML). The class is not directly concerned with I/O — it must be provided with input in string form via a method, and makes calls to methods of a “formatter” object in order to produce output. The 登录查看完整内容