Package htmlcleaner: Information
Danger alert: Package removed from sisyphus repository
Removed in the task: #301906
Package removed: Igor Vlasenko
Deletion date: June 12, 2022
Message: java11migration
Package removed: Igor Vlasenko
Deletion date: June 12, 2022
Message: java11migration
Source package: htmlcleaner
Version: 2.2.1-alt1_15jpp8
Build time: Sep 30, 2019, 03:25 PM in the task #238366
Category: Development/Java
Report package bugHome page: http://htmlcleaner.sourceforge.net/
License: BSD
Summary: HTML parser written in Java
Description:
HtmlCleaner is open-source HTML parser written in Java. HTML found on Web is usually dirty, ill-formed and unsuitable for further processing. For any serious consumption of such documents, it is necessary to first clean up the mess and bring the order to tags, attributes and ordinary text. For the given HTML document, HtmlCleaner reorders individual elements and produces well-formed XML. By default, it follows similar rules that the most of web browsers use in order to create Document Object Model. However, user may provide custom tag and rule set for tag filtering and balancing.
Maintainer: Igor Vlasenko