Sisyphus repository
Last update: 2018-10-16 21:08:37 +0400 | SRPMs: 18651 | Sign in or Sign up
en ru uk br
ALT Linux repositories
hide window
Sisyphus: 1.03-alt1
p8: 1.03-alt1
p7: 1.03-alt1
t7: 1.03-alt1
Platform6: 1.03-alt1
t6: 1.03-alt1

Other repositories
hide window
CPAN: 1.03

Group :: Development/Perl
Source RPM: perl-HTML-TagFilter

 Main   Changelog   Spec   Patches   Sources   Download   Gear   Bugs and FR (0/0)   Repocop 

Current version: 1.03-alt1
Built: about 7 years ago
Size: 18.5 KB
Repocop status: ok

Home page:

License: Artistic
Summary: A fine-grained html-filter, xss-blocker and mailto-obfuscator

HTML::TagFilter is a subclass of HTML::Parser with a single purpose:
it will remove unwanted html tags and attributes from a piece of text.
It can act in a more or less fine-grained way - you can specify
permitted tags, permitted attributes of each tag, and permitted
values for each attribute in as much detail as you like.

Tags which are not allowed are removed. Tags which are allowed are
trimmed down to only the attributes which are allowed for each tag.
It is possible to allow all or no attributes from a tag, or to allow
all or no values for an attribute, and so on.

The filter will also guard against cross-site scripting attacks
and obfuscate any mailto:email addresses, unless you tell it not to.

The original purpose for this was to screen user input.
In that setting you'll often find that just using:

my $tf = new HTML::TagFilter;

will do. However, it can also be used for display processes
(eg text-only translation) or cleanup (eg removal of old javascript).
In those cases you'll probably want to override the default rule set
with a small number of denial rules.

my $self = HTML::TagFilter->new(deny => {img => {'all'}});
print $tf->filter($my_text);

Will strip out all images, for example, but leave everything
else untouched.

nb (faq #1) the filter only removes the tags themselves:
all it does to text which is not part of a tag is to escape
the <s and >s, to guard against false negatives and some common
cross-site attacks.

obPascal: Sorry about the incredibly long documentation, by the way.
When I have time I'll make it shorter.

Current maintainer: Andrey Stroganov

List of contributors: ACL: List of rpms provided by this srpm:
  • perl-HTML-TagFilter
Recent changes (last three changelog entries):

2011-07-11 Andrey V. Stroganov <dja at> 1.03-alt1

    - initial build for ALT Linux Sisyphus

© 2009–2018 Igor Zubkov