Package perl-HTML-TagFilter: Specfile
# # - HTML::TagFilter - # This spec file was automatically generated by cpan2rpm [ver: 2.027] # (ALT Linux revision) # The following arguments were used: # HTML::TagFilter # For more information on cpan2rpm please visit: http://perl.arix.com/ # %define module HTML-TagFilter %define m_distro HTML-TagFilter %define m_name HTML::TagFilter %define m_author_id unknown %define _enable_test 1 Name: perl-HTML-TagFilter Version: 1.03 Release: alt1 Summary: A fine-grained html-filter, xss-blocker and mailto-obfuscator License: Artistic Group: Development/Perl Url: http://www.cpan.org Packager: Andrey V. Stroganov <dja@altlinux.org> BuildArch: noarch Source: %m_distro-%version.tar # Automatically added by buildreq on Mon Jul 11 2011 # optimized out: perl-URI BuildRequires: perl-HTML-Parser perl-devel %description HTML::TagFilter is a subclass of HTML::Parser with a single purpose: it will remove unwanted html tags and attributes from a piece of text. It can act in a more or less fine-grained way - you can specify permitted tags, permitted attributes of each tag, and permitted values for each attribute in as much detail as you like. Tags which are not allowed are removed. Tags which are allowed are trimmed down to only the attributes which are allowed for each tag. It is possible to allow all or no attributes from a tag, or to allow all or no values for an attribute, and so on. The filter will also guard against cross-site scripting attacks and obfuscate any mailto:email addresses, unless you tell it not to. The original purpose for this was to screen user input. In that setting you'll often find that just using: my $tf = new HTML::TagFilter; put_in_database($tf->filter($my_text)); will do. However, it can also be used for display processes (eg text-only translation) or cleanup (eg removal of old javascript). In those cases you'll probably want to override the default rule set with a small number of denial rules. my $self = HTML::TagFilter->new(deny => {img => {'all'}}); print $tf->filter($my_text); Will strip out all images, for example, but leave everything else untouched. nb (faq #1) the filter only removes the tags themselves: all it does to text which is not part of a tag is to escape the <s and >s, to guard against false negatives and some common cross-site attacks. obPascal: Sorry about the incredibly long documentation, by the way. When I have time I'll make it shorter. %prep %setup -n %m_distro-%version %build %perl_vendor_build %install %perl_vendor_install %files %perl_vendor_privlib/HTML/* %changelog * Mon Jul 11 2011 Andrey V. Stroganov <dja@altlinux.org> 1.03-alt1 - initial build for ALT Linux Sisyphus |