On Wed, Oct 17, 2018 at 02:12:37PM -0500, Bruno Wolff III wrote: > On Wed, Oct 17, 2018 at 14:48:52 -0400, > "tonynelson@xxxxxxxxxxxxxxxxx" <tonynelson@xxxxxxxxxxxxxxxxx> wrote: > >> ... For html only messages you would either need to reject them or rewrite them, both of which have issues. > >I've used elinks to do that in an email forum I wrote. It worked better than doing it with, say, Beautiful Soup. > > That is a big risk on your list serve processor. I would want to use something > a lot safer than elinks (or lynx) to parse unsolicited email messages. What I > do at work is use a simple perl script, but it doesn't do a great job. I use a perl script with these modules and some regexps to clean up the result: use HTML::Strip; use HTML::LinkExtor; use HTML::Entities qw/decode_entities/; use URI::Escape qw/uri_unescape/; _______________________________________________ devel mailing list -- devel@xxxxxxxxxxxxxxxxxxxxxxx To unsubscribe send an email to devel-leave@xxxxxxxxxxxxxxxxxxxxxxx Fedora Code of Conduct: https://getfedora.org/code-of-conduct.html List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/devel@xxxxxxxxxxxxxxxxxxxxxxx