margol at beamartyr.net
Sun Jun 25 01:58:29 PDT 2006
> Using simple regexps to parse HTML (which seems similar to your problem) is a
> very old Perl request, and often appears in #perl on Freenode.
It's not valid HTML. Look carefully at the "closing tag". So HTML
parsers probably won't help. If it were valid, it would be enough to
clean up the trailing ### (or whatever other EOL marker) and run it
through HTML::Parser, asking just for the body text.
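
For the hypothetical case where the input is valid HTML, here is a minimal sketch of that approach. It assumes the EOL marker is a literal "###" at the end of each line; the helper name body_text and the sample input are made up for illustration and are not from the original data. It uses the HTML::Parser version 3 handler API.

```perl
use strict;
use warnings;
use HTML::Parser ();

# Hypothetical helper: strip an assumed trailing "###" EOL marker from each
# line, then collect only the text that appears inside <body>.
sub body_text {
    my ($raw) = @_;

    # Remove the assumed marker (and surrounding blanks) at the end of every line.
    (my $html = $raw) =~ s/[ \t]*###[ \t]*$//mg;

    my $in_body = 0;
    my $text    = '';

    my $parser = HTML::Parser->new(
        api_version => 3,
        # Track whether we are currently inside the <body> element.
        start_h => [ sub { $in_body = 1 if $_[0] eq 'body' }, 'tagname' ],
        end_h   => [ sub { $in_body = 0 if $_[0] eq 'body' }, 'tagname' ],
        # Keep decoded text only while inside <body>.
        text_h  => [ sub { $text .= $_[0] if $in_body }, 'dtext' ],
    );
    $parser->parse($html);
    $parser->eof;

    # Collapse runs of whitespace and trim the ends.
    $text =~ s/\s+/ /g;
    $text =~ s/^ | $//g;
    return $text;
}

# Made-up sample input, just to show the call.
print body_text("<html><body><p>Hello, <b>world</b></p></body></html>###\n"), "\n";
```

With input like the original (where the "closing tag" is malformed), the end handler would likely never fire as expected, so this only illustrates the valid-HTML case.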