[Israel.pm] regexp

Issac Goldstand margol at beamartyr.net
Sun Jun 25 01:58:29 PDT 2006

>> "aaa<asd>='asd'/6>bbb<asd>='asd'/3>ccc<asd>='asd'/5>ddd###"


> Using simple regexps to parse HTML (which seems similar to your problem) is a 
> very old Perl request, and often appears in #perl on Freenode.

It's not valid HTML.  Look carefully at the "closing tag".  So HTML
parsers probably won't help.  If it was, it'd be enough to cleanup the
trailing ### (or whatever other EOL marker) and run it through
HTML::Parser asking just for the body text.


More information about the Perl mailing list