[Israel.pm] regexp
Issac Goldstand
margol at beamartyr.net
Sun Jun 25 11:58:29 EEST 2006
>>
>> "aaa<asd>='asd'/6>bbb<asd>='asd'/3>ccc<asd>='asd'/5>ddd###"
[snip]
> Using simple regexps to parse HTML (which seems similar to your problem) is a
> very old Perl request, and often appears in #perl on Freenode.
It's not valid HTML. Look carefully at the "closing tag". So HTML
parsers probably won't help. If it was, it'd be enough to cleanup the
trailing ### (or whatever other EOL marker) and run it through
HTML::Parser asking just for the body text.
Yitzchak
More information about the Perl
mailing list