Quarantining crap HTML?

Jérôme Étévé jerome.eteve at gmail.com
Tue May 21 12:42:32 BST 2013


What about parsing it with a lax (X)HTML parser and re-serialising the
result? That way whatever you embed in your page is at least well-formed.
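
A minimal sketch of that idea, in Python rather than Perl and using only
the standard library's lenient html.parser. The names Rebalancer and
quarantine are made up for illustration. It only balances tags (drops
stray closers, closes anything left open); it is not a sanitiser, so
scripts, styles and event attributes would still need stripping
separately.

```python
from html import escape
from html.parser import HTMLParser

# Elements that never take a closing tag.
VOID = {"area", "base", "br", "col", "embed", "hr", "img", "input",
        "link", "meta", "source", "track", "wbr"}


class Rebalancer(HTMLParser):
    """Hypothetical helper: re-emit a fragment with balanced tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []    # serialised pieces, joined at the end
        self.stack = []  # elements that are currently open

    def handle_starttag(self, tag, attrs):
        rendered = "".join(
            f' {name}="{escape(value or "", quote=True)}"'
            for name, value in attrs
        )
        self.out.append(f"<{tag}{rendered}>")
        if tag not in VOID:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag not in self.stack:
            return  # stray closing tag: drop it
        # Close everything opened after `tag`, then `tag` itself.
        while self.stack:
            open_tag = self.stack.pop()
            self.out.append(f"</{open_tag}>")
            if open_tag == tag:
                break

    def handle_data(self, data):
        self.out.append(escape(data, quote=False))


def quarantine(fragment):
    """Return `fragment` re-serialised so no element is left open."""
    parser = Rebalancer()
    parser.feed(fragment)
    parser.close()  # flushes any buffered trailing text
    while parser.stack:
        parser.out.append(f"</{parser.stack.pop()}>")
    return "".join(parser.out)


print(quarantine("<div><b>bold<p>text</div></i> tail"))
```

Wrapping the result in a container element of your own should then keep
an unclosed <div> or <table> in the remote markup from swallowing the
rest of the page.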


On 21 May 2013 12:31, Dave Hodgkinson <daveh at hodgkinson.org> wrote:
> In keeping with the spirit of the list, this isn't directly a Perl question
> but it might be part of the solution.
>
> I'm picking up HTML from another site, and that HTML is pretty crappy.
>
> Is there any way of quarantining it so it doesn't bugger up the rest of the
> page?



-- 
Jerome Eteve
+44(0)7738864546
http://www.eteve.net/

