XML and UTF-8 BOM. [Was Re: Using Template Toolkit and UTF-8]

Matt Sergeant msergeant at startechgroup.co.uk
Thu Jan 19 13:09:09 GMT 2006


On Thu, 19 Jan 2006, Aaron Crane wrote:

> Steve Sims writes:
> > These saved files have been generated by TT as UTF-8 but those files  
> > do not contain a BOM
> 
> There wasn't really meant to be any such thing as a "UTF-8 BOM", and
> there are situations in which it's harmful.  (It's not clear that XML
> documents are well-formed if their first three bytes are 0xef 0xbb 0xbf
> and they contain an XML declaration, for example.)

Not so. You can even read the XML::SAX::PurePerl code for processing BOMs,
which looks for one before checking for XML content. It's even talked
about in the XML spec, IIRC.
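
To make the byte-level point concrete, here is a minimal sketch in Python of
checking for a UTF-8 BOM before looking for the XML declaration. It is not
the XML::SAX::PurePerl code; the function names strip_utf8_bom and
starts_with_xml_decl are made up for illustration, and it only covers the
UTF-8 case (0xef 0xbb 0xbf), not the UTF-16 or UTF-32 marks.

```python
import codecs
from typing import Tuple

# The three bytes under discussion: 0xef 0xbb 0xbf.
UTF8_BOM = codecs.BOM_UTF8


def strip_utf8_bom(data: bytes) -> Tuple[bytes, bool]:
    """Return (payload, had_bom), dropping a leading UTF-8 BOM if present.

    Hypothetical helper for illustration only.
    """
    if data.startswith(UTF8_BOM):
        return data[len(UTF8_BOM):], True
    return data, False


def starts_with_xml_decl(data: bytes) -> bool:
    """Check for an XML declaration, skipping a leading UTF-8 BOM first."""
    payload, _had_bom = strip_utf8_bom(data)
    return payload.startswith(b"<?xml")


with_bom = UTF8_BOM + b'<?xml version="1.0" encoding="UTF-8"?><doc/>'
without_bom = b'<?xml version="1.0" encoding="UTF-8"?><doc/>'

payload, had_bom = strip_utf8_bom(with_bom)
```

With these inputs, both documents are seen as starting with an XML
declaration once the BOM is skipped, and the payload of the BOM-prefixed
one is byte-for-byte the same as the plain one.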

Matt.



