Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl Monk, Perl Meditation
 
PerlMonks  

Re: Re: What are you expecting XML to be in?

by John M. Dlugosz (Monsignor)
on Jun 03, 2001 at 22:44 UTC ( [id://85352]=note: print w/replies, xml ) Need Help??


in reply to Re: What are you expecting XML to be in?
in thread Converting character encodings

That's a proper subset of UTF-8, so not really necessary. Can a particular XML file be represented in, say, 8859-6 or JIS-X, and still be standard? I don't like this because it means that a file can't be read unless the parser knows that character set.
  • Comment on Re: Re: What are you expecting XML to be in?

Replies are listed 'Best First'.
Re: Re: Re: What are you expecting XML to be in?
by merlyn (Sage) on Jun 04, 2001 at 02:19 UTC
    7-bit ISO-8859-1 (also called "ASCII" {grin}) is a proper subset of UTF-8, but not 8-bit ISO-8859-1. So yes, you'd need to declare the file as ISO-8859-1 if you wanted to have any "second half" characters, but otherwise you can let it default to UTF-8.

    -- Randal L. Schwartz, Perl hacker

      Oh yea, that's right. 8859-1 is a subset of Unicode character encoding, but not UTF-8. I mixed them up.

      So one can find existing xml files with various encodings? Does that mean a parser needs a conformance statement as to what encodings it can understand?

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://85352]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others learning in the Monastery: (2)
As of 2024-04-24 17:57 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found