In this case what goes over the wire is ASCII but the ASCII represents
the "HTML encoded" character in UCS-2. The wonderful thing about
HTML-encoded characters is that the encoding is unique and therefore web
servers always know how to unencode each character.
-Paul
Paul Deuter
Internationalization Manager
Plumtree Software
paul.deuter@plumtree.com
-----Original Message-----
From: Timothy Greenwood [mailto:tgreenwood@openmarket.com]
Sent: Monday, October 01, 2001 1:41 PM
To: Paul Deuter; Timothy Greenwood; souravm; www-international@w3.org
Subject: RE: ISO-8859-1
> Numeric character references? What are those?
From the HTML specification
5.3.1 Numeric character references
Numeric character references specify the code position of a character
in the document character set. Numeric character references may take
two forms:
* The syntax "&#D;", where D is a decimal number, refers to the ISO
10646 decimal character number D.
* The syntax "&#xH;" or "&#XH;", where H is a hexadecimal number,
refers to the ISO 10646 hexadecimal character number H.
Hexadecimal numbers in numeric character references are
case-insensitive.