Re: Problem in publishing multilingual HTML document on web in UTF-8 encoding
From: Laurens Holst <lholst@students.cs.uu.nl>
Date: Fri, 02 Jun 2006 23:09:38 +0200
Message-ID: <4480A912.7020700@students.cs.uu.nl>
To: Philip TAYLOR <P.Taylor@Rhul.Ac.Uk>
Cc: "Wah Java !!" <wahjava@gmail.com>, W3C HTML Mailing List <www-html@w3.org>

Philip TAYLOR wrote:
> it interprets the META directive as you would wish. But in so
> doing, it starts to parse the document on the basis of it being
> expressed in ISO-9999-9, whereupon it discovers that there wasn't
> a META directive at all, there was, rather, a(n ill-formed) BODY
> tag. But because it now knows there /was/ no META directive, it
> parses using ISO-8859-1. But that means there IS a META
> directive. And so on. I'm sure you see the problem ...

On the other hand, languages such as CSS use a similar mechanism to determine the character encoding:

http://www.w3.org/TR/CSS21/syndata.html#x62

So it's not without precedent. Of course, because of the constraints that CSS puts on the location and the encoding of the character encoding identifier, it's a lot simpler to determine than in HTML.

~Grauw

--
Ushiko-san! Kimi wa doushite, Ushiko-san!!
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Laurens Holst, student, university of Utrecht, the Netherlands.
Website: www.grauw.nl. Backbase employee; www.backbase.com.
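
As a rough illustration of the CSS mechanism referred to in the message above, here is a minimal Python sketch of sniffing an `@charset` rule from the first bytes of a stylesheet. It is not taken from the thread: the function name `sniff_css_charset` and its `default` parameter are hypothetical, and the sketch assumes the declaration is written in ASCII-compatible bytes. It ignores the HTTP header, UTF-16, and other cases covered by the CSS 2.1 text linked above.

```python
import re

# The rule must be the very first thing in the stylesheet, in exactly this
# form. Assumption: the bytes are ASCII-compatible (e.g. UTF-8, ISO-8859-x).
_CHARSET_RE = re.compile(rb'^@charset "([^"]*)";')
_UTF8_BOM = b"\xef\xbb\xbf"


def sniff_css_charset(data: bytes, default: str = "utf-8") -> str:
    """Guess a stylesheet's encoding from its leading bytes (simplified)."""
    # Skip a UTF-8 byte order mark if one is present.
    if data.startswith(_UTF8_BOM):
        data = data[len(_UTF8_BOM):]
    match = _CHARSET_RE.match(data)
    if match:
        return match.group(1).decode("ascii", errors="replace").lower()
    # No rule at the fixed position: fall back to the caller's default.
    return default
```

Because the rule is only recognised at one fixed position and in one fixed spelling, the sniffer never has to re-parse the document in the encoding it has just discovered, which is what avoids the loop described in the quoted text.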