Search code examples
xmlunicodeutf-8xml-entities

How can I convert unicode byte sequence to XML entity?


I want to be able to display the tsu (ツ) character from Katakana instead of smiley faces on my website.

I already found the byte sequence, which is \343\203\204, but I don't know how to represent this as a XML entity, so that I can display it in a HTML page.


Solution

  • If your XML-to-HTML-transformation stack (and the web server delivering the HTML pages) supports unicode properly, it should be as simple as writing the unicode code point as an entity (&x30c4;). If they don't ... well, good luck ツ