Unicode in C
Elazar Leibovich
elazarl at gmail.com
Mon Mar 12 15:56:05 IST 2012
On Mon, Mar 12, 2012 at 3:20 PM, Omer Zak <w1 at zak.co.il> wrote:
>
> If you need to use Far Eastern fonts and/or have random access for your
> text, use fixed size wide character encoding (16 bit or 32 bit size).
>
>
Note that UTF-16 doesn't really offer random access, due to surrogate
pairs: not all Unicode code points fit into 0..2^16-1, so any code point
above U+FFFF takes two 16-bit code units. Some implementations simply
ignore this fact.
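
To make the surrogate pair point concrete, here is a minimal sketch in C.
The function names (utf16_next, utf16_code_point_at) are made up for
illustration and are not from any library; the sketch assumes the text is
already an array of 16-bit code units in native byte order, and it passes
unpaired surrogates through unchanged instead of reporting an error.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: decode the code point that starts at units[*i]
 * and advance *i past it (by one or two code units).
 * Simplification: an unpaired surrogate is returned as-is. */
uint32_t utf16_next(const uint16_t *units, size_t len, size_t *i)
{
    assert(units != NULL && i != NULL && *i < len);

    uint16_t hi = units[(*i)++];
    if (hi >= 0xD800 && hi <= 0xDBFF && *i < len) {
        uint16_t lo = units[*i];
        if (lo >= 0xDC00 && lo <= 0xDFFF) {
            (*i)++;
            /* Combine the 10 payload bits of each surrogate. */
            return 0x10000 + (((uint32_t)(hi - 0xD800) << 10)
                              | (uint32_t)(lo - 0xDC00));
        }
    }
    return hi;
}

/* Hypothetical helper: return the n-th code point (0-based), or
 * 0xFFFFFFFF if the text has fewer than n+1 code points.
 * This has to scan from the start, so it is O(n), not O(1). */
uint32_t utf16_code_point_at(const uint16_t *units, size_t len, size_t n)
{
    size_t i = 0;
    while (i < len) {
        uint32_t cp = utf16_next(units, len, &i);
        if (n-- == 0)
            return cp;
    }
    return 0xFFFFFFFF;
}
```

For the code units { 0x0061, 0xD83D, 0xDE00, 0x0062 } ("a", U+1F600, "b"),
plain indexing gives units[2] == 0xDE00, which is the second half of a
surrogate pair, while the third code point is actually 0x62. Code that
treats the array index as a character index is the "ignore this fact"
case.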
I humbly suggest you have a look at
https://github.com/elazarl/javaUnicodePitfalls, where I tried to capture
some common language pitfalls (despite the name, not everything is unique
to Java).
More information about the Linux-il
mailing list