How to get the Unicode representation of the glyph displayed for a Unicode character

Windows uses the Uniscribe library to substitute typed Arabic and Indic characters based on their position in the word. The substituted glyph still carries the original Unicode code point of the typed character, although the glyph has its own dedicated representation in Unicode. How can I get the Unicode of what is actually displayed, not of what was typed?


There are lots of tools for this, such as ICU, Charmap and others. I recommend http://unicode.codeplex.com; it uses the Unicode Character Database to represent characters.
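As a rough illustration of what a Unicode Character Database lookup returns, here is a minimal sketch using Python's standard `unicodedata` module rather than any of the tools named above. The helper name `ucd_summary` is made up for this example.

```python
import unicodedata

def ucd_summary(ch):
    """Collect a few Unicode Character Database properties for one character."""
    return {
        "code_point": f"U+{ord(ch):04X}",
        "name": unicodedata.name(ch, "<unnamed>"),
        "category": unicodedata.category(ch),
        "bidi_class": unicodedata.bidirectional(ch),
        "decomposition": unicodedata.decomposition(ch),
    }

# U+0628 is the Arabic letter beh as it is typed and stored.
beh = ucd_summary("\u0628")
print(beh)

# The properties depend on the UCD version bundled with the interpreter.
print("UCD version:", unicodedata.unidata_version)
```

Note that every property here describes the character, not the glyph a font or shaping engine ends up drawing for it.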

Note that Unicode only provides information about characters and says nothing about how they must be rendered. The standard merely suggests drawing each character like the example glyph it shows. So to view each code point you need a font with broad Unicode coverage, such as Arial Unicode MS, which is the largest and the best choice on the Windows platform.

Most characters are implemented in this font, but for newer characters you need an updated version of it (if such an update exists), or you can use another font that you know implements the characters you want.


Your interpretation of what is happening in Uniscribe is not correct. Once you have glyphs, the original information is gone; there is no reliable way to go back to Unicode.

Even without going to Arabic, there is no way to tell whether the glyph for the fi ligature (for example) comes from 'f' and 'i' (U+0066 U+0069) or from 'ﬁ' (U+FB01). (http://www.fileformat.info/info/unicode/char/fb01/index.htm)
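The ambiguity can be seen at the character level with a small sketch in Python's standard `unicodedata` module. It does not touch Uniscribe or any font; it only shows that two different code point sequences are treated as compatibility-equivalent, which is why a font can draw the same ligature glyph for both. The helper name `describe` is made up for this example.

```python
import unicodedata

ligature = "\ufb01"  # single code point: LATIN SMALL LIGATURE FI
pair = "fi"          # two code points: U+0066 U+0069

def describe(text):
    """List (code point, character name) for every character in text."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch)) for ch in text]

print(describe(ligature))
print(describe(pair))

# The compatibility decomposition of the ligature names the two plain letters.
print(unicodedata.decomposition(ligature))

# Compatibility normalization (NFKC) folds the ligature into 'f' + 'i'.
same_after_nfkc = unicodedata.normalize("NFKC", ligature) == pair
print(same_after_nfkc)
```

The mapping only runs one way here: from stored text you can reach the letters, but from a rendered ligature glyph alone you cannot tell which of the two inputs produced it.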

Also, some of the resulting glyphs do not have a Unicode value associated with them, so there is no "Unicode of what is actually displayed".
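For the subset of Arabic contextual shapes that do have code points (the compatibility block Arabic Presentation Forms-B, U+FE70 to U+FEFF), the character database can at least list which forms exist for a base letter. The sketch below uses Python's standard `unicodedata` module, and the helper name `presentation_forms` is made up for this example. It is a table lookup only: it does not report what Uniscribe actually rendered, and it does not cover Indic scripts or glyphs without a code point.

```python
import unicodedata

def presentation_forms(base):
    """Map form name -> code point for presentation forms that decompose to base."""
    forms = {}
    for cp in range(0xFE70, 0xFF00):  # Arabic Presentation Forms-B
        parts = unicodedata.decomposition(chr(cp)).split()
        # Skip unassigned code points and anything without a <tag> prefix.
        if not parts or not parts[0].startswith("<"):
            continue
        tag, codes = parts[0].strip("<>"), parts[1:]
        decomposed = "".join(chr(int(code, 16)) for code in codes)
        if decomposed == base:
            forms[tag] = f"U+{cp:04X}"
    return forms

# Forms listed for U+0628 ARABIC LETTER BEH.
beh_forms = presentation_forms("\u0628")
for form, code_point in beh_forms.items():
    print(form, code_point)
```

Choosing between the isolated, initial, medial and final form still requires knowing the neighbouring characters, which is the shaping step the engine performs internally.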
