|
COMMENTS
|
Hanzi (Kanji) characters for digits 0-9 are: 〇, 一, 二, 三, 四, 五, 六, 七, 八, 九. 零 is also used for number zero.
The hexadecimal values are 3007, 4E00, 4E8C, 4E09, 56DB, 4E94, 516D, 4E03, 516B, 4E5D.
From Alonso del Arte, Oct 20 2019: (Start)
The characters indexed by these code points are more like words than digits, sort of like the difference between the word "three" and the character "3".
With one exception, all of these characters come from the CJK Unified Ideographs block, into which they are sorted according to stroke count or stroke count of the primary radical rather than the numerical value of the number word.
Only the zero character has an associated numeric value that can be accessed through Java's Character.getNumericValue() function. The rest return -1 to indicate "the character does not have a numeric value." (End)
|
|
LINKS
|
Table of n, a(n) for n=0..9.
Oracle, Character Class Javadoc.
Unicode Standard, East Asia, Version 9.0 - Core Specification, Chapter 18.
Unihan Database, Unicode Unihan Database Lookup (characters may be looked up by the hexadecimal values of their code points).
|
|
EXAMPLE
|
The very first character of the CJK Unified Ideographs block is U+4E00 (19968), often used to mean "one," but it can also be used to mean "alone." This is followed by U+4E01, a character sometimes used to mean "male adult" and a character sometimes used to mean "obstruction of breath" (U+4E02).
Next is U+4E03 (19971), which is always used to mean "seven" (though in combination with other characters it may mean a multiple of 7 or a number congruent to 7 mod 10).
|