JIS X 0208
From Wikipedia, the free encyclopedia
JIS X 0208 is a Japanese Industrial Standard defining a set of kanji indexed by a pair of integers from 1 to 94 (this is known as the kuten pair of the kanji). This standard was previously known as JIS-C-6226.
The standard defines two "levels" of kanji. Level 1 contains 2965 characters of the most common kanji (arranged by their on'yomi - Chinese style - pronunciation), and level 2 contains 3390 characters the next most common kanji (arranged in dictionary order after the level 1 characters). Also encoded are katakana, hiragana, romaji (Latin characters), Greek, Cyrillic, line drawing characters and various symbols.
JIS X 0208 is incorporated into many Japanese encodings, such as Shift JIS, EUC-JP and ISO 2022-JP.
Japanese software typically uses the order of JIS X 0208 to sort kanji for display to the user. This is essentially the same as the order of characters by radical and stroke count used in a kanji dictionary.
A small number of characters in the set have no recorded uses, and have unknown readings and meanings. They are called 幽霊文字 (yuureimoji, ghost characters). When the JIS set was being created, many documents were consulted to discover various place names, to make sure that a high percentage of Japanese place names would be represented in the set. During this process, some mistakes were made in the transmission of certain characters (i.e a crease in a paper being interpreted as a stroke, or a scribbled character being incorrectly read) resulting in about 20 characters that have no known instances of use.

