
The tables below list CJKI's lexical resources, along with the approximate number of entries for each. Most of our extensive data for CJK single-character input methods is not included, nor is our data for European languages, such as our Spanish resources. Our Simplified <> Traditional Chinese mapping tables have also been excluded.
A detailed description of most of our CJK lexical databases, which currently contain over 24 million entries, can be found at:
http://www.cjk.org/cjk/samples/

| Code | Language |
|---|---|
| SC | Simplified Chinese |
| TC | Traditional Chinese |
| C | Simplified and/or Traditional Chinese |
| J | Japanese |
| K | Korean |
| E | English |
| CAN | Cantonese |
| A | Arabic |

| Language | Description | Entries | Remarks |
|---|---|---|---|
| J<>E | Companies and organizations | 600,000 | |
| J<>E | Personal names | 570,000 | |
| J<>E | Personal name variants | 3,500,000 | |
| J<>E | Place names | 200,000 | |
| J | General vocabulary monolingual | 300,000 | excludes proper nouns |
| J>E | General vocabulary kanji | 40,000 | based on NJECD; to be expanded |
| E>J | General vocabulary bilingual | 82,000 | |
| J | Phonological/phonemic, general/proper | 130,000 | |
| J>E | General vocabulary bilingual | 110,000 | |
| J | General vocabulary katakana | 50,000 | some English |
| J | Pornographic terms | 720 | some English |
| J<>E | Technical terms | 1,000,000 | max. 1.5 million |
| J<>E | Other | 50,000 | |
| Total | | 6,632,720 | |

| Language | Description | Entries | Remarks |
|---|---|---|---|
| SC<>E | Personal names | 650,000 | |
| SC<>E | Personal name variants | 243,000 | |
| TC<>E | Personal names | 650,000 | |
| SC<>E | Place names | 170,000 | |
| TC<>E | Place names | 170,000 | |
| SC<>J | Proper nouns (place/personal) | 106,000 | |
| SC<>E | Companies and organizations | 55,000 | |
| TC<>E | Companies and organizations | 55,000 | |
| SC<>E | Computer terms | 100,000 | |
| TC<>E | Computer terms | 100,000 | |
| SC<>E | Technical terms | 4,750,000 | |
| SC<>J | Technical terms | 820,000 | |
| SC | General vocabulary monolingual | 250,000 | excludes proper nouns |
| TC | General vocabulary monolingual | 250,000 | excludes proper nouns |
| E>SC | General vocabulary bilingual | 80,000 | |
| SC>E | General vocabulary bilingual | 700,000 | |
| E>TC | General vocabulary bilingual | 85,000 | |
| SC>J | Chinese-Japanese proper nouns | 600,000 | |
| CAN | Cantonese input method | 25,000 | |
| | Other | 75,000 | |
| Total | | 9,934,000 | |

| Language | Description | Entries | Remarks |
|---|---|---|---|
| CJKEA | Place/personal names multilingual | 150,000 | |
| CJE | Technical terms multilingual | 150,000 | under development, eventually 500,000 |
| Total | | 300,000 | |

| Language | Description | Entries | Remarks |
|---|---|---|---|
| AE | Romanized name variants | 7,000,000 | |
| A | Arabic name variants | 220,000 | |
| CJKEA | Place/personal names multilingual | 150,000 | Arabic partially available |
| EA | Romanized place names | 6,000 | |
| Total | | 7,376,000 | |

| Language | Description | Entries | Remarks |
|---|---|---|---|
| K<>E | Personal and place names | 30,000 | in progress |
| K | Companies and organizations | 30,000 | some English; in progress |
| K | Pornographic terms | 610 | some English |
| K | General vocabulary monolingual | 100,000 | in progress |
| E>K | General vocabulary bilingual | 80,000 | in progress |
| K | Korean input method | 11,172 | |
| Total | | 251,782 | |
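
The five table totals above can be cross-checked against the "over 24 million entries" figure with a short sketch. The section labels in the dictionary are assumptions inferred from the language codes, since the tables themselves carry no titles. The adjustment for the CJKEA row reflects that the same 150,000-entry resource appears in two of the tables.

```python
# Totals copied from the "Total" rows of the five tables above.
# The section labels are assumptions inferred from the language codes;
# the source tables carry no titles.
section_totals = {
    "Japanese": 6_632_720,
    "Chinese": 9_934_000,
    "Multilingual": 300_000,
    "Arabic": 7_376_000,
    "Korean": 251_782,
}

# The CJKEA "Place/personal names multilingual" row (150,000 entries)
# is listed in two tables, so it is counted twice in a plain sum.
DOUBLE_LISTED = 150_000

grand_total = sum(section_totals.values())
distinct_total = grand_total - DOUBLE_LISTED

print(f"Sum of table totals:     {grand_total:,}")
print(f"Counting CJKEA row once: {distinct_total:,}")
```

Either way of counting gives a figure above 24 million, which agrees with the number quoted in the introduction.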