Chinese Wordlist

Over 10 million SC and TC entries

Covers proper nouns and technical terms

Ideal for speech technology

Overview

CJKI maintains comprehensive monolingual wordlists for CJK languages, Arabic, and Spanish covering some 30 million entries. Our Chinese Wordlist (CWL) covers approximately 10.6 million entries (both Simplified and Traditional) for general vocabulary, proper nouns, and technical terminology, and includes pinyin/zhuyin readings, part-of-speech codes, and more.

Used by some of the world’s leading IT companies, this database is suitable for a variety of NLP applications for information retrieval like search engines, morphological analysis tools like tokenizers, and speech technology applications like text-to-speech synthesis.

* Select different dialects by clicking on the tabs below.
TypeSCPinyinPOS
G作用zuò yòngNC
G作用zuò yòngV
G作出zuò chūV
G最终zuì zhōngD
G状态zhuàng tàiNC
G状况zhuàng kuàngNC
G重大zhòng dàA
G制度zhì dùNC
G指导zhǐ dǎoV
G指出zhǐ chūV
G值得zhí deV
G之中zhī zhōngWL
G知识zhī shiNC
G政治zhèng zhìNC
G真是zhēn shìD
G真实zhēn shíA
G真是zhēn shiV
G针对zhēn duìV
G增长zēng zhǎngV
G再次zài cìD
G运动yùn dòngNC
G运动yùn dòngV
G有时yǒu shíD
G永远yǒng yuǎnD
G一切yī qièNR
G医疗yī liáoNC
G眼睛yǎn jingNC
G迅速xùn sùA
G形象xíng xiàngNC
G心理xīn lǐNC
G相应xiāng yìngV
G相对xiāng duìD
G相对xiāng duìV
G限制xiàn zhìV
G先进xiān jìnA
G下去xià qùV
G下面xià mianWL
G文学wén xuéNC
G委员会wěi yuán huìNC
G为主wéi zhǔV
G投入tóu rùV
G通讯tōng xùnNC
G特殊tè shūA
G特色tè sèNC
G特别是tè bié shìA
G特别是tè bié shìD
G它们tā menNR
G思想sī xiǎngNC
G数字shù zìNC
G属于shǔ yúV
G授权shòu quánV
G收入shōu rùNC
G收入shōu rùV
G实在shí zàiA
G实在shí zàiD
G价值jià zhíNC
G加上jiā shàngV
G集中jí zhōngA
G集中jí zhōngV
G即将jí jiāngD
G回到huí dàoV
G回答huí dáV
G后来hòu láiNT
G孩子hái ziNC
G关键guān jiànNC
G股票gǔ piàoNC
G根据gēn jùNC
G根据gēn jùPR
G各地gè dìNR
G高级gāo jíA
G感谢gǎn xièV
P阿迪勒ā dí lèNP
T阿迪芬宁ā dí fēn níngn
P阿笛尔ā dí ěrU
P阿迪ā díNP
P阿登高原ā dēng gāo yuánNP
P阿登ā dēngNP
P阿德瓦尔ā dé wǎ ěrNP
P阿德诺伊尔ā dé nuò yī ěrNP
T阿德内弗林ā dé nèi fú línn
P阿德内ā dé nèiNP
P阿德南ā dé nánNP
P阿德马ā dé mǎNP
T阿得罗依得数ā dé luó yī dé shùn
P阿德列尔ā dé liè ěrNP
P阿德里亚诺ā dé lǐ yà nuòNP
P阿德里亚纳ā dé lǐ yà nàNP
P阿德里昂ā dé lǐ ángNP
P阿德里安娜ā dé lǐ ān nàNP
P阿德里安ā dé lǐ ānNP
T阿德勒反常ā dé lè fǎn chángn
P阿德勒ā dé lèNP
P阿德莱德ā dé lái déNP
P阿德拉ā dé lāNP
T阿德克索林ā dé kè suǒ línn
P阿德克公司ā dé kè gōng sīNP
P阿德尔曼ā dé ěr mànNP
P阿德尔伯特ā dé ěr bó tèNP
P阿德尔ā dé ěrNP
P阿德ā déNP
P阿道弗斯ā dào fú sīNP
P阿道福ā dào fúNP
P阿道弗ā dào fúNP
P阿道夫ā dào fūNP
P阿当松ā dāng sōngNP
P阿丹ā dānNP
P阿黛尔ā dài ěrNP
P阿代尔ā dài ěrNP
G阿呆ā dāiA
G阿呆ā dāiN
G阿大ā dàN
P阿达舍夫ā dá shè fūNP
P阿达尼ā dá níNP
P阿达穆斯ā dá mù sīNP
P阿达姆斯基ā dá mǔ sī jīNP
P阿达姆ā dá mǔNP
P阿达莫维奇ā dá mò wéi qíNP
P阿达莫夫斯基ā dá mò fū sī jīNP
P阿达莫夫ā dá mò fūNP
P阿达莫ā dá mòNP
P阿达米昂ā dá mǐ ángNP
P阿达米安ā dá mǐ ānNP
P阿达米ā dá mǐNP
P阿达麦茨ā dá mài cíNP
P阿达马ā dá mǎNP
P阿达玛ā dá mǎNP
P阿达克ā dá kèNP
P阿达尔韦特ā dá ěr wéi tèNP
P阿达尔贝特ā dá ěr bèi tèNP
P阿达ā dáNP
P阿措里ā cuò lǐNP
P阿春ā chūnNP
P阿川ā chuānNP
P阿出川ā chū chuānNP
P阿城市ā chéng shìNP
P阿城ā chéngNP
P阿琛迪亚ā chēn dí yàNP
P阿彻ā chèNP
G阿昌族ā chāng zúN
P阿昌庞ā chāng pángNP
P阿查德ā chá déNP
P阿查ā cháNP
P阿布朱格ā bù zhū géNP
P阿布扎比ā bù zhā bǐNP
P阿布施ā bù shīNP
T阿布尼定律ā bù ní dìng lǜU
P阿布鲁佐ā bù lǔ zuǒNP
P阿布鲁齐ā bù lǔ qíNP
T阿布里科索夫理论ā bù lǐ kē suǒ fū lǐ lùnn
P阿布里科索夫ā bù lǐ kē suǒ fūNP
P阿布雷乌ā bù léi wūNP
P阿布勒ā bù lèNP
P阿布兰ā bù lánNP
P阿布拉西莫夫ā bù lā xī mò fūNP
P阿布拉瓦内尔ā bù lā wǎ nèi ěrNP
P阿布拉姆松ā bù lā mǔ sōngNP
P阿布拉姆斯ā bù lā mǔ sīNP
P阿布拉姆ā bù lā mǔNP
P阿布拉莫维茨ā bù lā mò wéi cíNP
P阿布拉莫夫斯基ā bù lā mò fū sī jīNP
P阿布拉莫夫ā bù lā mò fūNP
P阿布拉哈ā bù lā hāNP
T阿布拉盖-庞德理论ā bù lā gài páng dé lǐ lùnn
T阿布拉盖模型ā bù lā gài mó xíngn
P阿布拉ā bù lāNP
P阿布贾ā bù jiǎNP
P阿布尔公司ā bù ěr gōng sīNP
P阿布德哈尔登ā bù dé hā ěr dēngNP
P阿部ā bùNP
P阿布ā bùNP
T吖卟吩ā bǔ fēnU
P阿卜杜什ā bǔ dù shíNP
P阿卜杜勒ā bǔ dù lèNP
P阿卜杜拉赫曼ā bǔ dù lā hè mànNP
P阿卜杜拉ā bǔ dù lāNP
P阿博维扬ā bó wéi yángNP
P阿博特ā bó tèNP
P阿伯斯ā bó sīNP
P阿伯内西ā bó nèi xīNP
P阿伯内克ā bó nèi kèNP
P阿伯纳西ā bó nà xīNP
T阿博霉素ā bó méi sùn
P阿伯里ā bó lǐNP
T阿勃勒ā bó lèN
P阿伯兰ā bó lánNP
P阿伯克龙比ā bó kè lóng bǐNP
T阿博回旋加速器ā bó huí xuán jiā sù qìn
P阿伯尔ā bó ěrNP
P阿伯丁ā bó dīngNP
G阿伯ā bóN
P阿伯ā bóNP
P阿波银行ā bō yín hángNP
P阿波斯托列斯库ā bō sī tuō liè sī kùNP
P阿波斯托利ā bō sī tuō lìNP
P阿波斯托尔ā bō sī tuō ěrNP
T阿波罗太空船ā bō luó tài kōng chuánN
P阿波罗ā bō luóNP
P阿波里耐ā bō lǐ nàiNP
P阿波尔蒂ā bō ěr dìNP
P阿波ā bōNP

Practical Applications

CJKI’s Comprehensive Wordlists are being used by some of the world’s leading IT companies for a variety of natural language processing applications, including:

Information retrieval

Morphological analysis

Word segmentation

Related Resources

JWL

Japanese Wordlist

General vocabulary, proper nouns and technical terms

KWL

Korean Wordlist

General vocabulary, proper nouns and technical terms

AWL

Arabic Wordlist

General vocabulary, proper nouns and technical terms