Chinese Phonetic Database

Chinese Phonetic Database 

Over four million entries

Precise PRC and Taiwan pronunciation differences

Ideal for speech technology

Overview

The CJKI Chinese Phonological Database (CPD) currently contains over four million Simplified Chinese (SC) and Traditional Chinese (TC) headwords covering general vocabulary, important technical terms, and proper nouns. Each headword is accompanied by pinyin or zhuyin readings and various other attributes. A unique feature of CPD is that the readings take into account the differences in pronunciation between Taiwan and the PRC.

For example, pinyin for 危险 ‘dangerous’ is wēixiǎn for SC but wéixiǎn for TC, as shown in the table below. The readings have been thoroughly proofread for accuracy, and explicitly indicate the neutral tone, which is often ignored by conventional dictionaries.

SC-TC Pinyin Differences
SCSC PinyinTCTC Pinyin
临期línqī臨期línqí
企业qǐyè企業qìyè
倬雄zhuōxióng倬雄zhuóxióng
wēiwéi
危险wēixiǎn危險wéixiǎn
埒城lièchéng埒城lèchéng
夕日xīrì夕日xìrì
大期dàqī大期dàqí
巍八郎wēibāláng巍八郎wéibāláng
帆柱fānzhù帆柱fánzhù
wēiwéi
微笑wēixiào微笑wéixiào
拙夫zhuōfū拙夫zhuófū
无着wúzhuó無著wúzhuó
昔日xīrì昔日xírì
显微镜xiǎnwēijìng顯微鏡xiǎnwéijìng
期待qīdài期待qídài
池穴chíxué池穴chíxuè
理发lǐfà理髮lǐfǎ
隆巴妮lóngbānī隆巴妮lóngbāní
麦卡锡màikǎxī麥卡錫màikǎxí
SC: Simplified Chinese
TC: Traditional Chinese

Practical Applications

CPD can be used in a variety of applications such as:

Input method editors

Transcription systems

Pedagogical applications

Text-to-speech

Related Resources

Chinese to Chinese Conversion

SC/TC mapping tables supporting orthographic and lexemic conversion

Yue Phonetic Database

Phonemic transcriptions of Cantonese (Yue) vocabulary in jyutping

Japanese Phonetic Database

IPA phonetic and phonemic transcriptions for core Japanese vocabulary