Covers over 300,000 Cantonese entries

Includes standard Jyutping romanization

All major romanization systems can be provided


CJKI’s Yue Phonetic Database (YPD) provides Cantonese readings for 300,000 compound words, along with approximately 80,000 readings and romanized variants for about 13,000 single Traditional Chinese characters.

YPD gives phonemic transcriptions in the standard Jyutping romanization and can also supply them in up to ten Cantonese romanization systems; accurate IPA transcriptions are also possible. Readings are ordered by frequency and/or importance, and flags distinguish common readings from rare ones.
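
As a rough illustration of what a Jyutping transcription contains, the sketch below splits one syllable into onset, final and tone, which is a typical first step before mapping to another romanization system. It assumes the usual Jyutping inventory of 19 onsets and tone digits 1 to 6; the function name `split_jyutping` is hypothetical and not part of YPD, and the final is not validated against the full Jyutping final inventory.

```python
import re

# Jyutping onsets, with two-letter onsets listed first so they match before
# their one-letter prefixes (e.g. "gw" before "g", "ng" before "n").
ONSETS = ("gw", "kw", "ng", "b", "p", "m", "f", "d", "t", "n",
          "l", "g", "k", "h", "w", "z", "c", "s", "j")

# Syllabic nasals are finals on their own and take no onset.
SYLLABIC_NASALS = ("m", "ng")


def split_jyutping(syllable):
    """Split one Jyutping syllable such as 'gwong2' into (onset, final, tone).

    Hypothetical helper for illustration only; it does not check that the
    final is a valid Jyutping final.
    """
    match = re.fullmatch(r"([a-z]+)([1-6])", syllable)
    if match is None:
        raise ValueError(f"not a Jyutping syllable: {syllable!r}")
    letters, tone = match.group(1), int(match.group(2))

    if letters in SYLLABIC_NASALS:
        return "", letters, tone

    for onset in ONSETS:
        # Require at least one letter to remain for the final.
        if letters.startswith(onset) and len(letters) > len(onset):
            return onset, letters[len(onset):], tone

    # No onset: the syllable starts with a vowel.
    return "", letters, tone


if __name__ == "__main__":
    for s in ("gwong2", "hoeng1", "ng5", "aa3"):
        print(s, split_jyutping(s))
```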

- [S] Single character entries
- [G] General vocabulary
- [P] Proper nouns
- [T] Technical terms

TC: Traditional Chinese
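
To make the category codes and reading flags concrete, here is a minimal sketch of how one entry might be modelled in code. The tab-separated layout, the `|` separator, the trailing `*` marking a rare reading, and the names `Entry`, `Reading`, `parse_line` and `common_readings` are all assumptions made for illustration; the actual YPD file format is not described here, and the sample rows are illustrative rather than taken from the database.

```python
from dataclasses import dataclass
from typing import List

# Category codes as listed above.
CATEGORIES = {
    "S": "Single character entries",
    "G": "General vocabulary",
    "P": "Proper nouns",
    "T": "Technical terms",
}


@dataclass
class Reading:
    jyutping: str       # e.g. "hoeng1 gong2"
    rare: bool = False  # hypothetical flag: True for a rare reading


@dataclass
class Entry:
    headword: str            # Traditional Chinese headword
    category: str            # one of the keys in CATEGORIES
    readings: List[Reading]  # assumed to be ordered most common first


def parse_line(line):
    """Parse one line of an assumed layout: headword<TAB>category<TAB>readings.

    Readings are separated by "|"; a trailing "*" marks a rare reading.
    """
    headword, category, raw = line.rstrip("\n").split("\t")
    if category not in CATEGORIES:
        raise ValueError(f"unknown category code: {category!r}")
    readings = [Reading(r.rstrip("*"), r.endswith("*")) for r in raw.split("|")]
    return Entry(headword, category, readings)


def common_readings(entry):
    """Return the Jyutping strings of readings not flagged as rare, in order."""
    return [r.jyutping for r in entry.readings if not r.rare]


if __name__ == "__main__":
    # Illustrative rows only; the rare flag here is made up for the example.
    place = parse_line("香港\tP\thoeng1 gong2")
    char = parse_line("行\tS\thang4|hong4|hang6*")
    print(CATEGORIES[place.category], common_readings(place))
    print(CATEGORIES[char.category], common_readings(char))
```

Keeping the readings as an ordered list preserves the frequency ranking, so a speech synthesis front end could, for example, take the first non-rare reading as its default.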

Practical Applications

YPD is ideal for applications such as:

- Natural language processing applications, such as speech recognition and speech synthesis
- Machine translation, for use in input method editors, TTS for car navigation systems and speech-to-speech systems

Related Resources

- Chinese Phonetic Database: phonemic transcriptions showing differences between the PRC and Taiwan
- Japanese Phonetic Database: IPA phonetic and phonemic transcriptions for core Japanese vocabulary
- Chinese Hanyu Pinyin Database: accurate Hanyu Pinyin data including technical terms and proper nouns