Korean Lexical Database
Covers nearly 100,000 entries
Morphological attributes such as POS codes
Phonetic attributes such as romanization and IPA
Overview
The CJKI Korean Lexical Database (KLD) is a monolingual lexical database of Korean developed by CJKI’s Korean editors. It contains approximately 97,000 entries covering general vocabulary, including both free and bound forms, and provides a rich set of grammatical and phonetic attributes, as well as hanja where applicable.
KLD includes a significant number of affixes, particles, auxiliaries, and conjugation pattern codes that account for the inflectional and derivational morphology of Korean, enabling the recognition of inflected forms.
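As a rough illustration of how a lexicon of dictionary forms can support recognition of inflected forms, the sketch below strips a known ending from a surface form and looks up the restored dictionary form. The `LEXICON` dictionary, the `ENDINGS` list, and the `lemmatize` function are all hypothetical names invented for this example; the headwords and POS codes come from the sample entries in this document, and the two endings are a toy stand-in rather than KLD's actual conjugation pattern codes.

```python
# Toy lexicon: dictionary forms and POS codes taken from the sample entries
# in this document. The dict layout itself is an assumption for illustration.
LEXICON = {"가볍다": "AX", "가로놓이다": "V", "가져가다": "V"}

# Hypothetical ending list; KLD's real conjugation pattern codes are not
# shown in this document and are not modeled here.
ENDINGS = ["지만", "고"]


def lemmatize(surface):
    """Return (dictionary form, POS) for a surface form, or None if unknown."""
    if surface in LEXICON:
        return surface, LEXICON[surface]
    # Try longer endings first so the longest matching ending is stripped.
    for ending in sorted(ENDINGS, key=len, reverse=True):
        if surface.endswith(ending):
            # Restore the dictionary form by replacing the ending with 다.
            candidate = surface[: -len(ending)] + "다"
            if candidate in LEXICON:
                return candidate, LEXICON[candidate]
    return None


print(lemmatize("가볍고"))
print(lemmatize("가져가지만"))
```

Plain suffix stripping of this kind only handles forms where the stem is unchanged. Forms with stem alternation, such as 가벼워 from 가볍다, are the cases where per-entry conjugation pattern information becomes necessary.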
Main Features
Phonological information
Such as romanized forms and IPA transcriptions
Semantic classification codes
Such as type of proper noun
Grammatical information
Such as detailed part-of-speech codes
Morphological information
Derivational affixes and binding valency codes
Korean Lexical Database
Korean | POS | Romanization |
---|---|---|
가둥-거리다 | V | ka-tung-ko~-ri-ta |
가로놓이다 | V | ka-ro-noh-i-ta |
가리산지리산 | D | ka-ri-san-chi-ri-san |
가볍다 | AX | ka-pyo~p-ta |
가살-스럽다 | AX | ka-sal-su~-ro~p-ta |
가수분해 | NC | ka-su-pun-hae |
가시화-되다 | V | ka-si-hwa-toe-ta |
가져가다 | V | ka-chyo~-ka-ta |
가파르다 | AX | ka-p'a-ru~-ta |
간정되다 | V | kan-cho~ng-toe-ta |
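The sample rows above can be read into simple records for lookup or filtering. The following is a minimal sketch that assumes the pipe-delimited layout shown in the table; the actual KLD delivery format is not described here, and the `Entry` class and the `parse_entries` and `by_pos` helpers are hypothetical names introduced for this example.

```python
from __future__ import annotations

from dataclasses import dataclass

# A few rows copied from the sample table. The pipe-delimited layout is an
# assumption for illustration, not the documented KLD delivery format.
SAMPLE = """\
가둥-거리다 | V | ka-tung-ko~-ri-ta |
가볍다 | AX | ka-pyo~p-ta |
가수분해 | NC | ka-su-pun-hae |
가져가다 | V | ka-chyo~-ka-ta |
"""


@dataclass(frozen=True)
class Entry:
    headword: str      # Korean form, kept exactly as printed
    pos: str           # POS code as printed (V, AX, NC, D)
    romanization: str  # syllable-separated romanization as printed


def parse_entries(text: str) -> list[Entry]:
    """Parse pipe-delimited rows into Entry records, skipping malformed rows."""
    entries = []
    for line in text.splitlines():
        fields = [field.strip() for field in line.split("|")]
        fields = [field for field in fields if field]  # drop trailing-pipe empties
        if len(fields) != 3:
            continue
        entries.append(Entry(*fields))
    return entries


def by_pos(entries: list[Entry], pos: str) -> list[Entry]:
    """Return the entries whose POS code matches exactly."""
    return [entry for entry in entries if entry.pos == pos]


entries = parse_entries(SAMPLE)
verbs = by_pos(entries, "V")
print([entry.headword for entry in verbs])
```

Keeping the fields as printed, rather than normalizing the hyphens or the `o~` notation, avoids guessing at conventions that this page does not define.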
Practical Applications
KLD is especially suitable for applications in the following fields:
Morphological analysis
Machine translation
CJK input method editors
Speech technology
Related Resources

Chinese Lexical Database
Monolingual general vocabulary for NLP applications

Japanese Lexical Database
Monolingual general vocabulary for NLP applications

Korean Phonetic Database
Phonetic and phonemic transcriptions for core Korean vocabulary