Korean Lexical Database

Covers nearly 100,000 entries

Morphological attributes such as POS codes

Phonetic attributes such as romanization and IPA

Overview

The CJKI Korean Lexical Database (KLD) is a monolingual lexical database of Korean developed by CJKI’s Korean editors. It contains approximately 97,000 entries covering general vocabulary, both free forms, and bound forms, and includes a rich set of grammatical and phonetic attributes, as well as hanja when applicable.

KLD includes a significant number of affixes, particles, auxiliaries, and conjugation pattern codes to account for all the inflectional and derivational morphology in Korean so as to enable recognition of inflected forms.

Main Features

Phonological information

Such as romanized forms and IPA transcriptions

Semantic classification codes

Such as type of proper noun

Grammatical information

Such as detailed part-of-speech codes

Morphological information

Derivational affixes and binding valency codes

Korean Lexical Database

Practical Applications

KLD is especially suitable for applications in the field of:

Morphological analysis

Machine translation

CJK input method editors

Speech technology

Related Resources

CLD

Chinese Lexical Database

Monolingual general vocabulary for NLP applications

JLD

Japanese Lexical Database

Monolingual general vocabulary for NLP applications

KPD

Korean Phonetic Database

Phonetic and phonemic transcriptions for core Korean vocabulary