CJKI Data

Lexical Resources

The CJK Dictionary Institute (CJKI) specializes in the creation and continuous expansion of comprehensive lexical and dictionary databases for Chinese, Japanese, Korean (CJK) and Arabic. These databases contain over 50 million entries for general vocabulary, proper nouns, and technical terms, and include a rich set of grammatical, phonological and semantic attributes.

CJKI has become one of the world’s prime resources for CJK and Arabic (CJKA) lexical resources, and is contributing to CJKA natural language processing technology, including machine translation, speech technology and entity recognition, by providing high-quality lexical resources to many of the world’s leading IT companies, especially in Japan, China and the US.

・CJKI’s data resources can be quickly located by language below.

・More information about CJKI here. Relevant linguistic and technical documents are collected here (old website).

・Details on licensing data and our business model can be found here.