CJKI Data

Lexical Resources

CJKI’s large-scale databases currently have over 50 million entries and are continuously being expanded. They cover general vocabulary, proper nouns, and technical terms, and include a rich set of grammatical, phonological, syntactic, and semantic attributes.

CJKI is one of the world’s prime sources for CJK and Arabic dictionaries and lexical resources. We contribute to natural language processing (NLP) technology, including machine translation, speech technology and named entity recognition, by providing high-quality lexical resources to many of the world’s leading IT companies, especially in Japan, China and the US.