Japanese Full-Form Lexicon
Simplifies morphological analysis
Instantly identifies inflected forms
Comprehensive coverage, especially verbs
Overview
CJKI provides a Japanese Full-Form Lexicon (JFULEX) that covers roughly 120 million entries, including canonical forms, inflected forms, and compound words. This lexicon is being used by major IT companies like Amazon and Google to enhance their search technology.
The Japanese language is agglutinative; that is, it builds words by combining basic elements called morphemes, yielding countless inflected forms, compound words, and affixed words. For example, the compound 造船所 zōsenjo ‘shipyard’ consists of the free word 造船 ‘shipbuilding’ (造 ‘make; build’ + 船 ‘ship’) followed by the suffix 所 ‘place’.
Japanese also has many derived words (morpheme + grammatical suffix); for example, combining 黒 kuro ‘black’ with the suffix い i forms the adjective 黒い kuroi ‘black’. Derivation should not be confused with inflection, which adds word endings to indicate grammatical functions such as tense. For example, the last syllable of the verb 帰る kaeru ‘to return’ is inflected to yield 帰れ kaere, the imperative. Japanese verbs have thousands of inflected forms.
If proper nouns, technical terms, and verb-following expressions (such as なければならない nakerebanaranai) are included, the total can exceed 120 million.
tazuneru（たずねる） POS=V1
| Tense | Stem | Kana | Kanji | Inflected | Roman |
|---|---|---|---|---|---|
| Past | たずね | S + ました | - | たずねました | TAZUNEmashita |
| Past | たずね | S + て い ました | S + て 居 ました | たずねて いました | TAZUNEte imashita |
| Past | たずね | S + て おり ました | S + て 居り ました | たずねて おりました | TAZUNEte orimashita |
| Past | たずね | S + やした | - | たずねやした | TAZUNEyashita |
| Past | たずね | S + て い やした | S + て 居 やした | たずねて いやした | TAZUNEte iyashita |
| Past | たずね | S + て おり やした | S + て 居り やした | たずねて おりやした | TAZUNEte oriyashita |
| Past -tara I | たずね | S + ましたら | - | たずねましたら | TAZUNEmashitara |
| Past -tara I | たずね | お + S + して おり ましたら | 御 + S + 為て 居り ましたら | おたずねして おりましたら | oTAZUNE shite orimashitara |
| Past -tara I | たずね | S + やしたら | - | たずねやしたら | TAZUNEyashitara |
| Past -tara I | たずね | お + S + して おり やしたら | 御 + S + 為て 居り やしたら | おたずねして おりやしたら | oTAZUNE shite oriyashitara |
| Past -tara II | たずね | S + ましたらば | - | たずねましたらば | TAZUNEmashitaraba |
| Past -tara II | たずね | お + S + して おり ましたらば | 御 + S + 為て 居り ましたらば | おたずね して おりましたらば | oTAZUNE shite orimashitaraba |
| Past -tara II | たずね | S + やしたらば | - | たずねやしたらば | TAZUNEyashitaraba |
| Past -tara II | たずね | お + S + して おり やしたらば | 御 + S + 為て 居り やしたらば | おたずね して おりやしたらば | oTAZUNE shite oriyashitaraba |
| Past causative | たずね | S + させ ました | - | たずねさせました | TAZUNEsasemashita |
| Past causative | たずね | S + させ やした | - | たずねさせやした | TAZUNEsaseyashita |
| Past causative honorific | たずね | S + させ られ ました | - | たずねさせられました | TAZUNEsaseraremashita |
| Past causative honorific | たずね | S + させ られ て い ました | S + させ られ て 居 ました | たずねさせられて いました | TAZUNEsaserarete imashita |
| Past causative honorific | たずね | S + させ られ やした | - | たずねさせられやした | TAZUNEsaserareyashita |
| Past causative honorific | たずね | S + させ られ て い やした | S + させ られ て 居 やした | たずねさせられて いやした | TAZUNEsaserarete iyashita |
| Past causative passive | たずね | S + させ られ ました | - | たずねさせられました | TAZUNEsaseraremashita |
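The Kana column of the table reads as a pattern in which S stands for the stem. As a rough illustration of how such patterns expand into full forms, here is a minimal Python sketch. The `expand` function and the `PATTERNS` mapping are hypothetical names introduced for this example and are not part of JFULEX. The sketch joins all pieces without spaces, so it does not reproduce the word spacing shown in the Inflected column.

```python
def expand(stem: str, pattern: str) -> str:
    """Expand a pattern such as 'S + ました' into an unspaced surface form.

    'S' is replaced by the stem; '+' signs and spaces are treated only as
    separators and are dropped from the result.
    """
    tokens = pattern.replace("+", " ").split()
    return "".join(stem if token == "S" else token for token in tokens)


# Stem and a few patterns copied from the table above (illustrative subset).
STEM = "たずね"
PATTERNS = {
    "Past": "S + ました",
    "Past -tara I": "S + ましたら",
    "Past -tara II": "S + ましたらば",
    "Past causative": "S + させ ました",
}

# Map each label to its expanded form.
FORMS = {label: expand(STEM, pattern) for label, pattern in PATTERNS.items()}
```

For instance, `expand("たずね", "S + て い ました")` gives `たずねていました`, which is the table's たずねて いました with the space removed.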
Practical Applications
CJKI’s full-form lexicons can bring the following benefits to various NLP applications:
Machine translation
Greatly enhanced translation quality
Named-entity recognition (NER)
Dramatically improved recognition
Morphological analysis
Significantly simplified algorithms
Information retrieval applications
Support for query processing
Pedagogical applications
Automatic conjugation systems
Part-of-speech (POS) analysis and tagging
Automatic conjugation systems
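To illustrate why a full-form lexicon can simplify morphological analysis and query processing, the following minimal Python sketch treats analysis as a plain table lookup: each inflected surface form maps directly to its canonical form and a label, with no stemming rules. The record layout, `build_index`, and `analyze` are assumptions made for this example and do not reflect the actual JFULEX data format. The sample entries are taken from the tazuneru table above.

```python
from collections import defaultdict

# (surface form, canonical form, label) records; layout is hypothetical.
ENTRIES = [
    ("たずねました", "たずねる", "Past"),
    ("たずねやした", "たずねる", "Past"),
    ("たずねましたら", "たずねる", "Past -tara I"),
    ("たずねさせました", "たずねる", "Past causative"),
    ("たずねさせられました", "たずねる", "Past causative honorific"),
    ("たずねさせられました", "たずねる", "Past causative passive"),
]


def build_index(entries):
    """Group analyses by surface form so ambiguous forms keep every reading."""
    index = defaultdict(list)
    for surface, lemma, label in entries:
        index[surface].append((lemma, label))
    return dict(index)


INDEX = build_index(ENTRIES)


def analyze(surface: str):
    """Return all (lemma, label) analyses for a surface form, or an empty list."""
    return INDEX.get(surface, [])
```

Storing a list per surface form matters because one string can carry more than one reading: in the table, たずねさせられました appears under both the causative honorific and the causative passive, so `analyze` returns two analyses for it.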
JFULEX Related Resources

Arabic Full-Form Lexicon
Includes all inflected, declined, and conjugated forms

Spanish Full-Form Lexicon
Includes all inflected, declined, and conjugated forms

Comprehensive Japanese Wordlist
General vocabulary, proper nouns and technical terms