Arabic Full-Form Lexicon
Over 530 million entries
Exhaustive coverage of inflected forms
Ideal for NLP, AI and cybersecurity
Overview
CJKI is pleased to announce the release of the Arabic Full-Form Lexicon, or ArabLEX, the most comprehensive Arabic computational lexicon ever created, covering over 530 million entries.
Full-form means that it includes all inflected forms. It covers not only general vocabulary but also, for the first time, fully inflected proper nouns (personal and place names).
ArabLEX is, quite literally, the ultimate resource for Arabic NLP and AI. It is suited for such applications as machine translation, speech technology, deep learning, and cybersecurity. No other Arabic lexicon comes close to scope and comprehensiveness.
Distinctive Features
- Extremely comprehensive — over 530 million full form entries
- Exhaustively lists all inflected, declined, and cliticized wordforms
- Hundreds of millions of full form orthographic variants, including proper nouns
- Fully vocalized and Unvocalized Arabic
- Accurate phonemic and phonetic transcriptions (IPA)
- Rich in morphological, morphosyntactic and phonological attributes
- All wordforms are cross-referenced to their lemma
- Constantly maintained and quickly expanding
* Select one of the tabs below.
POS | Arab | Lemma | Gen | Num | Case | Per |
---|---|---|---|---|---|---|
N | kā́tibun | كَاتِبٌ | M | S | NOM | 000 |
N | kā́tibu | كَاتِبٌ | M | S | NOM | 000 |
N | kā́tibi̱ | كَاتِبٌ | M | S | NOM | 1SC |
N | kātíbuka | كَاتِبٌ | M | S | NOM | 2SM |
N | kātíbuki | كَاتِبٌ | M | S | NOM | 2SF |
N | kātíbuhu | كَاتِبٌ | M | S | NOM | 3SM |
N | kātíbuha̱ | كَاتِبٌ | M | S | NOM | 3SF |
N | kātíbuna̱ | كَاتِبٌ | M | S | NOM | 1PC |
N | kātíbukum | كَاتِبٌ | M | S | NOM | 2PM |
N | kātibukúnna | كَاتِبٌ | M | S | NOM | 2PF |
N | kātibúkuma̱ | كَاتِبٌ | M | S | NOM | 2DC |
N | kātíbuhum | كَاتِبٌ | M | S | NOM | 3PM |
N | kātibuhúnna | كَاتِبٌ | M | S | NOM | 3PF |
N | kātibúhuma̱ | كَاتِبٌ | M | S | NOM | 3DM |
N | kātibúhuma̱ | كَاتِبٌ | M | S | NOM | 3DF |
N | kā́tibin | كَاتِبٌ | M | S | GEN | 000 |
N | kā́tibi | كَاتِبٌ | M | S | GEN | 000 |
N | kā́tibi̱ | كَاتِبٌ | M | S | GEN | 1SC |
N | kātíbika | كَاتِبٌ | M | S | GEN | 2SM |
POS | ARab | Lemma | Lemma | Per |
---|---|---|---|---|
V | kátaba | كَتَبَ | kátaba | 3SM |
V | katábani̱ | كَتَبَ | kátaba | 3SM |
V | katábana̱ | كَتَبَ | kátaba | 3SM |
V | katábaka | كَتَبَ | kátaba | 3SM |
V | katábaki | كَتَبَ | kátaba | 3SM |
V | katábakum | كَتَبَ | kátaba | 3SM |
V | katabákuma̱ | كَتَبَ | kátaba | 3SM |
V | katábaha̱ | كَتَبَ | kátaba | 3SM |
V | katábahu | كَتَبَ | kátaba | 3SM |
V | katábahum | كَتَبَ | kátaba | 3SM |
V | katabáhuma̱ | كَتَبَ | kátaba | 3SM |
V | katabahúnna | كَتَبَ | kátaba | 3SM |
V | katabakúnna | كَتَبَ | kátaba | 3SM |
V | wakátaba | كَتَبَ | kátaba | 3SM |
V | wakatábani̱ | كَتَبَ | kátaba | 3SM |
V | wakatábana̱ | كَتَبَ | kátaba | 3SM |
V | wakatábaka | كَتَبَ | kátaba | 3SM |
V | wakatábaki | كَتَبَ | kátaba | 3SM |
V | wakatábakum | كَتَبَ | kátaba | 3SM |
V | wakatabákuma̱ | كَتَبَ | kátaba | 3SM |
V | wakatábaha̱ | كَتَبَ | kátaba | 3SM |
V | wakatábahu | كَتَبَ | kátaba | 3SM |
V | wakatábahum | كَتَبَ | kátaba | 3SM |
V | wakatabáhuma̱ | كَتَبَ | kátaba | 3SM |
V | wakatabahúnna | كَتَبَ | kátaba | 3SM |
V | wakatabakúnna | كَتَبَ | kátaba | 3SM |
V | fakátaba | كَتَبَ | kátaba | 3SM |
V | fakatábani̱ | كَتَبَ | kátaba | 3SM |
V | fakatábana̱ | كَتَبَ | kátaba | 3SM |
V | fakatábaka | كَتَبَ | kátaba | 3SM |
V | fakatábaki | كَتَبَ | kátaba | 3SM |
V | fakatábakum | كَتَبَ | kátaba | 3SM |
V | fakatabákuma̱ | كَتَبَ | kátaba | 3SM |
V | fakatábaha̱ | كَتَبَ | kátaba | 3SM |
V | fakatábahu | كَتَبَ | kátaba | 3SM |
V | fakatábahum | كَتَبَ | kátaba | 3SM |
V | fakatabáhuma̱ | كَتَبَ | kátaba | 3SM |
V | fakatabahúnna | كَتَبَ | kátaba | 3SM |
V | fakatabakúnna | كَتَبَ | kátaba | 3SM |
V | ʾakátaba | كَتَبَ | kátaba | 3SM |
V | ʾakatábani̱ | كَتَبَ | kátaba | 3SM |
V | ʾakatábana̱ | كَتَبَ | kátaba | 3SM |
V | ʾakatábaka | كَتَبَ | kátaba | 3SM |
V | ʾakatábaki | كَتَبَ | kátaba | 3SM |
V | ʾakatábakum | كَتَبَ | kátaba | 3SM |
V | ʾakatabákuma̱ | كَتَبَ | kátaba | 3SM |
V | ʾakatábaha̱ | كَتَبَ | kátaba | 3SM |
V | ʾakatábahu | كَتَبَ | kátaba | 3SM |
V | ʾakatábahum | كَتَبَ | kátaba | 3SM |
V | ʾakatabáhuma̱ | كَتَبَ | kátaba | 3SM |
V | ʾakatabahúnna | كَتَبَ | kátaba | 3SM |
V | ʾakatabakúnna | كَتَبَ | kátaba | 3SM |
V | ʾawakátaba | كَتَبَ | kátaba | 3SM |
V | ʾawakatábani̱ | كَتَبَ | kátaba | 3SM |
V | ʾawakatábana̱ | كَتَبَ | kátaba | 3SM |
V | ʾawakatábaka | كَتَبَ | kátaba | 3SM |
V | ʾawakatábaki | كَتَبَ | kátaba | 3SM |
V | ʾawakatábakum | كَتَبَ | kátaba | 3SM |
V | ʾawakatabákuma̱ | كَتَبَ | kátaba | 3SM |
V | ʾawakatábaha̱ | كَتَبَ | kátaba | 3SM |
V | ʾawakatábahu | كَتَبَ | kátaba | 3SM |
V | ʾawakatábahum | كَتَبَ | kátaba | 3SM |
V | ʾawakatabáhuma̱ | كَتَبَ | kátaba | 3SM |
V | ʾawakatabahúnna | كَتَبَ | kátaba | 3SM |
V | ʾawakatabakúnna | كَتَبَ | kátaba | 3SM |
V | ʾafakátaba | كَتَبَ | kátaba | 3SM |
V | ʾafakatábani̱ | كَتَبَ | kátaba | 3SM |
V | ʾafakatábana̱ | كَتَبَ | kátaba | 3SM |
V | ʾafakatábaka | كَتَبَ | kátaba | 3SM |
V | ʾafakatábaki | كَتَبَ | kátaba | 3SM |
V | ʾafakatábakum | كَتَبَ | kátaba | 3SM |
V | ʾafakatabákuma̱ | كَتَبَ | kátaba | 3SM |
V | ʾafakatábaha̱ | كَتَبَ | kátaba | 3SM |
V | ʾafakatábahu | كَتَبَ | kátaba | 3SM |
V | ʾafakatábahum | كَتَبَ | kátaba | 3SM |
V | ʾafakatabáhuma̱ | كَتَبَ | kátaba | 3SM |
V | ʾafakatabahúnna | كَتَبَ | kátaba | 3SM |
V | ʾafakatabakúnna | كَتَبَ | kátaba | 3SM |
V | kátaba̱ | كَتَبَ | kátaba | 3DM |
V | katabā́ni̱ | كَتَبَ | kátaba | 3DM |
V | katabā́na̱ | كَتَبَ | kátaba | 3DM |
V | katabā́ka | كَتَبَ | kátaba | 3DM |
V | katabā́ki | كَتَبَ | kátaba | 3DM |
V | katabā́kum | كَتَبَ | kátaba | 3DM |
V | katabā́kuma̱ | كَتَبَ | kátaba | 3DM |
V | katabā́ha̱ | كَتَبَ | kátaba | 3DM |
V | katabā́hu | كَتَبَ | kátaba | 3DM |
V | katabā́hum | كَتَبَ | kátaba | 3DM |
V | katabā́huma̱ | كَتَبَ | kátaba | 3DM |
V | katabāhúnna | كَتَبَ | kátaba | 3DM |
V | katabākúnna | كَتَبَ | kátaba | 3DM |
V | wakátaba̱ | كَتَبَ | kátaba | 3DM |
V | wakatabā́ni̱ | كَتَبَ | kátaba | 3DM |
V | wakatabā́na̱ | كَتَبَ | kátaba | 3DM |
V | wakatabā́ka | كَتَبَ | kátaba | 3DM |
V | wakatabā́ki | كَتَبَ | kátaba | 3DM |
V | wakatabā́kum | كَتَبَ | kátaba | 3DM |
V | wakatabā́kuma̱ | كَتَبَ | kátaba | 3DM |
Practical Applications
CJKI’s full-form lexicons can bring the following benefits to various NLP applications:
Machine translation
Greatly enhanced translation quality
Morphological analysis
Significantly simplified algorithms
Pedagogical applications
Automatic conjugation systems
Named-entity recognition (NER)
Dramatically improved
Reference Documents
Related Resources

Japanese Full-Form Lexicon
Includes all inflected, declined and conjugated forms

Spanish Full-Form Lexicon
Includes all inflected, declined and conjugated forms

Arabic Wordlist
General vocabulary, proper nouns and technical terms