Arabic Full-Form Lexicon

Over 530 million entries

Exhaustive coverage of inflected forms

Ideal for NLP, AI and cybersecurity

Overview

CJKI is pleased to announce the release of the Arabic Full-Form Lexicon, or ArabLEX, the most comprehensive Arabic computational lexicon ever created, covering over 530 million entries.

Full-form means that it includes all inflected forms. It covers not only general vocabulary but also, for the first time, fully inflected proper nouns (personal and place names).

ArabLEX is, quite literally, the ultimate resource for Arabic NLP and AI. It is suited for such applications as machine translation, speech technology, deep learning, and cybersecurity. No other Arabic lexicon comes close to scope and comprehensiveness.

Distinctive Features

* Select one of the tabs below.

POSArabLemmaGenNumCasePer
Nkā́tibunكَاتِبٌMSNOM000
Nkā́tibuكَاتِبٌMSNOM000
Nkā́tibi̱كَاتِبٌMSNOM1SC
NkātíbukaكَاتِبٌMSNOM2SM
NkātíbukiكَاتِبٌMSNOM2SF
NkātíbuhuكَاتِبٌMSNOM3SM
Nkātíbuha̱كَاتِبٌMSNOM3SF
Nkātíbuna̱كَاتِبٌMSNOM1PC
NkātíbukumكَاتِبٌMSNOM2PM
NkātibukúnnaكَاتِبٌMSNOM2PF
Nkātibúkuma̱كَاتِبٌMSNOM2DC
NkātíbuhumكَاتِبٌMSNOM3PM
NkātibuhúnnaكَاتِبٌMSNOM3PF
Nkātibúhuma̱كَاتِبٌMSNOM3DM
Nkātibúhuma̱كَاتِبٌMSNOM3DF
Nkā́tibinكَاتِبٌMSGEN000
Nkā́tibiكَاتِبٌMSGEN000
Nkā́tibi̱كَاتِبٌMSGEN1SC
NkātíbikaكَاتِبٌMSGEN2SM

Practical Applications

CJKI’s full-form lexicons can bring the following benefits to various NLP applications:

Machine translation

Greatly enhanced translation quality

Morphological analysis

Significantly simplified algorithms

Pedagogical applications

Automatic conjugation systems

Named-entity recognition (NER)

Dramatically improved

Related Resources

JFULEX

Japanese Full-Form Lexicon

Includes all inflected, declined and conjugated forms

Spanish Full-Form Lexicon

Includes all inflected, declined and conjugated forms

AWL

Arabic Wordlist

General vocabulary, proper nouns and technical terms