Arabic Dialects Full-Form Lexicon

DiaLEX
Arabic Dialects Full-Form Lexicon

Covers all major Arabic dialects

Currently over 878 million entries

Ideal for NLP, including MT and speech

Overview

While Modern Standard Arabic is used as the official language of 22 Arab League nations, Arabs normally use one of the 30 or so modern dialects for communicating with family and friends. However, Arabic dialects don’t have a formal written language nor a standard orthography, resulting in a lack of applications and technologies that support them.

Our Arabic Dialects Full-Form Lexicon, or DiaLEX, has been developed to address this lack of support. DiaLEX is a comprehensive computational lexicon covering several major Arabic dialects and subdialects, including Egyptian, Emirati, Saudi Arabian Hejazi, Syrian, Lebanese, and Palestinian.

Based on ArabLEX, our full-form lexicon for Modern Standard Arabic, DiaLEX will cover all inflected, declined, and cliticized wordforms. It is ideally suited for morphological analysis, machine translation, and speech technology applications.

Distinctive Features

Arabic Dialects Full-Form Lexicon

* Select one of the tabs below.

ARABICLEMMABWTENSENPG
جِبْتْجَابْjibotoperfect indicativeS1C
جِبْتْجَابْjibotoperfect indicativeS2M
جِبْتِىجَابْjibotiYperfect indicativeS2F
جِبْتِيجَابْjibotiyperfect indicativeS2F
جَابْجَابْjaAboperfect indicativeS3M
جَابِتْجَابْjaAbitoperfect indicativeS3F
جِبْنَاجَابْjibonaAperfect indicativeP1C
جِبْتُواجَابْjibotuwAperfect indicativeP2C
جِبْتُوجَابْjibotuwperfect indicativeP2C
جَابُواجَابْjaAbuwAperfect indicativeP3C
جَابُوجَابْjaAbuwperfect indicativeP3C
اَجِيبْجَابْAajiyboimperfect subjunctiveS1C
أَجِيبْجَابْ>ajiyboimperfect subjunctiveS1C
تِجِيبْجَابْtijiyboimperfect subjunctiveS2M
تِجِيبِىجَابْtijiybiYimperfect subjunctiveS2F
تِجِيبِيجَابْtijiybiyimperfect subjunctiveS2F
يِجِيبْجَابْyijiyboimperfect subjunctiveS3M
تِجِيبْجَابْtijiyboimperfect subjunctiveS3F
نِجِيبْجَابْnijiyboimperfect subjunctiveP1C
تِجِيبُواجَابْtijiybuwAimperfect subjunctiveP2C
تِجِيبُوجَابْtijiybuwimperfect subjunctiveP2C
يِجِيبُواجَابْyijiybuwAimperfect subjunctiveP3C
يِجِيبُوجَابْyijiybuwimperfect subjunctiveP3C
بَجِيبْجَابْbajiyboimperfect indicativeS1C
بِتْجِيبْجَابْbitojiyboimperfect indicativeS2M
بِتْجِيبِىجَابْbitojiybiYimperfect indicativeS2F
بِتْجِيبِيجَابْbitojiybiyimperfect indicativeS2F
بِيْجِيبْجَابْbiyojiyboimperfect indicativeS3M
بِتْجِيبْجَابْbitojiyboimperfect indicativeS3F
بِنْجِيبْجَابْbinojiyboimperfect indicativeP1C
بِتْجِيبُواجَابْbitojiybuwAimperfect indicativeP2C
بِتْجِيبُوجَابْbitojiybuwimperfect indicativeP2C
بِيْجِيبُواجَابْbiyojiybuwAimperfect indicativeP3C
بِيْجِيبُوجَابْbiyojiybuwimperfect indicativeP3C
جِيبْجَابْjiyboimperativeS2M
جِيبِىجَابْjiybiYimperativeS2F
جِيبِيجَابْjiybiyimperativeS2F
جِيبُواجَابْjiybuwAimperativeP2C
جِيبُوجَابْjiybuwimperativeP2C
هَجِيبْجَابْhajiybosimple futureS1C
حَجِيبْجَابْHajiybosimple futureS1C
هَتْجِيبْجَابْhatojiybosimple futureS2M
حَتْجِيبْجَابْHatojiybosimple futureS2M
هَتْجِيبِىجَابْhatojiybiYsimple futureS2F
هَتْجِيبِيجَابْhatojiybiysimple futureS2F
حَتْجِيبِىجَابْHatojiybiYsimple futureS2F
حَتْجِيبِيجَابْHatojiybiysimple futureS2F
هَيْجِيبْجَابْhayojiybosimple futureS3M
حَيْجِيبْجَابْHayojiybosimple futureS3M
هَتْجِيبْجَابْhatojiybosimple futureS3F
حَتْجِيبْجَابْHatojiybosimple futureS3F
هَنْجِيبْجَابْhanojiybosimple futureP1C
حَنْجِيبْجَابْHanojiybosimple futureP1C
هَتْجِيبُواجَابْhatojiybuwAsimple futureP2C
هَتْجِيبُوجَابْhatojiybuwsimple futureP2C
حَتْجِيبُواجَابْHatojiybuwAsimple futureP2C
حَتْجِيبُوجَابْHatojiybuwsimple futureP2C
هَيْجِيبُواجَابْhayojiybuwAsimple futureP3C
هَيْجِيبُوجَابْhayojiybuwsimple futureP3C
حَيْجِيبُواجَابْHayojiybuwAsimple futureP3C
حَيْجِيبُوجَابْHayojiybuwsimple futureP3C

Practical Applications

CJKI’s full-form lexicons can bring the following benefits to various NLP applications:

Machine translation

Greatly enhanced translation quality

Morphological analysis

Significantly simplified algorithms

Pedagogical applications

Automatic conjugation systems

Named-entity recognition (NER)

Dramatically improved

Related Resources

ArabLEX

Arabic Full-Form Lexicon Includes all inflected, declined, and conjugated forms

APD: Arabic Phonetic Database

Phonemic transcriptions for core Arabic vocabulary

Palestinian Arabic Text-to-Speech System

A TTS system developed specifically for Palestinian Arabic