WORLD'S LARGEST DATABASE OF ARAB NAMES
قاعدة بيانات الأسماء العربية
For Immediate Release
April 1 2008, Tokyo, Japan
The CJK Dictionary Institute, which specializes in the compilation of large-scale CJK and Arabic lexical resources, is pleased to announce the release of a major expansion of our comprehensive Database of Arab Names, referred to as DAN, which covers about one and a half million entries and over ten million potential variants.
DAN covers Arab personal names in both the roman and Arabic scripts and includes numerous orthographic variants and other attributes such as web frequency, name type codes and normalized forms. Based on authoritative linguistic resources, DAN is undergoing a major expansion and extensive proofreading by a team of Arabic native speakers, and expected to grow substantially in the coming months to cover all the major countries in the Middle East.
Key features of DAN
- Millions of Arabic and romanized name variants.
- Ideal for security and anti-money laundering applications.
- Enhances NLP applications such as MT and IR.
- Based on 15,000,000 names from authoritative resources.
- Proofread by native editors trained in Arabic phonology.
- Validated against the Arabic web and corpora.
- Full vocalization and various variants in Arabic script.
- Web-based frequency statistics for each name.
- Various romanization systems, such as the official IC standard.
DAN is playing an important role in helping software developers, especially of security applications and NLP tools, enhance their technology by enabling named entity recognition and extraction, machine translation (MT),variant normalization, and information retrieval (IR) of Arabic names.
|