WORLD'S LARGEST DATABASE OF ARAB NAMES
قاعدة بيانات الأسماء العربية
For Immediate Release
December 1 2008, Tokyo, Japan
The CJK Dictionary Institute, which specializes in the compilation of large-scale CJK and Arabic lexical resources, is pleased to announce the release of a major expansion of our comprehensive Database of Arab Names, referred to as DAN, which now covers about 2.4 million entries based on over 20 million source variants.
DAN covers Arab personal names in both the roman and Arabic scripts and includes numerous orthographic variants and other attributes such as web frequency, name type codes and normalized forms. Based on authoritative linguistic resources, DAN is undergoing a major expansion and extensive proofreading by a team of Arabic native speakers, and expected to grow substantially in the coming months to cover all the major countries in the Middle East.
Key features of DAN
- 2.4 million validated Arabic name variants.
- Ideal for security and anti-money laundering, and NLP.
- Based on over 20,000,000 source names from authoritative resources.
- Proofread by native editors trained in Arabic phonology.
- Validated against the web and corpora.
- Fully vocalized with various variants in Arabic script.
- Web-based frequency statistics for each name.
- Various romanization systems, such as the official IC standard.
- Fully supports OFAC names, their official aliases and unofficial variants.
DAN is playing an important role in helping software developers, especially of security applications and NLP tools, enhance their technology by enabling named entity recognition and extraction, machine translation (MT),variant normalization, and information retrieval (IR) of Arabic names.
|