Chinese Name Dialectical Variants

Chinese Name Dialectical Variants

Millions of entries

Covers four major Chinese dialects and their subdialects

Attributes gender and romanization codes

Overview

To complement our Database of Chinese Personal Name Variants, which focuses on Mandarin, CJKI maintains a separate Database of Chinese Name Dialectical Variants (CDV) which covers millions of romanized variants for each of the following Chinese dialects and subdialects:

– Cantonese, covering formal and many informal inpfrmal and popular romanizations
– Hokkien, or Southern Min, covering especially the Taiwanese and Xiamen/Amoy subdialects
– Hakka, covering various subdialects, especially Sixian and Hailu
– Hainanese, a subdialect of Southern Min, covering especially the Haikou and Wenchang subdialects

CDV includes also includes classification codes, frequency of occurrence statistics, and gender codes.

Chinese Name Dialectical Variants: Hokkien
STTypeTCRomanR-TypeID
AMStsi3ngT00979900-01
AMSche3ngT00979900-02
AMSzi3ngT00979900-03
AMSze3ngT00979900-04
R1MSChingTX00979900-05
RMSTsingT00979900-06
RMSChengT00979900-07
RMSZingT00979900-08
RMSZengT00979900-09
AMSti1ngTX01083594-01
AMSte1ngT01083594-02
AMSdi1ngT01083594-03
RMSTingTX01083594-04
RMSTengT01083594-05
RMSDingT01083594-06
AMSti7ngTX01098779-01
AXMStia7*TX01098779-02
AMSdi7ngTX01098779-03
AXMSdia7*TX01098779-04
AMSti3ngX01098779-05
AXMStia3*X01098779-06
AMSdi3ngX01098779-07
AXMSdia3*X01098779-08
AMSte7ngT01098779-09
RMSTingTX01098779-10
RXMSTiaTX01098779-11
RMSDingTX01098779-12
RXMSDiaTX01098779-13
RMSTengT01098779-14
AM政定tsi3ng-ti7ngT00980714-01
AM政定tsi3ng-di7ngT00980714-02
AM政定che3ng-ti7ngT00980714-03
AM政定che3ng-di7ngT00980714-04
AM政定che3ng-te7ngT00980714-05
AM政定zi3ng-ti7ngT00980714-06
AM政定zi3ng-di7ngT00980714-07
AM政定zi3ng-te7ngT00980714-08
AM政定ze3ng-ti7ngT00980714-09
AM政定ze3ng-di7ngT00980714-10
AM政定ze3ng-te7ngT00980714-11
R1M政定ChingtingTX00980714-12
R1M政定ChingdingTX00980714-13
RM政定TsingtingT00980714-14
RM政定TsingdingT00980714-15
RM政定ChengtingT00980714-16
RM政定ChengdingT00980714-17
RM政定ChengtengT00980714-18
RM政定ZingtingT00980714-19
RM政定ZingdingT00980714-20
RM政定ZingtengT00980714-21
RM政定ZengtingT00980714-22
RM政定ZengdingT00980714-23
RM政定ZengtengT00980714-24
AM政丁tsi3ng-ti1ngT00980701-01
AM政丁tsi3ng-te1ngT00980701-02
AM政丁tsi3ng-di1ngT00980701-03
AM政丁che3ng-ti1ngT00980701-04
AM政丁che3ng-te1ngT00980701-05
AM政丁che3ng-di1ngT00980701-06
AM政丁zi3ng-ti1ngT00980701-07
AM政丁zi3ng-te1ngT00980701-08
AM政丁zi3ng-di1ngT00980701-09
AM政丁ze3ng-ti1ngT00980701-10
AM政丁ze3ng-te1ngT00980701-11
AM政丁ze3ng-di1ngT00980701-12
R1M政丁ChingtingTX00980701-13
RM政丁TsingtingT00980701-14
R1M政丁ChingdingXT00980701-15
R1M政丁ChingtengT00980701-16
RM政丁TsingtengT00980701-17
RM政丁TsingdingT00980701-18
RM政丁ChengtingT00980701-19
RM政丁ChengtengT00980701-20
RM政丁ChengdingT00980701-21
RM政丁ZingtingT00980701-22
RM政丁ZingtengT00980701-23
RM政丁ZingdingT00980701-24
RM政丁ZengtingT00980701-25
RM政丁ZengtengT00980701-26
RM政丁ZengdingT00980701-27
RSTanNAN-TW101092836-01
RSDanNAN-TW101092836-02
RXSTinNAN-TW01092836-03
RXSDinNAN-TW01092836-04

Practical Applications

CDV is used for identifying, processing and normalizing names and their numerous romanized variants and is indispensable in a variety of applications, including:

Improving accuracy of machine translation

Segmentation and morphological analysis

Immigration control systems

Security applications

identifying suspected name variants of criminals

Query processing by search engines

Named-entity recognition

Database cleansing and normalization

Anti-money laundering (AML)

fraud detection by financial institutions

Related Resources

Chinese-English Personal Names

Chinese-English database of CJK and Western personal names

Chinese-Japanese Personal Names

Chinese-Japanese database of CJK and Western personal names

Japanese Personal Name Variants

Japanese personal names and their romanized variants

Reference Documents

Coming soon