1 Introduction
2 Methods
2.1 Data
2.2 Name frequency calculations
3 Results
3.1 RQ1: How nationally distinctive are researchers’ first and last names by country?
Table 1. First and last name national uniqueness statistics for the 25 countries or regions with the highest average percentage of last name in the country (2001-2021). A “national” first or last name is used only by researchers in one country. A complete list of countries/regions is in the supplementary materi als (https://doi.org/10.6084/m9.figshare.21954467). |
Country/ region | Authors with a first name | % of national first names | Average % of first name in country | First name rank | Authors with a last name | % of national last names | Average % of last name in country | Last name rank |
---|---|---|---|---|---|---|---|---|
Thailand | 85,162 | 42% | 81% | 3 | 96,790 | 81% | 88% | 1 |
Japan | 1,059,927 | 5% | 87% | 1 | 1,217,638 | 8% | 87% | 2 |
Lithuania | 15,245 | 8% | 55% | 15 | 19,240 | 73% | 86% | 3 |
China | 4,488,356 | 3% | 85% | 2 | 4,988,411 | 1% | 83% | 4 |
Turkey | 217,560 | 6% | 75% | 4 | 255,292 | 26% | 80% | 5 |
Russian Fed. | 137,269 | 4% | 41% | 31 | 452,961 | 29% | 77% | 6 |
Iran | 277,009 | 4% | 66% | 7 | 350,691 | 23% | 76% | 7 |
Laos | 1,179 | 60% | 72% | 5 | 1,370 | 65% | 75% | 8 |
Kazakhstan | 18,780 | 24% | 59% | 14 | 27,596 | 57% | 74% | 9 |
Indonesia | 98,259 | 20% | 53% | 17 | 121,959 | 35% | 72% | 10 |
Greece | 71,132 | 5% | 50% | 19 | 101,776 | 38% | 72% | 11 |
Madagascar | 2,084 | 25% | 37% | 38 | 2,950 | 48% | 71% | 12 |
Poland | 163,794 | 3% | 63% | 10 | 199,191 | 33% | 71% | 13 |
Czech Republic | 61,820 | 3% | 39% | 34 | 89,460 | 43% | 70% | 14 |
India | 629,418 | 10% | 61% | 12 | 982,934 | 12% | 69% | 15 |
Georgia | 3,615 | 5% | 31% | 56 | 6,628 | 36% | 69% | 16 |
Latvia | 5,775 | 9% | 40% | 32 | 8,441 | 54% | 69% | 17 |
Finland | 65,723 | 3% | 47% | 21 | 81,530 | 26% | 67% | 18 |
Hungary | 47,033 | 8% | 62% | 11 | 59,104 | 24% | 65% | 19 |
Slovakia | 24,042 | 3% | 24% | 81 | 33,917 | 46% | 65% | 20 |
Mongolia | 2,335 | 38% | 65% | 9 | 3,101 | 37% | 65% | 21 |
Uganda | 8,543 | 9% | 14% | 120 | 10,160 | 37% | 64% | 22 |
Romania | 51,161 | 6% | 37% | 41 | 63,560 | 28% | 64% | 23 |
Nigeria | 47,983 | 15% | 44% | 26 | 78,489 | 25% | 63% | 24 |
South Korea | 514,026 | 5% | 69% | 6 | 556,771 | 1% | 63% | 25 |
Table 2. First and last name national uniqueness statistics for the 25 countries or regions with the lowest average percentage of last name in the country/region. A “national” first or last name is used only by researchers in one country. |
Country/region | Authors with a first name | % of national first names | Average % of first name in country | First name rank | Authors with a last name | % of national last names | Average % of last name in country | Last name rank |
---|---|---|---|---|---|---|---|---|
Virgin Islands (UK) | 26 | 0% | 2% | 191 | 36 | 8% | 11% | 176 |
Paraguay | 1,683 | 2% | 3% | 186 | 1,990 | 5% | 10% | 177 |
El Salvador | 814 | 1% | 2% | 192 | 947 | 5% | 10% | 178 |
Andorra | 106 | 3% | 6% | 168 | 130 | 4% | 10% | 179 |
Barbados | 357 | 3% | 6% | 170 | 436 | 6% | 10% | 180 |
New Zealand | 50,477 | 2% | 5% | 175 | 63,946 | 4% | 10% | 181 |
Guatemala | 1,968 | 2% | 4% | 182 | 2,199 | 6% | 10% | 182 |
Hong Kong, China | 53,289 | 1% | 8% | 157 | 66,369 | 1% | 10% | 183 |
Montserrat | 20 | 0% | 5% | 176 | 39 | 5% | 10% | 184 |
Costa Rica | 6,122 | 2% | 5% | 178 | 6,996 | 2% | 9% | 185 |
Falkland Islands | 69 | 0% | 0% | 199 | 88 | 6% | 9% | 186 |
Grenada | 551 | 6% | 8% | 152 | 640 | 4% | 9% | 187 |
Panama | 1,864 | 6% | 9% | 146 | 2,135 | 5% | 9% | 188 |
Virgin Islands (USA) | 157 | 6% | 8% | 149 | 202 | 3% | 9% | 189 |
Bahamas | 196 | 5% | 7% | 159 | 237 | 5% | 8% | 190 |
Cayman Islands | 96 | 0% | 1% | 197 | 120 | 3% | 8% | 191 |
Jamaica | 2,217 | 9% | 13% | 122 | 3,113 | 4% | 8% | 192 |
Bermuda | 144 | 1% | 2% | 194 | 187 | 4% | 8% | 193 |
Puerto Rico | 8,310 | 6% | 10% | 139 | 9,347 | 2% | 8% | 194 |
Dominican Republic | 1,198 | 6% | 9% | 141 | 1,355 | 3% | 7% | 195 |
Honduras | 1,175 | 3% | 5% | 173 | 1,345 | 3% | 6% | 196 |
Nicaragua | 961 | 3% | 5% | 174 | 1,133 | 3% | 6% | 197 |
Cape Verde | 247 | 8% | 11% | 133 | 279 | 3% | 6% | 198 |
Dominica | 82 | 6% | 10% | 136 | 101 | 2% | 5% | 199 |
Macao, China | 3,933 | 2% | 3% | 184 | 4,352 | 1% | 2% | 200 |
3.2 RQ2: Are first or last names the most unique for countries?
Figure 1. The percentage of national last names against the percentage of national first names by country (all authors of Scopus journal articles 2002-2021). |
Figure 2. Last name average percentage in country/region against first name average percentage in country/region (all authors of Scopus journal articles 2002-2021). |
3.3 RQ3: Which factors affect the uniqueness of researcher first and last names in a country?
Figure 3. Average percentage of first names in a country against the number of authors for 200 countries/regions in Scopus (the x axis scale is logarithmic). |
Figure 4. Average percentage of last names in a country/region against the number of authors for 200 countries/regions in Scopus (the x axis scale is logarithmic). |
3.4 RQ4: Do first and last names that occur disproportionately often in a country tend to be associated with that country, even for researchers in other countries?
Figure 5. The proportion of articles mentioning Greece and written by authors with an affiliation outside Greece against Greece’s share of the world’s authors with the author’s first name. Qualification: at least ten authors with the same first name. Percentage points without data have no names with at least 100 researchers affiliated outside Greece. The y co-ordinates are smoothed by a 3-point moving average (e.g., the 4% y coordinate is the average of the data from the 3%, 4%, and 5% points). Points are annotated with an illustrative first name in the correct percentage point. |
3.5 RQ5: From which countries/regions is it easiest to track diaspora researchers through first or last names?
3.5.1 Countries with national unimodal name distributions
Figure 6. The number of authors with an affiliation in Turkey against the national percentage for the name. Qualification: at least ten authors with the same name. Percentages without data have no names with at least 10 researchers. Points are annotated with an illustrative name at the correct percentage, but most columns represent multiple names. |
Figure 7. As |
3.5.2 Countries with bimodal name distributions
Figure 8. As |
3.5.3 Countries with unimodal name distributions of international names
Figure 9. As |
3.5.4 Countries with relatively uniform name distributions
Figure 10. As |
Figure 11. As |
4 Discussion
5 Conclusion
Figure S1. The proportion of author first names in a country against the percentage of each first name in that country (ten countries with the most authors with first names). |
Figure S2. The proportion of author first names in a country against the percentage of each first name in that country (ten countries with moderate numbers of authors with first names). |
Figure S3. The proportion of author last names in a country against the percentage of each last name in that country (ten countries with the most authors with last names). |
Figure S4. The proportion of author last names in a country against the percentage of each last name in that country (ten countries with moderate numbers of authors with last names). |