Wednesday, June 9, 2010

Mirror, mirror, on the wall, who's the Whitest of them all?


This post will attempt to rank 13 European countries' relative proportions of non-euro admixture/affinity. I will be using a study which included data on those 13 European countries. Unfortunately not all major countries in Europe were included in the study. Italy, Ireland, Portugal, and Greece were not included and neither were any Yugoslavian countries.

The study compares the 13 European countries with each other together with 4 other geographically non-European countries: Nigeria (Ibidan Yoruba, denoted YRI), USA Whites (Utah, self-identified northwest European ancestry, denoted CEU), China (Beijing, denoted CHB), and Japan (Tokyo, denoted JPT). I use the data relating the 13 European countries + CEU against CHB as a proxy for the mongoloid racial type and YRI as a proxy for the negroid racial type. I will use two data sources from the study - Fst statistics and Principle Component Analysis (PCA). Note that the data provided in the study is based on samples (groups of people) from each of the countries and therefore only directly compares these samples. The samples are not perfectly representative of the genetics of each of the countries but can still serve as accurate proxies (with some caveates). All the data used can be freely accessed at:
http://www.nature.com/ejhg/journal/v16/n12/fig_tab/ejhg2008210ft.html

First we look at what PCA can tell us by comparing countries' positions along the first PC in Figure 2. Below is a brief explanation of what PCA is followed by a rough ranking that I constructed based on the data in the figure. Feel free to draw your own conclusions from the figure.

PCA decomposes multi-dimensional data into maximally informative components. The first Principle Component (PC) captures as much of the total variance in a single linear dimension as possible. The second PC captures as much of the remaining variance (variance which isn’t correlated with the first PC) in a single linear dimension as possible. Similarly, all remaining PCs (3, 4, …) each account for as much of the remaining variance (variance which isn’t correlated with any of the previous PCs) as possible. The data from the study covers approximately 10% of the human genome. That is, the same corresponding 10% of each sampled person’s genome is examined by the study. Each dot/symbol in the PCA plots represents an individual as in a cartesian plot/plane.

Legend:
UK United Kingdom of Great Britain and Northern Ireland
No Norway
Si Slovakia
Ge Germany
Fr France
Po Poland
Be Belgium
Ru Russia
Ro Romania
Cz Czech Republic
Sw Sweden
Sp Spain
Hu Hungary
CEU Utah, USA

Ranking by most Caucasoid member (PC 1)
1/2. UK
1/2. No
3. Ge
4. Fr
5. Po
6/7. Cz
6/7. Ru
8/9. Be
8/9. Sw
10. CEU
11. Si
12/13/14. ?

Ranking by second most Caucasoid member (PC 1)
1. UK
2. Ge
3. Fr
4. Po
5. No
6. Sw
7. Be
8. Ru
9. CEU
10/11/12/13/14. ?

Ranking by most Caucasoid estimated median (PC 1)
1. UK
2/3. Po
2/3. No
4. Sw
5. Be
6. CEU
7. Ge?
8. Fr
9. Cz
10. Si
11/12. Ru
11/12. Hu
13. Sp
14. Ro

Next we will see the results obtained from an analysis of the Fst data provided in table 1. The Fst is a measure of genetic distance between two samples which compares the average within sample variability to the variability of the aggregation/combination of the two samples. The larger the Fst the greater the between sample/population variance relative to the within population variance. In general the Fst is nonlinear (further information will be provided upon request) but for the purposes of this analysis the Fst is probably approximately linear and will be treated as such. First, the 13 euros + CEU are ranked (by me) from least to greatest raw (unadjusted) affinity to YRI and CHB.

YRI (negroid) Raw Affinity (relative to Norway)
1. No 0.0000
2. Sw 0.0007
3. P0 0.0011
4. UK 0.0018
5. CEU 0.0021
6. Ru 0.0027
7. Cz 0.0028
8. Ge 0.0029
9. Si 0.0033
10. Be 0.0035
11. Fr 0.0038
12. Hu 0.0041
13. Ro 0.0068
14. Sp 0.0071

CHB (mongoloid) Raw Affinity (also relative to Norway)
1/2. UK -0.0015
1/2. Sp -0.0015
3. CEU -0.0014
4. Fr -0.0013
5. Be -0.0012
6. Po -0.0005
7. Ge -0.0004
8. No 0.0000
9. Cz 0.0001
10. Sw 0.0008
11. Si 0.0012
12. Hu 0.0023
13. Ro 0.0034
14. Ru 0.0045

Simple sum of Mongoloid and Negroid affinities
1. No 0.0000
2. UK 0.0003
3. Po 0.0006
4. CEU 0.0007
5. Sw 0.0015
6. Be 0.0023
7/8. Ge 0.0025
7/8. Fr 0.0025
9. Cz 0.0029
10. Si 0.0045
11. Sp 0.0056
12. Hu 0.0064
13. Ru 0.0072
14. Ro 0.0113

There will be more to come later.

No comments:

Post a Comment