Introduction
The I-L38 tree is growing and new testers are constantly being added. In the past, the standard test for Y-DNA at FTDNA was the STR test, with 12, 25, 37, 67 and 111 markers, and the Y-haplogroup was and is only predicted with I-M170. Other suppliers also had and still have panels with a different number of markers, e.g. the PowerPlex® Y23 or the Y-STR panels from YSEQ. Even Ancestry.com initially offered Y-STR tests that could be uploaded to FTDNA. With adequate Y-STRs and some experience, you can predict the Y-haplogroup a little deeper than with FTDNA (I-M170) and for a while it was common to verify this with single SNP tests or SNP packs. Nowadays you can choose the same procedure, but in my opinion it is more reasonable to buy an NGS (Next Generation Sequencing) test, which is now affordable. At FTDNA this is called Big Y700, with other providers you can do a WGS and upload the relevant data to YFull.com.
For my research, but also for the FigUre tree, I have collected our I-L38 cousins from all possible databases and want to show a few statistics here. Sometimes you have to add apples and oranges together, as it is not easy to summarise data with different levels of information, but you still get a small overview.
Source of Data
FTDNA
Familytreedna.com has the largest database for Y tests, both for Y-STR tests and for NGS tests (Big Y 700 and Big Y 500), so most of the data is from there. In the tables of the numerous publicly accessible projects, you can search for the I-L38 kits using the predicted Y haplogroup I-M170 or the verified SNPs and the Y-STR markers and, with the experience I have been able to gather so far, classify them more or less specifically in the respective subgroups.
The Big Y block tree contains the tests in which the Y haplogroups are verified by SNPs. The majority of these are now Big Y tests.
For some months now, the autosomal test Family Finder from FTDNA has also included a Y haplogroup. This is only available at the levels I-L38, I-S2606, I-L533 and I-S27697 for subgroups of BY14072 and I-BY1183, as the only subgroup of I-S2606. This makes some testers with I-L38 and these subgroups visible, but due to the lack of Y-STR markers, further categorisation into subgroups is not possible for these.
I hope that many testers will take advantage of the current Early Bird Sale at FTDNA to upgrade their test to a Big Y700.
YFull
At YFull.com private testers can upload their NGS data. In addition to the BigY from FTDNA, it also contains uploaded tests from other providers, as well as tests from scientific studies in which living people were tested. Although data from ancient DNA samples are added to the database, they are not counted in these statistics.
More Data
- Serbian DNA Project, Y-STR Data (PowerPlex® Y23)
- Y-STR Data and low level Y-SNP Data from scientific studies that have not been uploaded to YFull.
- YSEQ, Y-STR Data
Breakdown by subgroups
So far, I have found around 3800 tests with the Y haplogroup I-L38 in the various databases. On the one hand the NGS tests (e.g. Big Y) with verified terminal SNPs and on the other hand the Y-STR tests, in which the sub-branches were only predicted. In about 1700 tests I could not predict any subgroup. Most of these are tests that only have the Family Finder and the Y-haplogroup indicated by it. Another smaller part are Y-STR tests where a more precise categorisation by STR markers is not possible because the Y-STRs are not visible (tester in no project).
For a simple overview, I have used the “main subgroups”, which we are still familiar with from the good old Y-STR grouping. Note that some of these form groups together. (S2606, FGC29656, Y13076)
660 tests were verified by NGS (Big Y and other NGS tests). This is an average of around 18% of the total tests. The proportion of NGS tests is highest in group S2488 and lowest in group BY1183. The following graph also shows the relationship between FTDNA-Big Y and other NGS. This diagram clearly shows that the FTDNA Family Finder was unable to determine the subgroups of the tests in more detail.
- The tests with Family Finder result I-BY1183 are listed in the correct group.
- The tests with Family Finder result I-S2606 belong in the subgroups I-S24121, I-PH1237 and I-S2488.
- The tests with Family Finder result I-L38 belong in all subgroups.
Breakdown by Country of Origin
For all databases, it is possible to enter details for the earliest known paternal ancestor. This information is voluntary and is more or less filled in by the participants themselves. Unfortunately, the number of tests without information is very high at approx. 50%. In addition, there is a relatively high number of testers who cannot clearly assign their origin to a European country, so that there are numerous entries such as United Kingdom (instead of e.g. England, Wales) or United States and other American locations.
At this point, I have found and added this information to some kits by hand and research to the best of my knowledge and belief.
The majority of the Y-haplogroup I-L38 comes from England, Scotland and the rest of the United Kingdom and Ireland. This means that almost two thirds come from the islands and only around a third from mainland Europe. On the mainland, our Y-Haplogroup is most strongly represented in Germany, Switzerland and France.
I am a big fan of NGS tests, so I am very interested in the percentage of testers who have taken an NGS (like Big Y). That’s why there is not only a graph for the NGS tests per subgroup, but also for the NGS tests per country.
Another interesting illustration is the breakdown of the subgroups by country.
Conclusion
When creating the FigUre tree, I noticed that unfortunately the information for Country of Origin was missing in many tests, which made the tree look a bit sad, as there were many points with Unknown Origin. This was similar when compiling these statistics. Therefore I would like to ask all testers to enter their “Country of Origin” and if not already done to join the I-L38 Project at FamilytreeDNA.
Please note: Below the screenshots there are links to the actual interactive diagrams. These are updated regularly, so that very soon the screenshots will no longer be up-to-date. Please take a look.