Login to participate
  
Register   Lost ID/password?
Louis Kessler’s Behold Blog » Blog Entry           prev Prev   Next next

Combine Kits into One Superkit on GEDmatch Genesis - Sat, 6 Apr 2019

Today GEDmatch Genesis added a new Tier 1 application. They state:

image

I did that myself manually with 5 kits about 6 months ago, uploaded my combined raw data to GEDmatch Genesis, and reported the results in my post: The Benefits of Combining Your DNA Raw Data.

I thought I’d try the new GEDmatch Genesis application to see if it produces essentially the same result.

I selected the Tier 1 “Combine mupltiple kits into 1 superkit” application and it gave the the option to select up to 4 kits that are already uploaded. I had all my 5 kits uploaded and I selected FTDNA, 23andMe, Ancestry and LivingDNA. I left out MyHeritage which uses includes almost the same SNPs as my FTDNA file does.

image

I pressed the “Generate” button and within a second, I got my combined kit:

image

Comparing my kits using the GEDmatch Diagnostic utility gives:

image

When I manually combined the kits, I got 1,389,750 SNPs, but GEDmatch only combines the 1,123,247 SNPs it wants to combine that it knows it is going to use. Slimmed SNPs are what GEDmatch actually uses for comparisons with other kits. I’m surprised that GEDmatch’s 834,457 slimmed SNPs are over 20,000 more than my manually combined kits. I have no explanation for that.

I’ve included my Whole Genome kit from Dante, that GEDmatch only loads the SNPs in the VCF file. Those SNPs are the ones where I am different from the human reference genome. The SNPs where I am the same as the human reference genome are not included. The GEDmatch people still have to fix the upload of VCF files so that human reference genomes are added when the SNP is not included in the file.

The one to one comparison was possible immediately, so I compared the GEDmatch combined kit to each of my individual kits, and to my manually created All-5 kit.

image

All of the comparisons indicate that I match myself at least 99.210%. It’s not important that there are some small breaks in the matching segments which results in more than 22 shared segments. I expect that when the one-to-many comparisons become available, the overlaps will improve just as they did with my manually combined file.



The Bottom Line

If you’ve tested with multiple companies and you subscribe to Tier 1, you should combine your kits to get better comparisons at GEDmatch Genesis. Make sure you make this combined kit the kit for yourself that you use for matching, and change all the others to Research so that you show up only once in other people’s match list.

The only unfortunate thing is that you don’t have access to your raw data at GEDmatch. So you won’t know exactly what they did and you won’t have the raw data for yourself to look at or use for other purposes.

4 Comments           comments Leave a Comment

1. spmcdonald (spmcdonald)
United States flag
Joined: Sun, 5 May 2019
1 blog comment, 0 forum posts
Posted: Sun, 5 May 2019  Permalink

Could you combine kits from siblings, e.g., uncles, with the same parents to get a “super uncle” kit?

2. Louis Kessler (lkessler)
Canada flag
Joined: Sun, 9 Mar 2003
287 blog comments, 245 forum posts
Posted: Mon, 6 May 2019  Permalink

spmcdonald: GEDmatch does not have a tool for that.

If you have 3 or more uncles or aunts who are all siblings, then you should use Visual Phasing to identify all their grandparents: https://thegeneticgenealogist.com/2016/11/21/visual-phasing-an-example-part-1-of-5/

If you have the raw data of any close relatives, you can use Kevin Borland’s tool: https://borlandgenetics.com/

3. cdxp (cdxp)
United States flag
Joined: Mon, 5 Aug 2019
1 blog comment, 0 forum posts
Posted: Mon, 5 Aug 2019  Permalink

Are FTDNA and Myheritage using the same testing facility? Some recent advertising and your results suggest a relationship of some kind. I’ve just purchased DNA tests from both to try and get more coverage than i get from LDNA but perhaps that is duplication.

4. Louis Kessler (lkessler)
Canada flag
Joined: Sun, 9 Mar 2003
287 blog comments, 245 forum posts
Posted: Mon, 5 Aug 2019  Permalink

Cdxp: Yes, the parent company of Family Tree DNA is Gene by Gene, and their testing facility in Houston does both Family Tree DNA and MyHeritage. Gene by Gene changed recently to a new chip that allows for some custom selection of SNPs. Family Tree DNA and MyHeritage have different custom SNPs, with MyHeritage selecting a number of health related SNPs and Family Tree DNA avoiding those.

 

The Following 3 Sites Have Linked Here

  1. Kitty Coopers Blog - Make a combined DNA kit for yourself : Mon, 8 Apr 2019
    Louis Kessler has done a nice detailed post about creating a GENESIS superkit here:

  2. This week’s crème de la crème — April 13, 2019 | Genealogy à la carte : Sat, 13 Apr 2019
    [...] Combine Kits into One Superkit on GEDmatch Genesis by Louis Kessler on Behold Genealogy. [...]

  3. GEDmatch Superkits - How To Reap The Benefits - Data Mining DNA : Mon, 23 Nov 2020
    [...] own combination of five DNA kits and uploaded the mash-up to GEDmatch. He then compared this kit against one created by GEDmatch. The Superkit result were actually very good, and he recommends using Superkits.As to the question [...]

Leave a Comment

You must login to comment.

Login to participate
  
Register   Lost ID/password?