NOTE: Getting Advice About Genetic Testing
In order to get an idea about how well the 23andMe risk calculator agrees with other algorithms (when using the same exact same SNP data), I searched for other tools that I could use to analyze my genetic data.
In order to get an idea about how well the 23andMe risk calculator agrees with other algorithms (when using the same exact same SNP data), I searched for other tools that I could use to analyze my genetic data.
For this post, I have compared my risk assessments from 23andMe to those provided by Promethease (which uses the information available in SNPedia). I also played around with the free version of Enlis Genome (Personal Edition), but I found the GUI to be a little buggy and they didn’t automatically prioritize risk assessments (unlike 23andMe and Promethease). So, this post will focus only on comparing my 23andMe assessment with my Promethease assessment.
To be fair, I should point out that I would not necessarily expect 100% concordance between my 23andMe and Promethease results for various reasons. For example, the “magnitude” score from promethease is a subjective measure, and the curation methods are different for these two tools. However, I think such a comparison will still be useful because it will still be encouraging to see any predictions that are shared by both tools, and I think both of these tools provide useful information since there is no “standard” way to combine all possible associated SNPs associated with a particular disease.
I will focus on my increased disease risks, but the same principles could be applied to decreased disease risk, drug response, or any other trait.
All Diseases with Increased Risk
Overall, I thought that there was pretty good agreement between the two methods. This may not be apparent from the table above, but that is because my list of “higher importance” SNPs is considerably smaller but with greater overlap. For example, I would have ideally preferred to look at SNPs with a 1.5x increase in risk and an absolute risk greater than 50%. The absolute cut-off of 50% is because I would prefer to look at SNPs where I am more likely to get the disease than not get the disease. The 1.5x (or 50% increase in risk) is a somewhat arbitrary cutoff that is loosely based upon my microarray data analysis experience. Since no SNPs meet both of these criteria, I chose to look at those with a greater than 1.5x relative risk and greater than 5% absolute risk (which, in my opinion, is still quite low). Now, take a look at my more subjective SNP list.
“Higher Priority” SNPs
Now, 2 out of the 3 SNPs have similar predictions. Although there wasn’t a high magnitude SNP in promethease for venous thromboembolism, this could be because I subjectively considered this disease to be less well known than arthritis or diabetes, so I figured less popular diseases may have lower magnitude scores. For this reason, I decided to look into what SNPs are used by 23andMe and promethease to determine venous thromboembolism risk. I also checked the Genome-Wide Association (GWAS) Catalog to try and get a idea which SNPs are the best established (according to the US National Human Genome Institute).
SNPs Associated with Venous Thromboembolism Risk
23andMe
|
Promethease
|
GWAS Catalog
| |
rs6025
|
Yes
|
Yes
|
No
|
i3002432
|
Yes
|
No
|
No
|
rs505922
|
No
|
Yes
|
Yes
|
NOTE: Promethease lists 19 SNPs associated venous thrombembolism. In order to simplify the table (and avoid listing some potentially inaccurate and/or low-confidence associations), I have only listed SNPs listed by 23andMe or the GWAS Catalog.
Now there is agreement between the 23andMe and Promethease results because both tools indicate that I have a mutation in rs6025, which results in an increased risk of developing venous thromboembolism. However, I think it is worth pointing out that the results are not quite as clean as they could be. For example, this SNP was not listed in the GWAS Catalog, and I couldn’t determine the dbSNP annotation for i3002432 (so it was relatively hard for me to cross-reference this result with other databases).
Another topic that is worth considering is family history. Before I saw my results, there were 3 diseases that I wanted to check due to family history: type I diabetes, type II diabetes, and macular degeneration. Thus, it was interesting to see type I diabetes come up in both reports. Although I doubt that I will get type I diabetes (since the absolute risk is low and this disease usually appears during childhood), this information may still be useful if these mutations have other affects and/or increase the likelihood of children inheriting type I diabetes.
On the other hand, I didn’t see results indicating a increase in risk for type II diabetes and there were some conflicting results about macular degeneration. Of course, family history is not a gold standard, and I may very well never develop type II diabetes or macular degeneration. However, I think it is important to think carefully about ambiguous or uncertain results. For example, this could be done by comparing SNP association to family history as well as considering both genetic and non-genetic risk factors for disease (the later is the topic of my second post).