<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>Free Association</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/" />
    <link rel="self" type="application/atom+xml" href="http://blogs.nature.com/ng/freeassociation/atom.xml" />
   <id>tag:blogs.nature.com,2008:/ng/freeassociation/3</id>
    <link rel="service.post" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3" title="Free Association" />
    <updated>2008-03-31T22:21:11Z</updated>
    <subtitle>Welcome to Free Association, the Nature Genetics blog. Check here regularly for links and editorial comment on research and news in genetics, as well as reader feedback.</subtitle>
    <generator uri="http://www.sixapart.com/movabletype/">Movable Type 3.2</generator>
 
<entry>
    <title>Microattribution for community annotation of the human genome</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2008/03/microattribution_for_community_1.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=5052" title="Microattribution for community annotation of the human genome" />
    <id>tag:blogs.nature.com,2008:/ng/freeassociation//3.5052</id>
    
    <published>2008-03-31T21:59:59Z</published>
    <updated>2008-03-31T22:21:11Z</updated>
    
    <summary>Human Variome Project Planning Meeting 25 - 29 May 2008, San Feliu de Guixols, Costa Brava, Spain PROBLEM BEING ADDRESSED Microcitation is a way to incentivize public data deposition by extending the practice of citing journal articles to database entries...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p><a href="http://www.humanvariomeproject.org/HVP2008/">Human Variome Project Planning Meeting </a> 25 - 29 May 2008, San Feliu de Guixols, Costa Brava, Spain</p>

<p><br />
PROBLEM BEING ADDRESSED<br />
Microcitation is a way to incentivize public data deposition by extending the practice of citing journal articles to database entries and by providing quantitative citation for every unique author. </p>

<p>SYSTEMS AND PLANS<br />
A pilot project, commissioning peer refereed locus reviews as journal articles with microattribution for individual variants was introduced in a recent <a href="http://www.nature.com/ng/journal/v40/n1/full/ng0108-1.html">Editorial</a> and was expanded upon in detail in this <a href="http://blogs.nature.com/ng/freeassociation/2007/11/towards_a_hermeneutics_of_quan.html">blog</a>. </p>

<p>Each journal article should have a publicly accessible Supplementary Table 1 listing all the accessions cited in the article. The accessions must be indexed to a unique sequence indicating a nucleotide position (an ssID in NCBI) and a unique allelic state. Each string must have an author ID and a unique locator for the citing journal. Thus a citation string is formed as a list of parameters carried on a URL that resolves to the appropriate database:</p>

<p> (ss71650991, A, TSC2DB, doi1038/ng.123, NM_000548.2:c.138+1G>A, OMIM191100, Popfreq=ALFRED#XXX,) </p>

<p>used as a URL, this resolves to:<a href="http://www.ncbi.nlm.nih.gov/SNP/snp_ss.cgi?subsnp_id=71650991">http://www.ncbi.nlm.nih.gov/SNP/snp_ss.cgi?subsnp_id=71650991</a><br />
 even though it does not cite all of the data parameters related to that accession in dbSNP and the string also carries a parameter pointing to the accession number of population frequency information that was submitted to another database.</p>

<p>Microattribution can operate locally, with<a href="http://blogs.nature.com/ng/freeassociation/2007/02/duke_of_url.html "> journals</a> and <a href="http://www.ncbi.nlm.nih.gov/sites/varvu?gene=7249">databases </a> each reporting quantitative citation of accessions. However, depositing the proposed Supplementary Table 1 in a central registry of cited accessions (at publication) has three great virtues. Firstly, different users can create citation counting interfaces to the same information, secondly, if the site is a proxy, it can record all microattribution (web traffic and vendor information as well as microcitation). Finally, the central site can be mined for citations associated with unique author identities and with each author’s publications and database entries. </p>

<p>To anticipate storage problems, parameter-rich accessions (ssID, allele, phenotype_tableID, submitter, curator, LSDB_ID, PBD_ID, ArrayExpress _ID, GeneTests_ID, PharmGKB_ID, local_confidential_record) would be stored for frequent online access, whereas less intensively curated accessions (ssID, allele, submitter, platform) might be stored on hard disks for occasional searching.</p>

<p>OpenURL conventions used by publishers in the <a href="http://www.doi.org/doi_proxy/parameter_passing.html">CrossRef</a> citation system already lay out rules for constructing parameter strings to be carried upon URLs.  This group is also developing a publishers’ version of <a href="http://www.crossref.org/CrossTech/2007/02/crossref_author_id_meeting.html">author disambiguation</a> and there are already web-wide projects that could be tapped, like <a href="http://OpenID.net/">OpenID</a>.</p>

<p>I suggest that parameter sets be nested within existing conventions to allow committees of publishers, microattribution activists, genome annotators, and mendelian mutation curators to define and update parameter forms that work for their communities. </p>

<p>(citer defined)						<br />
			          ..........(microattribution)			<br />
						                        ....................(g,e,n,_,m,e)<br />
                                 		                         ....................  |  |  |  ...   |<br />
						                        ....................(g,e,n,o,m,_)<br />
			          ..........(microattribution)			<br />
(target defined)</p>

<p>COLLABORATION AND SHARING CAPACITY<br />
Thanks to HVP, HGVS, HUGO, NCBI, EBI, UCSF, SNPedia, Genome Commons, and INSIGHT for their time and ideas. These ideas are not limited to the genome community but we have a unique indexing system in the genome and have an opportunity to demonstrate best scientific practice in accurate citation.</p>

<p>TOPIC SECTION<br />
Publication, credit & incentives<br />
</p>]]>
        
    </content>
</entry>
<entry>
    <title>Help! I’m becoming more normal.</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2008/02/help_im_becoming_more_normal.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=4758" title="Help! I’m becoming more normal." />
    <id>tag:blogs.nature.com,2008:/ng/freeassociation//3.4758</id>
    
    <published>2008-02-24T18:33:09Z</published>
    <updated>2008-02-24T18:43:37Z</updated>
    
    <summary>Association of common variants to diseases is still in a phase of rapid discovery. One immediate consequence is that the relative risk for an individual, predicted from very partial information, can change rapidly as more information is added. For example,...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p>Association of common variants to diseases is still in a phase of rapid discovery. One immediate consequence is that the relative risk for an individual, predicted from very partial information, can change rapidly as more information is added.  For example, three of the 18 risk predictions have changed in my profile on the deCODEme site since November:</p>

<p>Restless legs syndrome: 	    OR 1.94>0.97, was 1, now 4 SNPs<br />
Prostate cancer: 		   OR 1.05>0.77, was 5, now 8 SNPs<br />
Type 2 diabetes:		   OR 1.45>1.10, still 8 SNPs</p>

<p>In the first two cases, the new information is the association of new variants that can be added to the risk calculation. In the third case I do not know if this is a recalculation or the result of more studies on the same 8 SNPs. I’ll make a rash prediction and be prepared to be proven wrong, but I think the more common variants are added, the more instances of (individual, disease) will tend to OR=1.0. Maybe I am just indulging in the fallacy called “the law of averages”, but it is at least a conservative, testable hypothesis.</p>

<p>Full genotypes are now available for download, so the project is about to become interesting.<br />
</p>]]>
        
    </content>
</entry>
<entry>
    <title>Do I need a personal trainer or a personal genetic counsellor?</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2008/01/do_i_need_a_personal_trainer_o.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=4443" title="Do I need a personal trainer or a personal genetic counsellor?" />
    <id>tag:blogs.nature.com,2008:/ng/freeassociation//3.4443</id>
    
    <published>2008-01-11T18:02:09Z</published>
    <updated>2008-01-11T22:57:31Z</updated>
    
    <summary>David Hunter, Muin Khoury and Jeffrey Drazen’s Perspective “Letting the Genome out of the Bottle- will we get our wish?” (NEJM 2008. 358;2 105-107) goes further than we have done (below) in its skepticism of consumer genomics services like deCODEme...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p><strong>David Hunter, Muin Khoury and Jeffrey Drazen’s Perspective “Letting the Genome out of the Bottle- will we get our wish?” </strong>  (<a href="http://content.nejm.org/cgi/content/full/358/2/105">NEJM 2008. 358;2 105-107</a>) goes further than we have done (below) in its skepticism of consumer genomics services like <a href="http://www.decodeme.com/">deCODEme</a> and <a href="http://www.navigenics.com/corp/Main/">Navigenics</a>. While <em>Nature Genetics</em> as a research journal can welcome the opportunities for public research participation and ever larger experiments and longitudinal studies, the doctors worry that they will soon be deluged with patients asking them what all this risk information means for them. The authors have some good points, but largely ignore the unpredictable motivational potential inherent in handing people their genomes and asking them to participate in finding out more about their variation and phenotypes. Sometimes, the best doctor will say “we don’t know yet, let’s find out together”. However, we do applaud efforts to build genetics into every stage of medical education and appreciate the authors making their editorial an opportunity to hammer that point.</p>

<p>I take some quotes from the article as the starting point for a few of my own thoughts:</p>

<p><em>“..premature attempts at popularizing genetic testing…”</em><br />
Not so much premature as truly disruptive. Personal genomic testing confers a personal stake in the ongoing research effort and a huge incentive to find out more. A personal stake in finding something that was not previously know is the key to getting students into research and may well be a powerful tool to interest individuals in the details of their own health and functioning.</p>

<p><em>“transparent quality control monitoring”</em><br />
It is true that the use of multiplex genotyping platforms for genetic epidemiology gains some buffering from large numbers of subjects and replication. At the individual level, it is hard to check for a single miscalled SNP, so it might be desirable to build some redundancy into the genotyping using haplotype information. For the genetics enthusiast, it would be reassuring to see the SNP calls in cluster plot format together with those of other anonymized participants.</p>

<p><em>“clinical validity…predictive value...the area in which the data are in the greatest flux.”</em><br />
This is true for conventional exposures and markers too. Physicians no longer prescribe smoking for its “health benefits”, at least not to their patients. Blood pressure and BMI limits now elicit more vigorous treatment from physicians as better information has been gained. It is to be expected that an individual’s risk profile will change with each new study since new variants will be discovered that moderate or exacerbate their individual risk.</p>

<p><em>“full accounting of disease susceptibility awaits the identification of these multiple variants and their interaction in well-designed studies.”</em><br />
I don’t think more retrospective studies will entirely solve this problem. In the meantime, individual genomics might be a great recruiting tool for a longitudinal study. Deliver a detailed genotype to the participants right at the beginning. </p>

<p><em>“assumption that interventions that have proven successful in the general population will behave the same way in a genetically at-risk population.”</em><br />
Optimizing generally applicable interventions with the enthusiastic participation of the research subjects themselves may be the best way forward.</p>

<p><em>“interventions - such as smoking cessation, weight loss, increased physical activity and control of blood pressure – are likely to be broadly beneficial in relation to many diseases, regardless of a person’s genetic susceptibility to a specific disease.”</em><br />
And we pay a physician for this information……why?<br />
It may take years of personal experimentation with different drugs, doses and dose regimens to achieve a balance of blood pressure control and side effects. Would it not be better to have a few clues as to who is likely to achieve blood pressure control via breathing exercises or salt reduction, who by exercise and weight reduction, and who actually needs the beta blockers? OK, gene tests don’t help at the moment with ACE inhibitors or diuretic dosing, but they could when your atherosclerosis gets to the thrombosis stage and are deciding how much <a href="http://content.nejm.org/cgi/content/abstract/352/22/2285">rat poison</a> you want to eat.</p>

<p><em>“patients who test negative may be falsely reassured”</em><br />
I don’t think I am “falsely reassured” by normal cholesterol, kidney function and blood pressure readings. Individuals have a remarkable ability to use information in their own self interest, indeed to integrate family history and the evidence of their own eyes with the gene test information. For a stunningly candid example of a family affected by Alzheimer disease without the major genetic risk factor, read the <a href="http://www.tangledneuron.info/the_tangled_neuron/2006/06/the_apoe_gene_a.html">Tangled Neuron</a>.</p>

<p><br />
<em>“but a detailed consumer report may be beyond most physicians’ skill sets.”</em><br />
It should not be beyond most physicians’ ability to explain the quantitative risks conferred by - and the research underlying – the predictors they currently use: BMI, cholesterol, blood pressure, age and sex. A physician should also be able to relate quantitative risks conferred by a family history containing one or more affected relative. A physician should also be able to advise the patient whether or not to participate in research tests such as that for eg. C-reactive protein, with reasons not including “the insurer doesn’t cover that” or “taking additional tests will remain on your insurance record”.</p>

<p><em>“More information is needed…potential value that genomic profiles can add to that of simpler tools, such as family health history.”</em><br />
Genomic profiles of families will be much more informative to individuals than their comparison with epidemiological studies. For many, the personal genomic profile is the stimulus to explore family history. Discussion of personal genetic risk factors may act to release family medical history kept private because it had no perceived use.  </p>

<p><em>“encourage them to enroll in formal scientific studies.”</em><br />
Maybe participation in personal genomics would have this desirable effect. With no personal stake and no current health problems, it may not occur to many individuals that their participation would be welcomed by scientists and the public alike. It would be interesting to compare the general health and health awareness of eg. Framingham Heart Study participants with nonparticipants, I just haven’t had the time to read any such sociology.</p>

<p><em>“genomic services will galvanize…translational research for the rational integration of genomic information into medical training and practice.”</em><br />
Here the authors get to the heart of their worries: this is a disruptive technology that has caught doctors in the middle of their effort to bring genetics to its proper place in medical education and they might get deluged with questions they aren’t yet ready to answer. For more on the change of emphasis in medical education toward genetics and the individual, see David Valle’s article referenced below.</p>

<p><em>“better off spending their money on gym membership or a personal trainer….follow a diet and exercise regimen that we know will decrease his risk of heart disease and diabetes.”</em><br />
Why not both? Different people have different motivation. Socially body conscious individuals may prefer the gym, the introverted but vain may prefer the attention of the trainer. Intellectually motivated people may choose their sport, or only exercise out of curiosity to see whether they can fulfil the promise of their “elite athlete” SNPs.</p>

<p><br />
<strong>Now, some of my favorite quotes from “A Science of the Individual: Implications for a Medical School Curriculum” Childs, Wiener and Valle. </strong> <a href="http://arjournals.annualreviews.org/doi/abs/10.1146/annurev.genom.6.080604.162345">Ann Rev. Genomics Hum Genet 2005. 6;313-330</a>.</p>

<p> <br />
<em>“Although hypertension is an elevation in systemic blood pressure, each patient reaches that phenotype by different paths, each determined by a unique combination of genetic makeup and experiences of the environment…..Individuality is also apparent in the treatment of hypertension.”</em></p>

<p><em>“contradiction between the singularity of the patient and the generality of treatments and prevention.”</em></p>

<p><em>“a tradition of typological thinking about biology and disease that does not accommodate variation.”</em></p>

<p><strong><br />
Finally, highlights from the “Risky Business” Editorial by my colleague, Alan Packer</strong> (<a href="http://www.nature.com/ng/journal/v39/n12/full/ng1207-1415.html">Nat Genet. 2007. 39;12 1415</a>).</p>

<p><em>“The need for both physicians and their patients to be better educated about complex genetics has taken on added urgency of late.”</em></p>

<p><em>“With the possible exception of age-related macular degeneration, how much can we say with confidence about the spectrum of risk?”</em><br />
Variants with low relative risk make poor classifiers. This point was made by Alan, and by the NEJM authors. So, individual genomics may not induce anyone to take a clinical test until the list of risk variants adds up the point where it can identify some particularly unlucky individuals. In the meantime, it will have informed thousands by providing a personal stake in one of the most exciting areas of medical research and it may recruit enthusiastic participants in a massive longitudinal study that they will have funded partially from their own pockets. </p>

<p>That being said, they are participants, not patients, and the experiment will be conducted on their own terms!<br />
</p>]]>
        
    </content>
</entry>
<entry>
    <title>For what it&apos;s worth</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/12/for_what_its_worth.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=4347" title="For what it's worth" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.4347</id>
    
    <published>2007-12-21T05:21:49Z</published>
    <updated>2007-12-21T05:23:19Z</updated>
    
    <summary> Source: deCODEme...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p><img alt="APOE.jpg" src="http://blogs.nature.com/ng/freeassociation/APOE.jpg" width="692" height="142" /></p>

<p>Source: <a href="http://www.decodeme.com/">deCODEme</a></p>]]>
        
    </content>
</entry>
<entry>
    <title>Adventures in personal genomicsland</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/12/adventures_in_personal_genomic.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=4346" title="Adventures in personal genomicsland" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.4346</id>
    
    <published>2007-12-21T04:38:39Z</published>
    <updated>2007-12-21T04:44:00Z</updated>
    
    <summary>SNPedia I recall a joke that probably plenty of folks have told; I heard it from Francis Collins, the head of NIH&apos;s Genome Project. A previously-married woman heads to bed for the first time with her new beau, and to...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p><a href="http://snpedia.blogspot.com/">SNPedia</a><br />
I recall a joke that probably plenty of folks have told; I heard it<br />
from Francis Collins, the head of NIH's Genome Project.<br />
A previously-married woman heads to bed for the first time with her<br />
new beau, and to his surprise, she admits to being a virgin. When he<br />
wonders why, she says, "Well, I was married to a genome biologist, and<br />
every night, he just sat in bed and talked about how great our sex<br />
life would be someday."</p>

<p><a href="http://www.thegeneticgenealogist.com/"><br />
The Genetic Genealogist</a><br />
myDNAchoice - Are Your Surfing Habits the Result of Your Genome?</p>

<p><a href="http://thegenesherpa.blogspot.com/">The Gene Sherpa</a><br />
One of these companies will get sued</p>

<p><a href="http://thepersonalgenome.com/">The Personal Genome</a><br />
For example, I share parts of my Y-chromosome with my father (I didn't<br />
ask his permission to post parts of it online it either).</p>

<p><a href="http://www.wingedpig.com/archives/2007/12/23andme_results_266.html">Wingedpig</a><br />
Googling around, I found that the APOE gene on chromosome 19 is of<br />
particular interest, specifically APOE e2, e3 and e4. In the Genome<br />
Explorer, I can type in APOE, and it takes me to a listing of 19 SNPs<br />
on the APOE gene. Ok, great. But I have no idea which one(s) of those<br />
SNPs are the ones we're talking about and what the mutations are.<br />
Without this last bit, the Genome Explorer is basically meaningless.</p>]]>
        
    </content>
</entry>
<entry>
    <title>Everything you need to last you two lifetimes</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/12/everything_you_need_to_last_yo.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=4311" title="Everything you need to last you two lifetimes" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.4311</id>
    
    <published>2007-12-16T16:16:43Z</published>
    <updated>2007-12-17T05:35:40Z</updated>
    
    <summary>I&apos;m roadtesting personal genomics services, starting with deCODEme, since many of the genotype-phenotype associations they report have been published in a reputable journal. I am the guinea pig and here are the ground rules: I will reveal everything I believe...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p>I'm roadtesting personal genomics services, starting with <a href="http://www.decodeme.com/">deCODEme</a>, since many of the genotype-phenotype associations they report have been published in a reputable journal. I am the guinea pig and here are the ground rules: I will reveal everything I believe to be useful to future research. If that seems too coy, please comment and I will answer truthfully. I reserve the right to move the detailed discussion elsewhere, since space in Nature Genetics is limited, even in the blog (thanks for the space, Alan, apologies if this seems TMI).</p>

<p>These services offer the opportunity for real people to participate in research and  to address for the very first time the question, "I have this genotype, what will happen to me?". The tests offered are not clinical tests, so insurers, employers, physicians and family, please comment as fellow research participants and don't  try to make more of these information services than they purport to be. By real people, I mean individuals with their own responses and interpretations of the research as it affects them, rather than the anonymized people genetic epidemiology uses to make its predictions.</p>

<p>The first figure below shows my thoughts on the subject before I started to look at the results. My initial impression was that I was not going to pay attention to SNPs that on published precedent suggest my lifetime risk of any condition is less than 30%. I guessed I would research any biological hypothesis in the 30-60% range and possibly seek a clinically approved genetic test and medical advice for any genetic prediction of elevated risk over 60%. Given the predictable response of my fellow commuters to eg. seat belts, anti-lock brakes and airbags, I feared I might compensate behaviorally if I got a hint of protective alleles (eg. <a href="http://hmg.oxfordjournals.org/cgi/content/full/15/9/1539">ADH2*2</a>, <a href="http://www.nature.com/ni/journal/v8/n12/abs/ni1521.html">CCL3L1</a>). Impressed by Andrew Niccols' prescient (if insufficiently palindromic) <a href="http://www.imsdb.com/scripts/Gattaca.html">GATTACA</a> I assumed that there might even be SNPs that would convince Uma Thurman to have my babies.</p>

<p><br />
<a href="http://blogs.nature.com/ng/freeassociation/Umagraph003.jpg"><img alt="Umagraph003.jpg" src="http://blogs.nature.com/ng/freeassociation/Umagraph003-thumb.jpg" width="500" height="335" /></a></p>

<p>The first unexpected problem was to identify myself. Since the website is very new and I don't have the raw Illumina SNP calls or any population samples with which to examine the cluster plots myself, I can't verify the raw data. Even if I could do so, I have little but consistency with other genotyping services to ensure I am looking at my own genes. From <a href="http://www.nature.com/ng/journal/v39/n12/abs/ng.2007.13.html">Genographic,</a> I know I have Cambridge mitochondria (H, 16188G, 16311C, 16519C) and a R1b1c Y chromosome and from DNAPrint Genomics's "proprietary AIMs", I know only that I am mostly of European ancestry (which luckily tallies with the origins of my great great great grandparents: 12 German, 9 English, 5 Dutch, 2 Irish, 2 Welsh, 1 Swiss, 1 Scottish). Thanks to Daniel Gubjartsson's recent <a href="http://www.nature.com/ng/journal/v39/n12/abs/ng.2007.13.html">paper</a>, I also know I am quite likely to have brown hair and brown eyes. The main problem, apart from lack of SNP data to search on my own via Greg Lennon and Michael Cariaso's wonderful <a href="http://www.snpedia.com">SNPedia</a> site and the underlying literature, is the problem of self recognition. </p>

<p>Individual taste preferences might provide a solution, so I suggested the self-recognition problem might be solved via an <a href="http://www.nature.com/ng/journal/v34/n2/abs/ng1160.html">olfactory</a> SNP-social-network-wine-club. Another consumer genomics company does report on "taste-related" alleles but they haven't invited me to the party..... I guess the Icelanders are still running the gas chromatograph on the 70cl of wine the last foreign visitor brought into the country in their duty-free allowance.</p>

<p>Using the information at my disposal,  I first plotted my risk ratios against the prevalence of the conditions. </p>

<p><img alt="ORvspoprisk.jpg" src="http://blogs.nature.com/ng/freeassociation/ORvspoprisk.jpg" width="500" height="335" /></p>

<p>The rheumatoid arthritis results caught my attention since the combination of major and minor contributors and mix of risk and protection alleles pretty typical of the other common diseases for which more than one locus has been implicated. Here are the actual results:<br />
<img alt="RASNPs.jpg" src="http://blogs.nature.com/ng/freeassociation/RASNPs.jpg" width="763" height="529" /></p>

<p>I next tried plotting my risk against the population mean risk, as in my sketch above. A decade of research reveals a several things that I couldn't have predicted on the back of an index card. My zone of solidarity is - of course-  a cone of solidarity, since the variance of the risk increases with the risk.</p>

<p><a href="http://blogs.nature.com/ng/freeassociation/riskriskplot.jpg"><img alt="riskriskplot.jpg" src="http://blogs.nature.com/ng/freeassociation/riskriskplot-thumb.jpg" width="500" height="341" /></a></p>

<p>Seen this way, SNPs associated with larger risks of rarer conditions fall into perspective. So what can I offer Uma? It goes without saying that I wouldn't kick her out of bed for eating cookies. In the very unlikely event that I were to do so, I would draw her attention to my  elevated risk of "restless leg syndrome".  I would not be eating the cookies myself, because of the theoretical worry of type 2 diabetes. Observing a BMI of 27 from a safe distance, she would not be particularly convinced by my predicted genetic chance of resisting  the onset of middle-age spread.  </p>]]>
        
    </content>
</entry>
<entry>
    <title>Towards a hermeneutics of quantum citation</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/11/towards_a_hermeneutics_of_quan.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=4131" title="Towards a hermeneutics of quantum citation" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.4131</id>
    
    <published>2007-11-24T16:10:20Z</published>
    <updated>2007-12-11T15:14:23Z</updated>
    
    <summary>FAQ “I haven’t got time to read all this, what are you on about?” Microattribution [editorial1] [editorial2]should be built into database entries, databases, genome browsers, journal articles, journals and social network face pages, probably even into your fridge. Providing incentive...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p>FAQ<br />
“I haven’t got time to read all this, what are you on about?”<br />
Microattribution [<a href="http://www.nature.com/ng/journal/v39/n8/full/ng0807-931.html">editorial1</a>] [<a href="http://www.nature.com/ng/journal/v39/n4/full/ng0407-423.html  ">editorial2</a>]should be built into database entries, databases, genome browsers, journal articles, journals and social network face pages, probably even into your fridge.<br />
Providing incentive for global community annotation of the human genome. Giving database accessions the same citation conventions and indices that journal articles currently enjoy, so that genotype-phenotype-frequency annotations are counted as genotype-author-phenotype-frequency annotations. Microattributions should be complemented by high level peer-refereed reviews and a convenient browser interface. Annotators should form a social network that displays their publications, microattributions, affiliations and credentials. Vendors can sustain free access by sponsoring particular sets of content (eg. loci, pathways, probesets) and via annotator endorsements. These models might be appropriate for a Variome browser, Variome reviews and a network of human genome annotators, HUGONet.</p>

<p>“Who are you anyway?”<br />
We are the world, we are the children. Seriously though, there are a lot of clever, highly motivated people who weren’t around when the human genome project went down and who would like to get involved,  perhaps by adopting a neglected region of the genome of local interest. There are also a lot of expert old lags hanging onto lovingly curated real pathogenic human mutation data collected in the pregenomic era that needs to be brought out to create a framework of human variation before the trace archive tsunami hits. Plenty of puzzled physicians who want to know which variants are disease-associated too.</p>

<p>“Are you trying to build your own database?”<br />
No databases will be harmed in the making of the <a href="http://www.variome.org/">Variome</a> project.  We are building a filtering and highlighting site that credits databases and has a feed (Variome track) to Ensembl and UCSC browsers. The Variome server will database  only the microattribution counter, the information required to mark variant ssID and annotations from NCBI as peer reviewed  or not,  it will have  the ability to mark a region between genome coordinates as open, under review or published, and it will have a microattribution wiki for community comments. The MTS peer review database can be left up to commissioning journals. </p>

<p>“Won’t Ensembl just pick all the information directly from the LSDB and NCBI?”<br />
Maybe, but even if intensive curation is enough to get the information right, is ending up on a browser track giving enough credit to the data producers to induce them to participate in the data transfer, checking and indexing process? Why not offer them microattribution and a publication as well?</p>

<p>“Why not just use the UCSC wiki browser?”<br />
We would if it had a graphic track showing quantitative microattribution in published literature and tally of wiki comments loaded onto each ssID, links to the appropriate database entries, and the ability to close commenting across a region, and to mark a region as published. I think we would still need a server on which to store a local copy as the annotation committee (authors, editor and referees) worked on the regions to publish.</p>

<p><br />
“But surely, all that is required is for variants to be submitted to NCBI and they will appear on the genome browser”<br />
This has not happened. LSDBs are holding a large number of well-annotated variants (representing years of work and many grants) that have not been moved to a common indexing system. There is no pipeline from the clinical labs and we are about to be deluged with resequencing deposited in sequence trace archives with little human annotation. </p>

<p>“Why doesn’t NCBI just provide citation statistics for every ssID, author handle and annotation?”<br />
They should. The microattribution concept means that every database entry should not only provide the links to, but should provide a current count of its forward citations in papers and database entries. If NCBI did this, we would still need the high level reviews to ensure locus annotation quality and give high level priority to the annotator community.</p>

<p>“Why not just provide browser feeds from the LSDBs?”<br />
LSDB server capacity and maintenance is variable,  the databases are in a variety of formats and the variants are not indexed to genome, duplication of NCBI on a small scale is inefficient, LSDB curators lack the resources to convert their data.</p>

<p>“Why is it not sufficient to use existing wiki (SNPedia) or wiki browser (UCSC)?”<br />
Wiki comments,  even from an approved  annotator  community, will vary in credibility and detail and even if in theory they are scrutinized by readers, it is not certain that they will be corrected unless there is a high level incentive (the Variome Review) to encourage correction.</p>

<p>“Why do we need a wiki at all, since anyone can submit variant annotations to NCBI curators?”<br />
The activation barrier to wiki commenting is very low and because the results can be seen immediately, alert students, as well as experienced researchers, can readily make a big difference.<br />
Curators need to sleep, whereas the wiki is automated and can take comments from all time zones.</p>

<p>“Why peer review? Surely the data producers together with NCBI curators can provide authoritative variant reports indexed to the genome?”<br />
This works for the variant report alone, but doesn’t provide consistent quality control or scrutiny across the locus. David Ravine’s <a href="http://www.nature.com/ng/journal/v39/n4/abs/ng0407-427.html">review of PKD1 variants</a> (Nature Genetics April 2007 -which is the model for a Variome Review journal article) revealed that 5% of the published variants  were wrong. Community scrutiny under a uniform, editorially controlled process not only provides high level reviews to bring credit to the data producers, but highlights which datasets can be used with confidence.<br />
 <br />
“Is there a better way to view genotype-author-phenotype compound statements and their associated accession numbers than a microattribution wiki browser or a database table?<br />
Undoubtedly. If you have built one, we will use it.</p>

<p>“I have a grant to build/have built/have thought of and am going to build/a better solution and/or I am a hugely influential funding body and we have a plan that doesn’t include you so you are wasting your time.”<br />
Fine, we are providing only what is missing: respect and credit for database builders, curators, authors,  data producers,  cogs in big teams,  funding bodies, research participants (with their permission)  and yes, eventually even  journals and editors. Maybe you’d like to be appreciated too?</p>

<p>“Microattribution is not even a new idea, it is obvious.”<br />
Yes, isn’t it.</p>

<p>“Isn’t Gogol going to index teh everything anyway?”<br />
Maybe, but the ssID is a pretty cool way to index data (thanks, Donna!) and  wouldn’t you rather trust your peers to evaluate genome variants in the first instance, so that the plex will develop their genome tools in ways we will be able to use?</p>

<p>“Won’t your filter become redundant if all NCBI entries are correct and journals provide quantitative attribution via a whizzy interface?”<br />
Yes,  I look forward to this day. If this process is catalytic and every database and journal provides microattribution credit, the Variome Browser filter will have served its purpose. </p>

<p>“Why not organize the HVP along the lines of the <a href="http://www.sanger.ac.uk/PostGenomics/decipher/">DECIPHER</a>  network?”<br />
The problems are fundamentally different. DECIPHER exists to catalog clinical importance of structural genome variation,  starting from scratch. For mendelian mutations,  existing annotator communities have done much of the work,  but now need credit for reformatting and indexing their variation collections to the genome and scrutinizing the annotations, locus by locus.</p>

<p>“Surely you are duplicating the work of <a href="http://www.pharmgkb.org/">PharmGKB</a>, <a href="http://www.hgmd.cf.ac.uk/ac/index.php">HGMD </a>(Cardiff) and <a href="http://www.ncbi.nlm.nih.gov/sites/entrez?db=OMIM">OMIM</a> ?”<br />
These are databases without the intention to create systematic review and journal articles.<br />
None provides quantitative citation credit, although they reference the original sources of the variants. </p>

<p>“Aren’t you just duplicating the effort of <a href="http://www.cdc.gov/genomics/hugenet/default.htm">HUGENET</a> ?”<br />
HUGENET provides disease-centered meta-analysis of genetic epidemiology,  Variome integrates findings of rare variants in rare mendelian diseases, rare variants in common diseases and common variants in common diseases.</p>

<p>“How will you distinguish peer reviewed from entries that were curated but not reviewed?”<br />
We make no distinction at the browser level between  curated entries and those added by wiki,  only between reviewed and non-reviewed information. The source will be evident upon clicking the link, since these lead to NCBI and to the locally databased wiki respectively.<br />
NCBI entries and wiki comments will be visible on the Variome browser in gray, the information that survived collaborative annotation and peer review will be in black.</p>

<p>“Surely the peer review process will entail rewriting and correcting NCBI entries, rather than merely filtering out those that are wrong?”<br />
Irretrievably wrong or unattributable entries will need to be excluded. If the original data producers are unwilling to reannotate them correctly, new entries will be made. If the original data are largely correct, an annotation (corrigendum) will be appended to the existing NCBI entry.</p>

<p>“If competing journals have commissioned reviews on different regions of the genome,  how can we distinguish one annotation group from another (some authors will overlap). “<br />
This happens at the moment with papers. Editors, referees and authors behave remarkably ethically. Authors can be added at the editor and senior author’s discretion. With microattribution, if an author on a collaborative annotation makes less than one annotation between the locus genome coordinates, their author status may be questioned.</p>

<p> “Won’t this confuse journal impact factors? What microcitation measure is the right one?”<br />
Professional bibliometrists, get to work! Since when was more information a bad thing? The existence of the $20 bill does not do away with a need for $1 bills.</p>

<p>“How will authors know if they are citing the right entity?”<br />
This problem exists already in journal articles. Authors will cite a <a href="http://www.ncbi.nlm.nih.gov/geo/">GEO</a> GPL platform accession number rather than the appropriate experiment accession (GSE number). As journals become more database-like,  the database part of the article will reference the correct compound (ssID, author, phenotype, frequency) to support the assertion made (this variant is always associated with this disease).</p>

<p>“Are you planning to provide a disease-centered interface,  like <a href="http://www.nature.com/ng/journal/v39/n1/abs/ng1934.html">AlzGene</a> <a href="http://www.alzgene.org ">[link2 ]”</a><br />
These objectives are on the HVP wish list for Informatics but outside the remit of Publication and Credit. The physician community will probably want to design a pipeline from the clinical sequencing labs to NCBI and a disease-oriented browser interface that starts with the variants for which clincal tests are available, then moves on to a list of variants sorted by whether they are pathogenic or not. These projects will be enabled by - but are are outside the immediate scope of - the microcitation proposal.</p>

<p>“Will other journals provide microcitation statistics?”<br />
Hurry! Nature Genetics already <a href="http://blogs.nature.com/ng/freeassociation/2007/02/duke_of_url.html">does</a>  and we are now looking for better ways to display and use the information (data producer index, annotator index, collaboration index, mentorship index).</p>

<p>“What else could HUGONet members put on their face page?”<br />
If you were the referee who provided the decisive experiment for a highly cited Nature paper, you might like want to reveal  that fact (after publication) and post your review on your face page,  even if you didn’t want your comments publicly on the Nature web site? If you were sparsely linked within the network,  but had a postdoc position you needed to fill, why not pay HUGO to link you up for a month to the highest priority part of the network (or to everyone within the network), HUGO needs money and you need a postdoc.</p>

<p>“What about genes where tests for variants have been patented?”<br />
Companies holding a lot of patents on particular genome regions (eg. BRCA1) might be motivated to adopt that gene within the Variome system, providing information, advertising and funding. I see no reason why their annotations should not be subjected to peer review in the usual way, since journals are happy to publish corporate research so long as methods are transparent and materials are available.</p>

<p>“What kinds of annotation will be counted?”<br />
Ideall all, but the intention was to highlight all forms of annotation that require human effort, so the citation counter highlights papers and reviews separately from annotations with phenotypic information and annotations without phenotypic information. The compound (ssID, author, phenotype,  frequency) has more value than the compound (ssID, author, vendor,  probe) or the fundamental microcitation particle (ssID, author) even if it is cited less often.<br />
For example, the SNP Consortium will be recognized for its discovery of a SNP every time a user cites a ssID or its rsID synonym.  Affymetrix will get a microcitation for every paper that lists a probe used in high throughput genotyping. A mendelian geneticist will be interested in a single clinical report (dbGAP entry) coupled to a sequence report (ssID) and frequency (NCBI annotation ID).</p>

<p>“What happens when the journals are no longer interested in publishing reviews?”<br />
Some loci receive more interest than others. Annotations of genes with many mutations and phenotypes may comprise publications on their own, while others may be annotated mutations affecting a pathway or process. Annotation of neglected regions will accrue progressively more attention via the microattribution process and will be adopted by interested groups for publication.<br />
In the spirit of working with all interested parties, it is not anticipated that Variome will launch a journal to house the reviews,  but this is an option to discuss. <br />
</p>]]>
        
    </content>
</entry>
<entry>
    <title>Tiny bubbles</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/09/tiny_bubbles.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=3704" title="Tiny bubbles" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.3704</id>
    
    <published>2007-09-25T21:49:26Z</published>
    <updated>2007-09-25T22:00:06Z</updated>
    
    <summary>With at least the provisional success of genome-wide association studies to identify common disease-related variants now apparent, and highly significant P values floating up out of figures looking like nothing so much as a series of champagne flutes full to...</summary>
    <author>
        <name>Alan Packer</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p>With at least the provisional success of genome-wide association studies to identify common disease-related variants now apparent, and highly significant P values floating up out of figures looking like nothing so much as a series of champagne flutes full to the brim, it’s interesting to take a look back at a paper that arguably provided the key impetus for the field. In their 1996 <em>Science</em> paper <a href="http://www.sciencemag.org/cgi/reprint/273/5281/1516.pdf">“The Future of Genetic Studies of Complex Human Diseases”</a> (cited 2,231 times as of this writing), Neil Risch and Kathleen Merikangas asked:</p>

<blockquote>“Has the genetic study of complex disorders reached its limits?”</blockquote>

<p>Remarkably, this was the key question only 11 years ago. Of course, Risch & Merikangas were referring to the specific question of whether linkage studies would be adequate to detect variants of modest effect in the realm of complex disease. In <a href="http://www.journals.uchicago.edu/AJHG/journal/issues/v76n2/42053/42053.web.pdf">his address </a>upon receiving the Curt Stern award at the 2004 meeting of the American Society of Human Genetics, Risch described how the article came about:</p>

<blockquote>“One colleague I worked with extensively, both in teaching and research, was <a href="http://intramural.nimh.nih.gov/research/pi/pi_merikangas.html">Kathleen Merikangas</a>, a psychiatric epidemiologist with interests in genetics. We spoke frequently about the state of the field of genetic epidemiology and where it was and should be going. We continued these discussions even after my move to Stanford in 1995. We began to develop an awareness that the linkage approach, although having some modest success in complex diseases, was unlikely to identify the large majority of genes. We were influenced by a news item that appeared in <em>Science</em> on July 14, 1995, entitled “<a href="http://www.sciencemag.org/cgi/reprint/269/5221/164.pdf">Epidemiology Faces Its Limits</a>”….Although the article did not discuss human genetics or genetic epidemiology, we realized that many of the comments could apply to the developing situation in human genetics as well”.</blockquote>

<p>Interestingly, the author of that <em>Science</em> news article, Gary Taubes, is back in The New York Times magazine, with <a href="http://www.nytimes.com/2007/09/16/magazine/16epidemiology-t.html">another piece </a>pouring cold water on the field of epidemiology.</p>

<p>In any case, after penning a draft entitled “Human Genetics Facing Its Limits”, they realized that a more optimistic slant would be required, preferably one that offered an alternative approach. Again, from the Stern address:</p>

<blockquote>“If we could have any tool to use for mapping disease genes, we wondered what would it be? Again, on the basis of my experience with HLA-associated diseases and my knowledge about disease associations with other blood-group systems, I knew that many of these associations, although highly significant statistically, would not produce substantial or robust linkage signals. Therefore, why not reverse the process of positional cloning? Instead of searching randomly through the genome by location, why not start with genetic variants and test them directly as candidates? The problem with candidate-gene association studies had been the limited number of candidates and, therefore, the low prior probability of a ‘hit’. But what if we could compile a list of all polymorphisms in the human genome?”</blockquote>

<p>You know the rest. What’s remarkable about the Risch & Merikangas paper, beyond the power calculations showing that the relative gain in power for association studies as opposed to linkage, was the authors’ prescience in outlining the key issues. They noted the stringent genome-wide significance level that would be required for testing on the order of 1 million variants, while also pointing out the likelihood that linkage disequilibrium would allow this number to be reduced substantially. They also implored investigators to preserve all of their samples for future large-scale testing, and it could be argued that the collection of samples is now the rate-limiting step in association studies. Concluding, they wrote:</p>

<blockquote>“Thus, the primary limitation of genome-wide association tests is not a statistical one but a technological one. A large number of genes (up to 100,000) and polymorphisms…must first be identified, and an extremely large number of polymorphisms will need to be tested”.</blockquote>

<p>And finally:</p>

<blockquote>“The human genome project can have more than one reward. In addition to sequencing the entire human genome, it can lead to identification of polymorphisms for all the genes in the human genome and the diseases to which they contribute”.</blockquote>

<p>This is a reminder to those of us who, in the wake of so many robust associations, thought this was considered to be the reward from the very beginning. It wasn’t always obvious, it seems.</p>

<p> </p>

<p></p>

<p><br />
</p>]]>
        
    </content>
</entry>
<entry>
    <title>Cover puzzle</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/09/cover_puzzle.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=3514" title="Cover puzzle" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.3514</id>
    
    <published>2007-09-04T16:35:46Z</published>
    <updated>2007-09-04T16:53:44Z</updated>
    
    <summary>Anthony Edwards has produced an elegant representation of the genetic code in all its degenerate complexity for this month&apos;s , cover explained in Touching Base. Now you have a chance to use his device to solve a puzzle. Please post...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p>Anthony Edwards has produced an elegant representation of the genetic code in all its degenerate complexity for this month's , <a href="http://www.nature.com/ng/journal/v39/n9/covers/index.html">cover</a> explained in <a href="http://www.nature.com/ng/journal/v39/n9/full/ng0907-1061.html">Touching Base</a>. Now you have a chance to use his device to solve a puzzle. Please post your solutions  to the Nature Precedings website, or send them to me and I'll add them to this blog.</p>

<p>Problem: Find a ‘Gray code’ order for the codons.<br />
When the numbers 1 to 2n are written in binary form in their natural order the number of digits, 0 or 1, that change on proceeding from one number to the next varies. There exist, however, orderings in which only a single digit changes each time and the last number only differs from the first in respect of a single digit as well. These are known as Gray codes, the numbers forming a complete cycle.<br />
The same principle can be applied to the codon triplets, ordering all 64 in a cycle such that each differs from its predecessor in exactly one position. There are many such orders, each forming a Gray code. They differ in the extent to which they group together triplets that code for the same amino-acid.<br />
One measure of success in forming such groups would be the number of times in the cycle that neighbouring triplets code for different amino-acids (or a stop signal). Since there are 21 of these the absolute minimum of such changes between neighbours is simply 21, but this may not be attainable. <br />
It is easy to find a Gray code ordering with 25 changes by threading a regular route through the standard table of the genetic code. But can you find one with fewer? There’s a route through the Edwards–Venn diagram given in Figure 3 of ‘Picturing the genetic code’ (<em>Nature Precedings</em>  <a href="http://dx.doi.org/10.1038/npre.2007.682.1">doi:10.1038/npre.2007.682.1</a>) with only 23 changes of amino-acid.</p>

<p>P.S. Why ARE there two groups of serine codons, anyway?<br />
</p>]]>
        
    </content>
</entry>
<entry>
    <title>Trust but verify</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/06/trust_but_verify.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=2841" title="Trust but verify" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.2841</id>
    
    <published>2007-06-08T04:49:37Z</published>
    <updated>2007-06-08T14:56:08Z</updated>
    
    <summary>The Wellcome Trust Case Control Consortium presents associations to seven common diseases and I can&apos;t help asking, &quot;so how have we done so far?&quot; Judging by the reference list of the recent WTCCC paper in Nature, the results of 10...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p>The Wellcome Trust Case Control Consortium presents associations to seven common diseases and I can't help asking, "so how have we done so far?" Judging by the reference list of the recent <a href="http://www.nature.com/nature/journal/v447/n7145/abs/nature05911.html">WTCCC paper </a>in <em>Nature</em>, the results of 10 of the 13 <em>Nature Genetics</em> papers listed were replicated associations or reported associations replicated by this study.</p>

<p>In the data of the paper itself, evidence is presented that replicates the associations published in (at a quick but incomplete count) 6/17 past <em><a href="http://www.connotea.org/user/ngnomics/tag/WTCCC">Nature Genetics</em> papers</a>, validating or revalidating 14/28 of the loci we have published. The reasons for failure are a rather uninstructive mixture. There are unimpressive p values for SNPs in common between studies that might indicate the initial report was a false positive, or signal a real population difference. There are many SNPs not used in both studies, a situation calling for comparison using proxy SNPs. Some previously reported loci are not considered in the WTCCC paper, and the status of these will have to be dug from the raw association data by dedicated meta-analysts.</p>

<p>Despite the awesome scale of the WTCCC scan, it is considered by many to be a hypothesis generating exercise and an accompanying <a href="http://www.nature.com/nature/journal/v447/n7145/full/447655a.html">Feature</a> from Stephen Chanock, Teri Manolio and many colleagues warns of the necessity to replicate association results.</p>

<p>Replication is certainly powerful in the wake of such a big screen. The primary scan of nearly 500,000 SNPs across 17,000 people for seven diseases has so far resulted in evidence for 24 newly or multiply replicated common variant loci for three diseases that have been verified in just three <a href="http://www.connotea.org/user/ngnomics/tag/WTCCC%20replication%20study">replication papers</a>.</p>]]>
        
    </content>
</entry>
<entry>
    <title>Selfish factoids</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/05/selfish_factoids.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=2550" title="Selfish factoids" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.2550</id>
    
    <published>2007-05-02T18:37:36Z</published>
    <updated>2007-05-02T19:15:29Z</updated>
    
    <summary>Why do editors prune appealing afterthoughts and out of focus information to leave only the highly-scrutinized core of the research paper? ‘Appealing’ is the key word, that stuff has a life of its own and can literally beg you to...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p>Why do editors prune appealing afterthoughts and out of focus information to leave only the highly-scrutinized core of the research paper? ‘Appealing’ is the key word, that stuff has a life of its own and can literally beg you to add it to your papers. “Add me”, the factoid pleads, “and your work will make it into the news, will get discussed”.</p>

<p>Supplementary information may be capable of transmitting itself, regardless of its utility and accuracy. However, it may be under less selective pressure of reader scrutiny than the main findings. Should we worry about the quality of the information transmitted? Are there examples of runaway transmission?</p>

<p>I’ll try to provide an example, starting with a surprising re-uppopping of one of the earliest examples of a selfish element, supernumary or B-chromosomes, first reported by E.B. Wilson in <em>Metapedius</em> bugs in <a href="http://www.sciencemag.org/cgi/reprint/26/677/870.pdf">Science</a> in 1907.</p>

<p>Selfish factoids can even infect <a href="http://www.mattridley.co.uk/">Matt Ridley</a>, probably the most accessible and informative science writer alive. I enjoyed reading his “<a href="http://www.amazon.com/Origins-Virtue-Instincts-Evolution-Cooperation/dp/0140264450/ref=pd_lpo_k2_dp_k2a_3_txt/002-4223983-2075251">Origins of Virtue</a>- Human Instincts and the Evolution of Cooperation” but found this:<br />
Chapter 1, footnote 29. <strong>“I am indebted to David Haig for the information that human beings have B chromosomes at the rate of 2-3 per cent of live births.”</strong></p>

<p>Before rushing off to spread or apply this information and freak about its impact on genome-wide association studies for common diseases, I did a few reality checks.</p>

<p>Haig himself can’t resist transmitting B-chromosome factoids in a <a href="http://www.oeb.harvard.edu/faculty/haig/pdfs/99Autosomes.pdf">1999 paper </a>that is ostensibly about the origins of human autosomes.  He performs some comparative paleocytology on our putative common ancestors, and (coolest of all) identifies them as the werepig and the weremouse. <br />
“An extra ‘marker' chromosome is detected in about one in 1000 prenatal diagnoses (refs), and a similar frequency is found among newborn infants (<a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&list_uids=7747772&dopt=Abstract">Gravholt and Friedrich 1995</a>). "</p>

<p>That reference quotes the newborn frequency 0.7/1000.</p>

<p>So Ridley was exaggerating by a factor of forty-ish. But are there any risks associated with inheriting these chromosome “passengers”?</p>

<p><a href="http://www.nature.com/nrg/journal/v7/n2/pdf/nrg1767.pdf">Lars Feuk and colleagues</a> in <em>Nature Reviews Genetics </em><br />
“…structural abnormalities occur in about 1 out of 375 live births, with a quarter of these being unbalanced (Ref 37). The risk of congenital abnormalities that are associated with such variants is 6.7% for reciprocal translocations and inversions and 25.6% for all types of marker chromosome (Ref 12)”. </p>

<p>I’d be interested to know how this number was calculated, Ref 12 is <a href="http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pubmed&pubmedid=1928105">Dorothy Warburton’s 1991 review </a>in which she actually found abnormal phenotype associated with 10/68 nonsatellited marker chromosomes and 6/55 for satellited marker chromosomes identified de novo at prenatal diagnosis (10.9-14.7%). </p>

<p><a href="http://jmg.bmj.com/cgi/content/abstract/43/8/660">M. D. Graf et al. </a>also studied prenatally ascertained marker chromosomes and found “at least a 26% (28/108)  risk for phenotypic abnormality for any non sex de novo supernumary marker chromosome”….“However, if high resolution ultrasound studies are normal, this risk reduces to 18%”.</p>

<p><a href="http://www.nature.com/ejhg/journal/v13/n11/abs/5201473a.html">Oliver Bartsch et al</a> found 42 supernumary marker chromosomes of which 29 were de novo and 13 familial (10 of the resulting children with familial markers appeared unaffected) in 43,273 prenatal diagnoses. So Haig’s 1999 figure is still in the ball park. He concluded that human ‘markers’ are too infrequent to qualify as “fully fledged B chromosomes”. </p>

<p>They got wings tho’…..</p>

<p>Still, ‘marker chromosomes’ are a diagnosis by exclusion. The exclusion criteria vary, and ultimately, their detection is subject to the resolution limit of light microscopy. As Feuk and colleagues point out in their review, researchers can now detect variants well below that limit and all of us have several familial structural variants at birth. Some may indeed be very small marker chromosomes, but most are copy and structural variations in existing chromosomes. Given the caveats above, I won’t hazard a guess at the rate of de novo structural genome variants per live birth here, but will follow this area of the literature with great interest.</p>

<p>For more on human supernumaries, see <a href="http://www.med.uni-jena.de/fish/sSMC/00START.htm">Thomas Liehr’s site in Jena</a></p>]]>
        
    </content>
</entry>
<entry>
    <title>Making really good jam</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/04/making_really_good_jam.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=2519" title="Making really good jam" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.2519</id>
    
    <published>2007-04-27T20:43:39Z</published>
    <updated>2007-04-27T21:53:22Z</updated>
    
    <summary>Three splendid papers on type 2 diabetes in Science yesterday set a new standard for genome wide associations (GWAS) and emphasize the importance of multiple replications. For those with strong nerves, competitive collaboration is a fine way in which to...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p>Three splendid papers on type 2 diabetes in <em>Science</em> yesterday set a new standard for genome wide associations (GWAS) and emphasize the importance of multiple replications. For those with strong nerves, competitive collaboration is a fine way in which to scrutinize the evidence for robust associations. </p>

<p>Several new loci have been successfully identified. In addition, these papers are now the best available model for the ideal way to publish GWAS results showing the genetic landscape of common diseases once the most prominent associations have been noted. Below, I list some of the highlights of the <em>Science</em> papers from the point of view of an editor looking forward to robust standards of proof in this new field.</p>

<p>David Altshuler in Nicholas Wade’s article in the <a href="http://www.nytimes.com/2007/04/27/us/27diabetes.html?hp">NYTimes April 26th</a><em><br />
“I tip my hat to <a href="http://www.nature.com/ng/journal/vaop/ncurrent/abs/ng2043.html">DeCode</a>,” </em>he said. <em>"But the technology is now widely available,”</em> and <strong>in his view the only barrier to other teams contributing to gene discovery would be if they dropped the high standards of statistical rigor developed by the three academic consortiums.</strong></p>

<p>10.1126/science.1142364 Zeggini et al.<br />
<em>“The first wave of validated SNPs sent for replication was selected from the 30 SNPs, in 9 distinct chromosomal regions (excluding <a href="http://www.nature.com/ng/journal/v38/n3/abs/ng1732.html">TCF7L2</a>) which had, in the WTCCC scan alone, attained the most extreme (P<10-5) significance values on Cochran-Armitage tests of association.”</em><br />
MA: I hope the Wellcome Trust Case-Control Consortium scan (unpublished, but cited by Zeggini et al. as a personal communication from Peter Donnelly and the WTCCC) does not provide a precedent for using a single trend test to establish association in a single step!</p>

<p>10.1126/science.1142364 Zeggini et al.<br />
<em>“We prioritized the 5367 SNPs in this range, using additional criteria: (a): evidence of association in DGI and FUSION (6,7); b) presence of multiple, independent (r2 < 0.4) associations within the same locus; and (c) biological candidacy (8, 18)."</em><br />
MA: (a) yes, (b) good idea, let’s add that criterion, (c) no, this seems like moving the dogs back to the lamp post, to abuse Francis Collins’s metaphor :<br />
Lauran Neergaard, <a href="http://www.wtopnews.com/?nid=106&sid=1126266">AP April 26th</a><br />
<em>"We have been for all of the last decade or more looking under the lamppost to try to find those genes ... and lots of times the lamplight was not actually where we wanted it," </em>said Dr. Francis Collins, genetics chief at the National Institutes of Health, a co-author of the research unveiled Thursday. This new approach <em>"allows us to light up the whole street, and look what we find."</em></p>

<p>10.1126/science.1142364 Zeggini et al.<br />
<em>“Our study provides an important validation of the genome-wide indirect association mapping approach and a demonstration of the value of <strong>aggressive data sharing efforts</strong>."</em><br />
MA: _giggles_</p>

<p>10.1126/science.1142382 Scott et al.<br />
<em>“….we….imputed genotypes for an additional >2 million autosomal SNPs.”</em><br />
MA: This is the shape of the future. Both Goncalo Abecasis and Peter Donnelly have talked recently at conferences about getting lots more science for the genotyping by using the <a href="http://www.nature.com/nature/journal/v437/n7063/full/nature04226.html">HapMap</a> to impute genotypes.</p>

<p>10.1126/science/ 1142358 DGI Broad/ Lund/Novartis <br />
<em>“Given this distribution, and <strong>as WGAS are hypothesis generating, we sought replication in independent samples</strong>.”</em><br />
MA: Amen, brother.</p>

<p>10.1126/science/ 1142358 DGI Broad/ Lund/Novartis<br />
<em>….“An intriguing aspect of this association in its location far from any annotated gene.”</em><br />
MA: Yes, this phenomenon was found for <a href="http://www.nature.com/ng/journal/v39/n5/full/ng0507-579.html">prostate cancer </a>too. I think it may be common for common disease associations: enhancers etc. can be far from the nearest gene and we must expect other mechanisms at work not consistently affecting transcription in a haplotype-specific manner.</p>

<p>10.1126/science/ 1142358 DGI Broad/ Lund/Novartis<br />
<em>“We observe strong evidence (P<10-4, rank in the top 100) for six previously reported common variants that influence lipid levels (Table 3).”</em><br />
MA: Admittedly this is post-hoc and the previously reported variants were chosen by similar methods, but this replication seems enormously reassuring.<br />
</p>]]>
        
    </content>
</entry>
<entry>
    <title>Nature Genetics in the news</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/04/nature_genetics_in_the_news_1.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=2344" title="Nature Genetics in the news" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.2344</id>
    
    <published>2007-04-03T18:30:30Z</published>
    <updated>2007-04-03T18:31:16Z</updated>
    
    <summary>This week&apos;s new papers Here’s what we told the world’s journalists last week. You can use Google News to see what they made of our briefing. Please cite Nature Genetics as the source of the following items. If publishing online,...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p><strong><a href="http://www.nature.com/ng/journal/vaop/ncurrent/index.html">This week's new papers </a></strong><br />
Here’s what we told the world’s journalists last week. You can use <a href="http://news.google.com/news?hl=en&ned=us&q=%22nature+%2B+genetics%22&btnG=Search+News">Google News </a>to see what they made of our briefing.<br />
Please cite Nature Genetics as the source of the following items. If publishing online, please carry a hyperlink to <a href="http://www.nature.com/naturegenetics">http://www.nature.com/naturegenetics</a>.</p>

<p><strong><a href="http://www.nature.com/ng/journal/vaop/ncurrent/abs/ng1999.html">Prostate cancer risk variants</a></strong><br />
<a href="http://www.nature.com/uidfinder/10.1038/ng1999">DOI:10.1038/ng1999</a><br />
<a href="http://www.nature.com/uidfinder/10.1038/ng2015">DOI:10.1038/ng2015</a><br />
<a href="http://www.nature.com/uidfinder/10.1038/ng2022">DOI:10.1038/ng2022</a><br />
Several newly identified genetic variants on chromosome 8 are associated with increased risk of developing prostate cancer, according to three studies to be published online this week in Nature Genetics. <br />
Last year scientists in Iceland reported that a variant on chromosome 8 is associated with elevated risk of prostate cancer. Three groups have now revisited this region on chromosome 8, and have found additional variants that independently contribute to prostate cancer susceptibility. One group, led by Kari Stefansson, reports that a second variant is associated with the disease in several populations. This variant is relatively uncommon in individuals of European descent, but quite common in African Americans, suggesting that it accounts for some of the higher risk for prostate cancer observed in individuals of African descent. <br />
The second group, led by David Reich, identified this same variant, as well as five previously undescribed risk variants in populations representing five different ethnic backgrounds. Finally, the third group, led by Stephen Chanock, found one of the variants reported by the Broad study to be associated with elevated risk in five different populations. <br />
According to Reich and colleagues, combinations of these risk variants could account for up to two-thirds of prostate cancer cases in African Americans, and up to one-third of cases in European Americans. The variants do not map to a particular gene, but each group speculates that collectively they could promote amplification of this entire region of chromosome 8, an event that is commonly observed in prostate tumours. </p>]]>
        
    </content>
</entry>
<entry>
    <title>Nature Genetics in the news</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/03/nature_genetics_in_the_news.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=2248" title="Nature Genetics in the news" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.2248</id>
    
    <published>2007-03-22T23:13:21Z</published>
    <updated>2007-03-26T16:14:04Z</updated>
    
    <summary>This week&apos;s new papers Here’s what we told the world’s journalists last week. You can use Google News to see what they made of our briefing. Please cite Nature Genetics as the source of the following items. If publishing online,...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p><strong><a href="http://www.nature.com/ng/journal/vaop/ncurrent/index.html">This week's new papers </a></strong><br />
Here’s what we told the world’s journalists last week. You can use <a href="http://news.google.com/news?hl=en&ned=us&q=%22nature+%2B+genetics%22&btnG=Search+News">Google News </a>to see what they made of our briefing.<br />
Please cite Nature Genetics as the source of the following items. If publishing online, please carry a hyperlink to <a href="http://www.nature.com/naturegenetics">http://www.nature.com/naturegenetics</a>.</p>

<p><strong><a href="http://www.nature.com/ng/journal/vaop/ncurrent/abs/ng2005.html">A variant in a gene called GDF5 is associated with the risk of developing osteoarthritis</a></strong><br />
<a href="http://www.nature.com/uidfinder/10.1038/ng2005">DOI:10.1038/ng2005</a><br />
Osteoarthritis is the most common form of arthritis, affecting more than 20 million people in the United States. The disease is caused by the breakdown and loss of the cartilage in one or more joints during ageing, leading to swelling, pain, and limited mobility. Shiro Ikegawa and colleagues analyzed variation across the GDF5 gene and found that one particular variant was significantly more frequent in two independent populations of Japanese individuals with osteoarthritis of the hip compared to disease-free individuals. The same variant was also found to be significantly more frequent in both Japanese and Chinese individuals suffering from osteoarthritis of the knee. Depending on the study, the variant conferred 30%-80% additional risk of developing the disease. The authors showed that this variant probably reduces the amount of GDF5 produced. As GDF5 is a protein that is secreted by cells and is known to be involved in cartilage development, lower levels of GDF5 may affect the maintenance of cartilage in joints. </p>]]>
        
    </content>
</entry>
<entry>
    <title>Add value to your favorite gene</title>
    <link rel="alternate" type="text/html" href="http://blogs.nature.com/ng/freeassociation/2007/03/add_value_to_your_favorite_gen.html" />
    <link rel="service.edit" type="application/atom+xml" href="http://blogs.nature.com/cgi-bin/mt/mt-atom.cgi/weblog/blog_id=3/entry_id=2170" title="Add value to your favorite gene" />
    <id>tag:blogs.nature.com,2007:/ng/freeassociation//3.2170</id>
    
    <published>2007-03-10T16:16:42Z</published>
    <updated>2007-03-10T21:59:24Z</updated>
    
    <summary>I’m currently attending the ISN-Nature Genetics Forefronts Symposium on Nephrogenetics in Danvers MA. If the Human Variome Project were to produce a comprehensive synopsis of all the variants of a single gene important for kidney function, together with all the...</summary>
    <author>
        <name>Myles Axton</name>
        
    </author>
    
    <content type="html" xml:lang="en" xml:base="http://blogs.nature.com/ng/freeassociation/">
        <![CDATA[<p>I’m currently attending the <a href="http://www.nature.com/isn/news/press_room/2007/0703/full/press_070309_1.html">ISN-Nature Genetics Forefronts Symposium on Nephrogenetics</a> in Danvers MA.</p>

<p>If the <a href="http://www.variome.org/">Human Variome Project </a>were to produce a comprehensive synopsis of all the variants of a single gene important for kidney function, together with all the available evidence for their involvement in human disease, which gene would you choose for them to highlight? The synopsis would cover all the mendelian mutations, association studies with common variants and resequencing for rare variants. Please consider:<br />
1) the prevalence and seriousness of the clinical condition(s)<br />
2) the value added by combining all three types of genetic study listed above<br />
3) the availability of all three kinds of evidence</p>

<p>Please send your recommendations with brief reasons to <strong>variome.pub(at)gmail.com </strong>which is an email address I have set up to collate responses to this question. If you are not a nephrogeneticist (or even if you are) please don’t restrict your answers to kidney genes, this is an important question for the future of the Human Variome Project. I will attempt to post all answers to the blog and would prefer to publish your name with your contribution. Please let me know if you would prefer the comment to be anonymous.</p>

<p>The gene-based view of human variation is fine for molecular biologists and will help clinical geneticists decide whether a variant is likely to cause disease, but I suspect most physicians will be more interested in a disease-based view. In other words, “how many genes can be mutated to produce this phenotype, and what tests are available to help me distinguish between them?”</p>

<p>At this meeting, the difficulty of producing such a view is obvious since the correspondence between genes and kidney diseases is often many-to-many rather than one-to-one. Mutations in the renin-angiotensin system result in tubular dysgenesis. Mutations affecting cilia and centrosomes mostly produce polycystic kidney disease, Bardet-Biedl syndrome and nephronophthisis. However, mutations affecting glomerular filtration by the podocytes and slit diaphragm rather unpredictably produce different clinical conditions: familial glomerulosclerosis and steroid resistant nephrotic syndrome.</p>

<p>Keynote speaker Richard Lifton (Yale) emphasized that studies of human heredity have shown that the ten mutations that raise blood pressure and nine that lower it all influence salt homeostasis in the kidney. There is every indication that variants in these same genes will be important at the population level. </p>

<p>It is certainly difficult to lower blood pressure simply by restricting salt intake. Some find that regular exercise or losing excess weight is also needed. Many also are prescribed diuretics and beta-blockers or ACE inhibitors on the advice of their physician. But to what extent is salt imposed on us by a food industry that exploits our evolutionary adaptation to crave salt? My co-convenor, Qais Al-Awqati (Columbia)  <a href="http://www.nature.com/ki/journal/v69/n10/abs/5001520a.html">said</a> that it is the practice of the <a href="http://www.saltinstitute.org/">Salt Institute</a> to highlight conflicting publications in order to represent the science of blood pressure regulation as “controversial”. Great argument, “the scientists can’t agree, so salt can’t be that bad, can it?”. Now where did I hear that one before, climate change?</p>

<p>To be fair, the Salt Institute response to Qais’s editorial of May last year can be found <a href="http://www.nature.com/ki/journal/v71/n1/abs/5001950a.html">here</a>. </p>]]>
        
    </content>
</entry>

</feed> 

