Review ArticleMolecular genetic studies of complex phenotypes
Section snippets
Complexity of the Nuclear Genome
The human nuclear genome (the genome) is an apparently simple and yet an exceedingly complex structure. The genome contains 3.2 billion nucleotides, which are composed of four repeating units that are ordered seemingly in random and are packed inside the nucleus as a 2-m-long polymer covered by the octomeric units of histones. A complex system orchestrates the accessibility of the double-stranded DNA to various proteins that regulate DNA synthesis and gene expression in response to internal and
Diversity of the Human Genomes
Humans are genetically diverse. They differ in approximately 0.1% of their genomes. The single-nucleotide polymorphism (SNP) database (dbSNP, Build 132) lists more than 37 million variants among humans. With the exception of identical twins, no two humans have identical genomes. Every genome contains approximately 4 million DSVs that affect half of the genes in each genome collectively, and many are private (Table I).5, 6, 7, 8, 9, 10, 11, 12 Most DSVs in the genome are SNPs, but structural
Etiologic Complexity of Complex Phenotypes
The plethora of DSVs in the genome and the multilayer regulation of gene expression and function are indicative of the intricacy of the determinants of the complex phenotypes.17 The clinical phenotypes are presumed to result from the additive effects of and interactions among multiple causative alleles with various genomics and environmental factors. In a complex phenotype, the effect sizes of the involved alleles are expected to vary and to follow a gradient that ranges from minimal or
Genetic Approaches to Complex Phenotypes
The full spectrum of allele frequency in a population is expected to follow a gradient ranging from private to extremely common alleles.25 Conventionally, however, the variants are categorized into 3 classes based on their minor allele frequencies (MAFs) in the population. Common and rare variants are those that have population MAFs of >5% and <1%, respectively. Variants that have population MAFs from 1% to 5% are considered uncommon or infrequent. As observed for the genetic causes of
Family studies
Complex phenotypes often show familial aggregation, in part because of shared genetic risk factors. When a single allele exerts a large effect on the phenotype, familial segregation will follow a Mendelian pattern of inheritance. However, the contribution of additional variants to the phenotype can give variable expressivity or incomplete penetrance.75 Unlike the phenotypes with a Mendelian pattern of inheritance, wherein the presence of the variant causes the phenotype, albeit with a variable
Design of Genetic Studies
Genetic studies aim to detect and quantify the risk of a disease or effectiveness of a specific therapy or the risk of adverse side effect at an individual level and yet must depend on group data to attain such information. Consequently, the robustness of the group data is imperative for appropriately extending the group data to an individual. Various factors determine robustness of the group data and the applicability of the findings to an individual, including sample size of the study,
Perspective
The GWAS, which has all but replaced the candidate gene approach, is built on the CD-CV hypothesis. It has been exceedingly successful in identifying many DSVs that are associated with the complex phenotypes. However, the direct clinical utility of the findings is usually limited, as the identified alleles have modest effect sizes on the phenotype, as anticipated. However, the lack or a paucity of the clinical use of GWAS should not lessen value of the main contribution of GWAS in providing
References (87)
Are rare variants responsible for susceptibility to complex diseases?
AmJHumGenet
(2001)- et al.
Common vs. rare allele hypotheses for complex diseases
Curr Opin Genet Dev
(2009) Next-generation DNA sequencing techniques
N Biotechnol
(2009)- et al.
Targeted next-generation sequencing of a 12.5 Mb homozygous region reveals ANO10 mutations in patients with autosomal-recessive cerebellar ataxia
Am J Hum Genet
(2010) - et al.
Targeted capture and next-generation sequencing identifies C9orf75, encoding taperin, as the mutated gene in nonsyndromic deafness DFNB79
Am J Hum Genet
(2010) - et al.
Exome sequencing of ion channel genes reveals complex profiles confounding personal risk assessment in epilepsy
Cell
(2011) - et al.
Linkage disequilibrium in humans: models and data
Am J Hum Genet
(2001) - et al.
Mapping genes by drift-generated linkage disequilibrium
Am J Hum Genet
(1998) - et al.
The sequence of the human genome
Science
(2001) - et al.
Initial sequencing and analysis of the human genome
Nature
(2001)
Initial impact of the sequencing of the human genome
Nature
Revealing the dark matter of the genome
Science
The diploid genome sequence of an individual human
PLoSBiol
Genomics: a picture worth 1000 Genomes
Nat Rev Genet
1000 Genomes Project gives new map of genetic diversity
Science
Comprehensive survey of SNPs in the Affymetrix exon array using the 1000 Genomes dataset
PLoS ONE
The diploid genome sequence of an Asian individual
Nature
The complete genome of an individual by massively parallel DNA sequencing
Nature
A highly annotated whole-genome sequence of a Korean individual
Nature
Mapping and sequencing of structural variation from eight human genomes
Nature
Diversity of human copy number variation and multicopy genes
Science
Mapping copy number variation by population-scale genome sequencing
Nature
Missing heritability and strategies for finding the underlying causes of complex disease
Nat Rev Genet
A map of human genome variation from population-scale sequencing
Nature
Strategic approaches to unraveling genetic causes of cardiovascular diseases
Circ Res
Network medicine: a network-based approach to human disease
Nat Rev Genet
Network for activation of human endothelial cells by oxidized phospholipids: a critical role of heme oxygenase 1
Circ Res
Six new loci associated with blood low-density lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans
Nat Genet
Common variants at 30 loci contribute to polygenic dyslipidemia
Nat Genet
Identification of cis- and trans-acting genetic variants explaining up to half the variation in circulating vascular endothelial growth factor levels
Circ Res
Experimental evidence supports a sex-specific selective sieve in mitochondrial genome evolution
Science
Polymorphic Y chromosomes harbor cryptic variation with manifold functional consequences
Science
Nature's genetic gradients and the clinical phenotype
Circ Cardiovasc Genet
The future of genetic studies of complex human diseases
Science
Uncovering the roles of rare variants in common disease through whole-genome sequencing
Nat Rev Genet
Common and rare variants in multifactorial susceptibility to common diseases
Nat Genet
The allelic architecture of human disease genes: common disease-common variant…or not?
Hum Mol Genet
Molecular Genetic and functional characterization implicate muscle-restricted coiled-coil gene (MURC) as a causal gene for familial dilated cardiomyopathy
Circ Cardiovasc Genet
Rare variants create synthetic genome-wide associations
PLoS Biol
Genome-wide association studies for complex traits: consensus, uncertainty and challenges
Nat Rev Genet
Avoiding the high Bonferroni penalty in genome-wide association studies
Genet Epidemiol
Genome-wide association study identifies eight loci associated with blood pressure
Nat Genet
Biological, clinical and population relevance of 95 loci for blood lipids
Nature
Cited by (0)
Supported by Grant R01-088498 from NHLBI, Grant R21 AG038597-01 from NIA, Burroughs Wellcome Award in Translational Research #1005907, the TexGen Fund from Greater Houston Community Foundation and George and Mary Josephine Hamman Foundation.