A systematic analysis of disease-associated variants in the 3' regulatory regions of human protein-coding genes I: general principles and overview

Hum Genet. 2006 Aug;120(1):1-21. doi: 10.1007/s00439-006-0180-7. Epub 2006 Apr 28.

Abstract

The 3' regulatory regions (3' RRs) of human genes play an important role in regulating mRNA 3' end formation, stability/degradation, nuclear export, subcellular localization and translation and are consequently rich in regulatory elements. Although 3' RRs contain only approximately 0.2% of known disease-associated mutations, this is likely to represent a rather conservative estimate of their actual prevalence. In an attempt to catalogue 3' RR-mediated disease and also to gain a greater understanding of the functional role of regulatory elements within 3' RRs, we have performed a systematic analysis of disease-associated 3' RR variants; 121 3' RR variants in 94 human genes were collated. These included 17 mutations in the upstream core polyadenylation signal sequence (UCPAS), 81 in the upstream sequence (USS) between the translational termination codon and the UCPAS, 6 in the left arm of the 'spacer' sequence (LAS) between the UCPAS and the pre-mRNA cleavage site (CS), 3 in the right arm of the 'spacer' sequence (RAS) or downstream core polyadenylation signal sequence (DCPAS) and 7 in the downstream sequence (DSS) of the 3'-flanking region, with 7 further mutations being treated as isolated examples. All the UCPAS mutations and the rather unusual cases of the DMPK, SCA8, FCMD and GLA mutations exert a significant effect on the mRNA phenotype and are usually associated with monogenic disease. By contrast, most of the remaining variants are polymorphisms that exert a comparatively minor influence on mRNA expression, but which may nevertheless predispose to or otherwise modify complex clinical phenotypes. Considerable efforts have been made to validate/elucidate the mechanisms through which the 3' untranslated region (3' UTR) variants affect gene expression. It is hoped that the integrative approach employed here in the study of naturally occurring variants of actual or potential pathological significance will serve to complement ongoing efforts to identify all functional regulatory elements in the human genome.

Publication types

  • Review

MeSH terms

  • 3' Untranslated Regions / genetics*
  • Base Sequence
  • Genetic Predisposition to Disease / genetics*
  • Molecular Sequence Data
  • Mutation*
  • Open Reading Frames / genetics
  • Polymorphism, Genetic
  • Regulatory Sequences, Nucleic Acid*
  • Sequence Homology, Nucleic Acid

Substances

  • 3' Untranslated Regions