A new method to localize and test the significance of incongruence: detecting domain shuffling in the nuclear receptor superfamily

Syst Biol. 2000 Jun;49(2):183-201.

Abstract

When a data set is partitioned, the resulting subsets may contain phylogenetically conflicting signals if they have different evolutionary histories. In a data set with many taxa, a single taxon that contains multiple phylogenetic histories may result in global incongruence, but no methods are available in a parsimony framework to localize incongruence to specific clades in a phylogeny or to test the significance of incongruence on a local scale. Here we present a new method to quantify the conflict between data partitions for any clade in a phylogeny and to test the statistical significance of that conflict by using a metric called the local incongruence length difference. We apply this method to the evolutionary history of the nuclear receptor superfamily, a large group of transcriptional regulators that play essential roles in metazoan development and physiology. All nuclear receptors are composed of several discrete domains, including one that binds to DNA response elements on specific target genes and another that binds to the appropriate ligand. We have performed combined and separate phylogenetic analyses of these two domains and have tested the hypothesis that nuclear receptors evolved by a simple process of lineage splitting and divergence, without domain shuffling or other forms of sequence transfer between proteins. Our analysis indicates that significant conflict exists between the partitined domains at a few nodes on the tree, suggesting that several groups of receptors are "hybrid proteins" formed by domain shuffling or other forms of sequence transfer between more ancient nuclear receptors.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • DNA / genetics
  • Humans
  • Phylogeny*
  • Receptors, Cytoplasmic and Nuclear / classification*
  • Receptors, Cytoplasmic and Nuclear / genetics*
  • Reproducibility of Results

Substances

  • Receptors, Cytoplasmic and Nuclear
  • DNA