An introduction to artificial neural networks in bioinformatics--application to complex microarray and mass spectrometry datasets in cancer studies

Brief Bioinform. 2009 May;10(3):315-29. doi: 10.1093/bib/bbp012. Epub 2009 Mar 23.

Abstract

Applications of genomic and proteomic technologies have seen a major increase, resulting in an explosion in the amount of highly dimensional and complex data being generated. Subsequently this has increased the effort by the bioinformatics community to develop novel computational approaches that allow for meaningful information to be extracted. This information must be of biological relevance and thus correlate to disease phenotypes of interest. Artificial neural networks are a form of machine learning from the field of artificial intelligence with proven pattern recognition capabilities and have been utilized in many areas of bioinformatics. This is due to their ability to cope with highly dimensional complex datasets such as those developed by protein mass spectrometry and DNA microarray experiments. As such, neural networks have been applied to problems such as disease classification and identification of biomarkers. This review introduces and describes the concepts related to neural networks, the advantages and caveats to their use, examples of their applications in mass spectrometry and microarray research (with a particular focus on cancer studies), and illustrations from recent literature showing where neural networks have performed well in comparison to other machine learning methods. This should form the necessary background knowledge and information enabling researchers with an interest in these methodologies, but not necessarily from a machine learning background, to apply the concepts to their own datasets, thus maximizing the information gain from these complex biological systems.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence
  • Bayes Theorem
  • Computational Biology / methods*
  • Databases, Genetic*
  • Databases, Protein*
  • Genomics
  • Humans
  • Mass Spectrometry*
  • Microarray Analysis*
  • Neoplasms* / genetics
  • Neoplasms* / metabolism
  • Neural Networks, Computer*
  • Proteomics
  • Reproducibility of Results