Advances in Clinical and Experimental Medicine

Title abbreviation: Adv Clin Exp Med

ISSN 1899-5276 (print), ISSN 2451-2680 (online)
Periodicity – monthly

2007, vol. 16, no. 1, January-February, p. 85–93

Publication type: review article

Language: English

Bioinformatics: From arduous beginnings to molecular databases

Michał Piast1, Irena Kustrzeba-Wójcicka1, Małgorzata Matusiewicz1, Małgorzata Krzystek-Korpacka1, Teresa Banaś1

1 Department of Medical Biochemistry, Silesian Piasts University of Medicine in Wrocław, Poland

Abstract

This brief review traces the origins of bioinformatics and the development of molecular databases and of tools for analyzing amino acid and nucleotide sequences. It covers the period from 1945 (Sanger’s work on insulin) to 2004 (the release of MEGA3, then the latest version of MEGA, and GenBank release 143). For the purposes of this article, “bioinformatics” means the discipline that combines biology and computer science, and “computational biology” means the process of analyzing and interpreting biological data.

Key words

Software, Databases, Evolution, Phylogenetics, Sequence alignment