Random Forest classification based on star graph topological indices for antioxidant proteins

Enrique Fernández-Blanco, Vanessa Aguiar-Pulido, Cristian Robert Munteanu, Julian Dorado

Research output: Contribution to journalArticlepeer-review

44 Scopus citations

Abstract

Aging and life quality is an important research topic nowadays in areas such as life sciences, chemistry, pharmacology, etc. People live longer, and, thus, they want to spend that extra time with a better quality of life. At this regard, there exists a tiny subset of molecules in nature, named antioxidant proteins that may influence the aging process. However, testing every single protein in order to identify its properties is quite expensive and inefficient. For this reason, this work proposes a model, in which the primary structure of the protein is represented using complex network graphs that can be used to reduce the number of proteins to be tested for antioxidant biological activity. The graph obtained as a representation will help us describe the complex system by using topological indices. More specifically, in this work, Randić's Star Networks have been used as well as the associated indices, calculated with the S2SNet tool. In order to simulate the existing proportion of antioxidant proteins in nature, a dataset containing 1999 proteins, of which 324 are antioxidant proteins, was created. Using this data as input, Star Graph Topological Indices were calculated with the S2SNet tool. These indices were then used as input to several classification techniques. Among the techniques utilised, the Random Forest has shown the best performance, achieving a score of 94% correctly classified instances. Although the target class (antioxidant proteins) represents a tiny subset inside the dataset, the proposed model is able to achieve a percentage of 81.8% correctly classified instances for this class, with a precision of 81.3%.

Original languageEnglish (US)
Pages (from-to)331-337
Number of pages7
JournalJournal of theoretical biology
Volume317
DOIs
StatePublished - Jan 21 2013
Externally publishedYes

Keywords

  • Antioxidant protein
  • Multi-target QSAR
  • Star Graph
  • Topological indices

ASJC Scopus subject areas

  • Statistics and Probability
  • Modeling and Simulation
  • Biochemistry, Genetics and Molecular Biology(all)
  • Immunology and Microbiology(all)
  • Agricultural and Biological Sciences(all)
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Random Forest classification based on star graph topological indices for antioxidant proteins'. Together they form a unique fingerprint.

Cite this