Title:
Inferring biological basis about psychrophilicity by interpreting the rules generated from the correctly classified input instances by a classifier

dc.contributor.authorAbhigyan Nath
dc.contributor.authorKarthikeyan Subbiah
dc.date.accessioned2026-02-07T06:01:47Z
dc.date.issued2014
dc.description.abstractOrganisms thriving at extreme cold surroundings are called as psychrophiles and they present a wealth of knowledge about sequence adjustments in proteins that had occurred during the adaptation to low temperatures. In this paper, we propose a new cascading model to investigate the basis for psychrophilicity. In this model, a superior classifier was used to discriminate psychrophilic from mesophilic protein sequences, and then the PART rule generating algorithm was applied on the input instances that are correctly classified by the classifier, to generate human interpretable rules. These derived rules were further validated on a structural dataset and finally analyzed to discover the underlying biological basis about the psychrophilicity. In this study, we have used one of the key features of psychrophilic proteins accountable for remaining functional in extreme cold temperature surroundings i.e., global patterns of amino acid composition as the input features. The rotation forest classifier outperformed all the other classifiers with maximum accuracy of 70.5% and maximum AUC of 0.78. The effect of sequence length on the classification accuracy was also investigated. The analysis of the derived rules and interpretation of the analyzed results had revealed some interesting phenomena such as the amino acids A, D, G, F, and S are over-represented, and T is under-represented in psychrophilic proteins. These findings augment the existing domain knowledge for psychrophilic sequence features. © 2014 Elsevier Ltd.
dc.identifier.doi10.1016/j.compbiolchem.2014.10.002
dc.identifier.issn14769271
dc.identifier.urihttps://doi.org/10.1016/j.compbiolchem.2014.10.002
dc.identifier.urihttps://dl.bhu.ac.in/bhuir/handle/123456789/26623
dc.publisherElsevier Ltd
dc.subjectAmino acid composition patterns
dc.subjectBiologically interpretable rules
dc.subjectCold adaptation
dc.subjectPART rule induction method
dc.subjectRotation forest
dc.titleInferring biological basis about psychrophilicity by interpreting the rules generated from the correctly classified input instances by a classifier
dc.typePublication
dspace.entity.typeArticle

Files

Collections