CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Improving the prediction of physical protein interaction by Balanced Random Forest interprotein residue contact predictions using sequence covariation information

عنوان مقاله: Improving the prediction of physical protein interaction by Balanced Random Forest interprotein residue contact predictions using sequence covariation information
شناسه ملی مقاله: IBIS10_126
منتشر شده در اولین همایش بین المللی و دهمین همایش ملی بیوانفورماتیک ایران در سال 1400
مشخصات نویسندگان مقاله:

Sara Salmanian - Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran,Iran
Hamid Pezeshk - School of Mathematics, Statistics and Computer Science, College of Science, University of Tehran, Tehran, Iran (currently visiting Department of Mathematics and Statistics, Concordia University, Montreal, Canada)- School of Biological Sciences, Institute
Mehdi Sadeghi - National Institute of Genetic Engineering and Biotechnology, Tehran, Iran

خلاصه مقاله:
Protein-protein interactions are essential for most cellular processes. There are a lot of protein interactionsand a large number of protein sequences with unknown interacting partners. Prediction of protein interactionfrom sequence information has always been a great challenge. Those predictions would be more challengingwhen someone is supposed to specifically detect physical but not functional protein interplays. Therefore,developing new approaches for the accurate prediction of sequence-based physical protein interactions couldbe an important advancement in computational biology. Inter-protein spatially interrelating residue positionsexhibit correlated patterns of sequence evolution in multiple sequence alignments. Those co-evolutions arewisely exploited for the prediction of physical protein interactions.It is shown that feeding norm values of whole covariation information of protein heterodimers into SupportVector Machines (SVM), could accurately predict the possibility of physical interaction of those dimers usingsequence information. In the present study, Balanced Random Forest (BRF) models were trained with thecovariations of inter-protein residues at different hypothetical interacting sites and then the models wereemployed for the prediction of possible inter-protein residue contacts. Instead of considering whole coevolutionaryinformation, those BRF predictions could take into account the covariation information of moreprobable physically interacting residues for further prediction of protein dimers at higher protein scales. BRFpredicted those more probable contacting residues as positive class and other interacting pairs of amino acidsas negative. After BRF predictions, previously computed covariation scores of negatively predicted residuepartners were zeroized, thereby the role of those pairs in the final calculation of norm values were driven out.Results of the current study indicated that feeding the updated norm values of residue-residue covariationmatrices, obtained after BRF predictions, into SVM models could significantly increase the accuracy of thefinal protein interaction predictions at the protein family level.

کلمات کلیدی:
residue contacts, physical interaction, covariation, protein interaction prediction

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/1473581/