CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

PSSMFG: a bioinformatics tool for generating descriptors based on PSSM profiles

عنوان مقاله: PSSMFG: a bioinformatics tool for generating descriptors based on PSSM profiles
شناسه ملی مقاله: IBIS07_111
منتشر شده در هفتمین همایش بیوانفورماتیک ایران در سال 1396
مشخصات نویسندگان مقاله:

Mozhgan Mozaffari legha - Department of Biophysics, Faculty of Biological Science, Tarbiat Modares University (TMU), Tehran, Iran
Javad Zahiri - Department of Biophysics, Faculty of Biological Science, Tarbiat Modares University (TMU), Tehran, Iran
S Shahriar arab - Department of Biophysics, Faculty of Biological Science, Tarbiat Modares University (TMU), Tehran, Iran

خلاصه مقاله:
One of fundamental step in the construction of machine learning based models is feature extraction. This step is important to determine the effectiveness of trained models in bioinformatics [1]. In the previous two decades, the variety of feature encoding schemes have been proposed in order to exploit suitable patterns from protein sequences. Most of the schemes are based on sequence information or representation of physicochemical properties. Although direct features derived from sequences themselves are regarded as essential for training models, an increasing number of studies have shown that evolutionary information in the form of PSSM profiles is much more informative than sequence information alone [2]. PSSM-based feature descriptors have been widely used to improve the performance of many types of research such as prediction of protein function, membrane proteins types, proteins structural class [3], protein binding site protein [4], secondary structure [5], disordered regions as well as sub-cellular localization of proteins [6], nevertheless, there is no universal tool for generating this wide variety of descriptors. In this study, PSSMFG (Position-Specific Scoring matrix-based feature generator), a tool that can generate a number of types of PSSM-based feature descriptors, is proposed. PSSMFG performances numerous algorithms available in the literature and provides an easy-to-use interface

کلمات کلیدی:
Feature encoding; Evolutionary information; PSSM

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/712510/