TY - JOUR
T1 - Fast prediction of protein-protein interaction sites based on Extreme Learning Machines
AU - Wang, Debby D.
AU - Wang, Ran
AU - Yan, Hong
N1 - Funding Information:
This work is supported by the Hong Kong Research Grants Council (Project CityU123809 ).
PY - 2014/3/27
Y1 - 2014/3/27
N2 - Conventional machine learning methods can be used to identify protein-protein interaction sites and study the gene regulatory networks and functions. However, when applied to large datasets, the computational complexities of these methods become a major drawback. With a significantly reduced computational complexity, the Extreme Learning Machines provide an attractive balance between computational time and generalization performance. In the method proposed in this paper, after searching for interfacial residues using a dynamic strategy and extracting spatially neighboring residue profiles for a set of 563 non-redundant protein chains, we implement the interface prediction either on multi-chain sets or on single-chain sets, using the two methods Extreme Learning Machines and support vector machines for a comparable study. As a consequence, in both multi-chain and single-chain cases Extreme Learning Machines tend to obtain higher Recall values than support vector machines, and in the multi-chain case Extreme Learning Machines as well show a remarkable advantage in the computational speed.
AB - Conventional machine learning methods can be used to identify protein-protein interaction sites and study the gene regulatory networks and functions. However, when applied to large datasets, the computational complexities of these methods become a major drawback. With a significantly reduced computational complexity, the Extreme Learning Machines provide an attractive balance between computational time and generalization performance. In the method proposed in this paper, after searching for interfacial residues using a dynamic strategy and extracting spatially neighboring residue profiles for a set of 563 non-redundant protein chains, we implement the interface prediction either on multi-chain sets or on single-chain sets, using the two methods Extreme Learning Machines and support vector machines for a comparable study. As a consequence, in both multi-chain and single-chain cases Extreme Learning Machines tend to obtain higher Recall values than support vector machines, and in the multi-chain case Extreme Learning Machines as well show a remarkable advantage in the computational speed.
KW - Extreme Learning Machines
KW - Protein-protein interaction sites
KW - Residue sequence profile
KW - Support vector machines
UR - https://www.scopus.com/pages/publications/84893651255
U2 - 10.1016/j.neucom.2012.12.062
DO - 10.1016/j.neucom.2012.12.062
M3 - Article
AN - SCOPUS:84893651255
SN - 0925-2312
VL - 128
SP - 258
EP - 266
JO - Neurocomputing
JF - Neurocomputing
ER -