Back close

Prediction of Protein Retention Times in Anion-exchange Chromatography Systems using Support Vector Regression

Publication Type : Journal Article

Thematic Areas : Center for Computational Engineering and Networking (CEN)

Source : Journal Chem Inf Computer Science . 2002

Url : https://pubmed.ncbi.nlm.nih.gov/12444731/

Campus : Coimbatore

School : School of Engineering

Center : Center for Computational Engineering and Networking, Computational Engineering and Networking

Department : Center for Computational Engineering and Networking (CEN)

Year : 2002

Abstract : Quantitative Structure-Retention Relationship (QSRR) models are developed for the prediction of protein retention times in anion-exchange chromatography systems. Topological, subdivided surface area, and TAE (Transferable Atom Equivalent) electron-density-based descriptors are computed directly for a set of proteins using molecular connectivity patterns and crystal structure geometries. A novel algorithm based on Support Vector Machine (SVM) regression has been employed to obtain predictive QSRR models using a two-step computational strategy. In the first step, a sparse linear SVM was utilized as a feature selection procedure to remove irrelevant or redundant information. Subsequently, the selected features were used to produce an ensemble of nonlinear SVM regression models that were combined using bootstrap aggregation (bagging) techniques, where various combinations of training and validation data sets were selected from the pool of available data. A visualization scheme (star plots) was used to display the relative importance of each selected descriptor in the final set of "bagged" models. Once these predictive models have been validated, they can be used as an automated prediction tool for virtual high-throughput screening (VHTS).

Cite this Research Publication : Minghu Song, Curt M. Breneman, Jinbo Bi, N. Sukumar, Kristin P. Bennett, Steven Cramer and Nihal Tugcu, “Prediction of Protein Retention Times in Anion-exchange Chromatography Systems using Support Vector Regression” J. Chem. Inf. Comput. Sci. 42, 1347-1357 (2002) DOI: 10.1021/ci025580t IF: 2.902

Admissions Apply Now