谭泗桥, 袁哲明, 柏连阳, 熊洁仪. 基于支持向量机回归与K-最近邻法的组合预测用于除草剂QSAR建模[J]. 农药学学报, 2007, 9(4): 324-329.
    引用本文: 谭泗桥, 袁哲明, 柏连阳, 熊洁仪. 基于支持向量机回归与K-最近邻法的组合预测用于除草剂QSAR建模[J]. 农药学学报, 2007, 9(4): 324-329.
    TAN Si-qiao, YUAN Zhe-ming, BAI Lian-yang, XIONG Jie-yi. The QSAR Modeling of Herbicide Using a Combinatorial Model Based on Support Vector Regression and K-nearest Neighbor[J]. Chinese Journal of Pesticide Science, 2007, 9(4): 324-329.
    Citation: TAN Si-qiao, YUAN Zhe-ming, BAI Lian-yang, XIONG Jie-yi. The QSAR Modeling of Herbicide Using a Combinatorial Model Based on Support Vector Regression and K-nearest Neighbor[J]. Chinese Journal of Pesticide Science, 2007, 9(4): 324-329.

    基于支持向量机回归与K-最近邻法的组合预测用于除草剂QSAR建模

    The QSAR Modeling of Herbicide Using a Combinatorial Model Based on Support Vector Regression and K-nearest Neighbor

    • 摘要: 为了提高定量构效关系(QSAR)研究的预测精度,发展了一种基于支持向量机回归(SVR)与K-最近邻法(KNN)的组合预测方法:以均方误差(MSE)最小为原则,基于SVR实施核函数寻优;基于MSE最小原则与最优核函数以SVR进行描述符筛选并得到保留描述符;通过"多轮末尾强制淘汰法"揭示各保留描述符对预测精度的影响程度;从保留描述符出发,以不同KNN预测值反映样本集异质性并构建子模型,然后基于SVR以留一法实施组合预测。运用该组合预测方法研究苯乙酰胺类除草剂QSAR建模,结果表明:基于SVR与KNN的组合预测方法在参比模型中预测精度最高,具结构风险最小、非线性、能有效克服过拟合、泛化推广能力优异等优点,在QSAR研究中具有广泛的应用前景。

       

      Abstract: To improve the precision of QSAR modeling,a novel nonlinear combinatorial forecast method based on support vector regression (SVR) and K-nearest neighbor (KNN) was proposed.Kernels and descriptors optimization based on SVR were evaluated by the rule of minimum MSE value,and "multi-round enforcement optimization" was taken to illuminate the effect of retained descriptors on forecasting precision.The heterogeneity of sample set was characterized by different KNN and multiple sub-models were assembled,then combinational forecast was carried out based on leave-one-out method.The new method had been employed to study for QSAR on a series of herbicidal materials,N-phenylacetamides,and has the highest prediction precision in all reference models.It has the advantages of structural risk minimization,non-linear characteristics,avoiding the over-fit and strong generalization ability,etc.The novel combination model,so,can be widely used in QSAR.

       

    /

    返回文章
    返回