ISSN:
1573-4951
Keywords:
chemoinformatics
;
classification
;
cluster analysis
;
discriminant analysis
;
recursive partitioning
;
topological descriptors
;
2D structural keys
Source:
Springer Online Journal Archives 1860-2000
Topics:
Chemistry and Pharmacology
Notes:
Abstract Classification methods based on linear discriminant analysis, recursive partitioning, and hierarchical agglomerative clustering are examined for their ability to separate active and inactive compounds in a diverse chemical database. Topology-based descriptions of chemical structure from the Molconn-X and ISIS programs are used in conjunction with these classification techniques to identify ACE inhibitors, β-adrenergic antagonists, and H_2 receptor antagonists. Overall, discriminant analysis misclassifies the smallest number of active compounds, while recursive partitioning yields the lowest rate of misclassification among inactives. Binary structural keys from the ISIS package are found to generally outperform the whole-molecule Molconn-X descriptors, especially for identification of inactive compounds. For all targets and classification methods, sensitivity toward active compounds is increased by making repetitive classifications using training sets that contain equal numbers of actives and inactives. These balanced training sets provide an average numerical class membership score which may be used to select subsets of compounds that are enriched in actives.
Type of Medium:
Electronic Resource
URL:
http://dx.doi.org/10.1023/A:1008061017938