Abstract

A modular distributable system has been built for high-throughput computation of molecular structures and properties. It has been used to process 250,000 compounds from the NCI database and to make the results searchable by structures and properties. The IUPAC/NIST InChI specification and algorithm has been used to index the structures and enforce integrity during computation. A number of novel features of the PM5 Hamiltonian were identified as a result of the high-throughput approach. The system and the data can be redistributed and reused and promote the value of computed data as a primary chemical resource Figure Workflow schematic for conversion of an SDF file to a Mopac input file .

Links and resources

Tags