The NCI collection is the chemically curated version of the most recent release of the Open Chemical Repository Collection of the National Cancer Institute. The NCI database contains very diverse molecules and has been widely used for virtual screening purposes. All entries have been processed by the mcule structure registration system. Molecules with structural problems and inconsistencies have been filtered out.