SMARTS query filter

The SMARTS language (ref1) is designed to describe substructure patterns in molecules, and to overcome the limitations of simple substructure matching. The SMARTS query filter at mcule.com provides nearly full SMARTS language support, based on the Indigo (ref2) implementation of SMARTS.

Using the SMARTS query filter, you can select or eliminate molecules containing specific SMARTS patterns. The filter accepts multiple SMARTS queries. Molecules matching ALL or ANY of the queries can be either included in or excluded from the result collection.

When to use

The SMARTS query filter is a powerful and flexible tool for complex structural queries that cannot be described with a simple Substructure search (link). For example, if you are looking for molecules containing a phenol substituted with any halogen atom at the para position, you can define the corresponding SMARTS pattern and run the search.
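As an illustration of the para-halophenol example, the short Python sketch below assembles one possible SMARTS pattern from its parts. The function name is hypothetical, and the pattern is only one of several ways to encode this query; it has not been checked against the mcule.com filter.

```python
from typing import Sequence


def para_halophenol_smarts(halogens: Sequence[str] = ("F", "Cl", "Br", "I")) -> str:
    """Build a SMARTS string for a phenol carrying a halogen at the para position.

    Hypothetical helper for illustration only.
    """
    # [F,Cl,Br,I] is a SMARTS atom list: it matches any one of the listed elements.
    halogen_atom = "[" + ",".join(halogens) + "]"
    # [OX2H]        : oxygen with two connections and one hydrogen (hydroxyl group)
    # c1ccc(...)cc1 : aromatic six-membered ring; the substituent in parentheses
    #                 sits on the fourth ring atom, para to the hydroxyl-bearing carbon
    return "[OX2H]c1ccc(" + halogen_atom + ")cc1"


print(para_halophenol_smarts())
```

Restricting the atom list, for example `para_halophenol_smarts(["Cl", "Br"])`, narrows the query to chloro- and bromophenols only.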

Things become more interesting when you consider the batch functionality. With suitable lists of SMARTS queries you can effectively eliminate problematic (non-druglike, toxic, reactive) molecules from a collection; in this way you can use the SMARTS filter to build your own REOS (Rapid Elimination of Swill) filter (link to reos filter) (ref3). Another use case is to generate a collection of specific structures, for example strong acids or bases, using specific SMARTS. All you need is a list of SMARTS queries and the appropriate settings.

How to use

To create a SMARTS query, you can either go through the Daylight documentation (ref4), or use a molecule sketcher supporting the export of the SMARTS format (e.g. MarvinSketch from ChemAxon (ref5)).

Options

You can specify multiple SMARTS queries in one filter, and apply one of the following settings:

- INCLUDE molecules matching ANY of the SMARTS queries (default)
- INCLUDE molecules matching ALL of the SMARTS queries
- EXCLUDE molecules matching ANY of the SMARTS queries
- EXCLUDE molecules matching ALL of the SMARTS queries

For example, the first option returns all molecules that match at least one of the SMARTS patterns. For a REOS-like setup, use the ‘EXCLUDE, ANY’ combination.
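The logic of the four settings can be sketched in a few lines of Python. This is a minimal illustration, not the mcule.com implementation: the function names and mode strings are hypothetical, and the `matches` argument stands in for a real SMARTS matcher from a cheminformatics toolkit. The toy matcher used here only tests for a substring, so the example runs without any chemistry library.

```python
from typing import Callable, List

# (molecule, smarts) -> True if the pattern matches the molecule.
# In practice this would call a real SMARTS matcher; the signature is an assumption.
Matcher = Callable[[str, str], bool]


def keep_molecule(molecule: str, queries: List[str], action: str,
                  quantifier: str, matches: Matcher) -> bool:
    """Return True if the molecule stays in the result collection."""
    if action not in ("INCLUDE", "EXCLUDE"):
        raise ValueError("action must be 'INCLUDE' or 'EXCLUDE'")
    if quantifier not in ("ANY", "ALL"):
        raise ValueError("quantifier must be 'ANY' or 'ALL'")
    hits = (matches(molecule, q) for q in queries)
    combined = any(hits) if quantifier == "ANY" else all(hits)
    # INCLUDE keeps the molecules that satisfy the combined condition,
    # EXCLUDE keeps the ones that do not.
    return combined if action == "INCLUDE" else not combined


def apply_filter(molecules: List[str], queries: List[str], matches: Matcher,
                 action: str = "INCLUDE", quantifier: str = "ANY") -> List[str]:
    """Apply one filter setting to a list of molecules."""
    return [m for m in molecules
            if keep_molecule(m, queries, action, quantifier, matches)]


def toy_matches(molecule: str, query: str) -> bool:
    # Stand-in for real SMARTS matching: plain substring test, NOT chemistry-aware.
    return query in molecule


molecules = ["CCO", "CCN", "CCON", "CCC"]
queries = ["O", "N"]
for action in ("INCLUDE", "EXCLUDE"):
    for quantifier in ("ANY", "ALL"):
        print(action, quantifier,
              apply_filter(molecules, queries, toy_matches, action, quantifier))
```

In this sketch a REOS-like setup corresponds to `action="EXCLUDE", quantifier="ANY"`: a molecule is dropped as soon as it matches a single pattern from the list of unwanted substructures.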

Results

- molecules satisfying search criteria

Limits

The SMARTS filter available in the Free package (link) has the following limitations:

- max 5 queries per filter
- max 1 filter per workflow

To get access to the unlimited SMARTS query filter, subscribe to our Library Design (link) package.

SMARTS language limitations

Our implementation of SMARTS does not distinguish between explicit and implicit hydrogens. In addition, the ‘up or unspecified’ and ‘down or unspecified’ bond notations (‘/?’ and ‘\?’) and the square-planar, trigonal-bipyramidal and octahedral stereo configurations are not supported.

Further information

If you would like further information on the SMARTS format, visit the following web pages by Daylight (developer of the SMARTS language) 6):

SMARTS - A language describing molecular patterns

SMARTS Tutorial

SMARTS Examples

Additional information about the Indigo 7) implementation of SMARTS:

Indigo concepts - Daylight Formats

Difference between SMILES and SMARTS matching in Indigo

1), 4) http://www.daylight.com/dayhtml/doc/theory/theory.smarts.html
2), 7) http://ggasoftware.com/opensource/indigo
3) http://www.sciencedirect.com/science/article/pii/S135964469701163X
5) http://www.chemaxon.com/products/marvin/marvinsketch/
6) http://www.daylight.com

smartsquery.1349993922.txt.gz · Last modified: 2012/10/11 22:18 by rkiss