User Tools

Site Tools


regsys

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Last revisionBoth sides next revision
regsys [2013/02/27 08:04] rkissregsys [2013/10/10 16:03] flack
Line 1: Line 1:
 ====== Mcule Advanced Curation (MAC) ====== ====== Mcule Advanced Curation (MAC) ======
  
-The mcule database is curated by MAC (Mcule Advanced Curation) that involves a rigorous molecule registration system based on more than 80 structural checks, standardization, preparation and correction steps. MAC guarantees high quality search results and avoids common errors arising from mis-drawn and incorrect structures that can critically affect the quality of computational calculations and the efficiency of experimental results.+The mcule database is curated by **MAC (Mcule Advanced Curation)** that involves a rigorous molecule registration system based on more than 80 structural checks, standardization, preparation and correction steps. MAC guarantees high quality search results and avoids common errors arising from mis-drawn and incorrect structures that can critically affect the quality of computational calculations and the efficiency of experimental results.
  
-==== Quality is important ====+**Key features of MAC:** high level data curation, stereochemical standardization, robust novelty check and isomer detection, correct handling of salts & organometallics
  
-The design of screening libraries and the development of predictive drug discovery models all start with a high quality database. Chemical correctness is crucial because mis-drawn and imperfectly defined structures result in incorrect models, misleading predictions and inconsistent hits. Problematic structures should therefore be eliminated at the earliest possible stage from a drug discovery pipeline.+Continue reading for more information about MAC, or check our presentations from the 244th National Meeting of American Chemical Society:
  
 +[[http://mcule-blog.s3.amazonaws.com/acs12/mcule_ACS12_Phi_libraries.pdf|Evaluation of data quality in currently available compound libraries (slides)]]
 +
 +[[http://mcule-blog.s3.amazonaws.com/acs12/mcule_ACS12_libraries.jpg|Evaluation of data quality in currently available compound libraries (poster)]]
 +
 +
 +==== Quality is important ====
  
-The mcule structure registration system is primarily designed to handle chemical structures coming from different data sources, mainly from chemical suppliers, and load the structures into the mcule database. This is a non-trivial task which requires a careful structure check and preparation procedure. To reach a high curation level, the registration system should ensure database quality in terms of structure correctnessuniqueness and reliability as well as maintain high level of data standardization.+The design of screening libraries and the development of predictive drug discovery models **all start with a high quality database**Chemical correctness is crucial because mis-drawn and imperfectly defined structures result in incorrect modelsmisleading predictions and inconsistent hits. Problematic structures should therefore be eliminated at the earliest possible stage from drug discovery pipeline.
  
-All molecules with an MCULE ID have been processed by the mcule structure registration system. User uploaded molecules are not processed by the mcule structure registration system by default. We will enable this option in future.+The mcule structure registration system is primarily designed to correctly handle chemical structures coming from different data sources, mainly from chemical suppliers, and load the structures into the mcule database. This is a non-trivial task which requires a careful structure check and preparation procedure. To reach a high curation level, the registration system should ensure database quality in terms of structure correctness, uniqueness and reliability as well as maintain a high level of data standardization.
  
-**Key features:** high level data curation, stereochemical standardization, robust novelty check and isomer detection, handling salts & organometallics+**All molecules with an MCULE ID have been processed by MAC**. User uploaded molecules are not processed by MAC by default. We plan to enable this option in future.
  
-===Registration challenges===+==== Registration challenges ====
  
  
Line 186: Line 192:
 In most cases the system is also capable of identifying the main components, which can serve as the input set for virtual screens. In most cases the system is also capable of identifying the main components, which can serve as the input set for virtual screens.
  
-You can see below the index page of compound [[https://mcule.com/MCULE-3198812899|MCULE-3198812899]]. This is a maleic and/or fumaric acid salt+You can see below the index page of compound [[https://mcule.com/MCULE-3198812899/|MCULE-3198812899]]. This is a maleic and/or fumaric acid salt
 (uncertainty is marked by crossed double bond). Counter ions are marked, and component multiplicities are assigned correctly by the system. (uncertainty is marked by crossed double bond). Counter ions are marked, and component multiplicities are assigned correctly by the system.
  
 {{:regsys:reg_sys_12.png|}} {{:regsys:reg_sys_12.png|}}
regsys.txt · Last modified: 2013/10/19 11:36 by rkiss