Top Qs
Timeline
Chat
Perspective
Molecule mining
Data mining for patterns in molecule data From Wikipedia, the free encyclopedia
Remove ads
Remove ads
Molecule mining is the process of data mining, or extracting and discovering patterns, as applied to molecules. Since molecules may be represented by molecular graphs, this is strongly related to graph mining and structured data mining. The main problem is how to represent molecules while discriminating the data instances. One way to do this is chemical similarity metrics, which has a long tradition in the field of cheminformatics.
![]() | This article may require cleanup to meet Wikipedia's quality standards. The specific problem is: The article is just lists. The reason for each list needs an introduction. (September 2024) |
Typical approaches to calculate chemical similarities use chemical fingerprints, but this loses the underlying information about the molecule topology. Mining the molecular graphs directly avoids this problem. So does the inverse QSAR problem which is preferable for vectorial mappings.
Remove ads
Coding(Moleculei,Moleculej≠i)
Kernel methods
- Marginalized graph kernel[1]
- Optimal assignment kernel[2][3][4]
- Pharmacophore kernel[5]
- C++ (and R) implementation combining
Maximum common graph methods
- MCS-HSCS[9] (Highest Scoring Common Substructure (HSCS) ranking strategy for single MCS)
- Small Molecule Subgraph Detector (SMSD)[10]- is a Java-based software library for calculating Maximum Common Subgraph (MCS) between small molecules. This will help us to find similarity/distance between two molecules. MCS is also used for screening drug like compounds by hitting molecules, which share common subgraph (substructure).[11]
Remove ads
Coding(Moleculei)
Molecular query methods
Methods based on special architectures of neural networks
See also
References
External links
Wikiwand - on
Seamless Wikipedia browsing. On steroids.
Remove ads