[2103.13047] Knowledge-aware Contrastive Molecular Graph Learning

Significance

Molecular graph representation learning with hard-coded domain knowledge

Keypoints

  • Propose a framework to learn molecular representation explicitly incorporating the functional group and molecular fingerprint information
  • Experimentally show quantitative and qualitative strengths over other frameworks

Review

Background

Although GNNs have been successful for learning the molecular representation as a graph, previous methods rely on inductive bias obtained by the molecules embedded as the atom-bond graph. Practically, chemical properties are largely dependent on which functional group the molecule contains. Since the functional group is a subgraph level structure, conventional methods that only take atom as its node feature cannot capture this property explicitly.

Keypoints

Propose a framework to learn molecular representation explicitly incorporating the functional group and molecular fingerprint information

210325-1 Proposed CKGNN framework To first inject the functional group information of the molecule, the functional group embedding is concatenated to the atom embedding to serve as the node feature that is input to a GNN as presented in the upper part of the figure. Second information that is hard-coded is the fingerprint embedding. A minibatch is randomly selected from a molecule cluster, and an anchor molecule is selected. A molecule with the top similarity based on the fingerprint is labelled as the similar (positive) molecule, while others are labelled as the dissimilar (negative) molecules. The representation of the molecules obtained from the functional group aware GNN is trained contrastively with the anchor-positive-negative labels.

Experimentally show quantitative and qualitative strengths over other frameworks

210325-2 Molecule prediction performance of the CKGNN Quantitatively, the proposed CKGNN has shown improved performance when compared to baseline models in a number of benchmark datasets.

210325-3 Example of CKGNN predicted similar molecules Qualitatively, an example of the CKGNN inferring chemically-plausible similar structure is provided in the paper. The authors also mention that high variance of the similarity distribution reflects strength of the proposed model, but this claim seems to need further verification.

Related

Share

Comment

#image-generation #multi-modal #language-model #retrieval-augmentation #robotics #forecasting #psychiatry #instruction-tuning #diffusion-model #notice #graph-neural-network #responsible-ai #privacy-preserving #scaling #mixture-of-experts #generative-adversarial-network #speech-model #contrastive-learning #self-supervised #image-representation #image-processing #object-detection #pseudo-labeling #scene-text-detection #neural-architecture-search #data-sampling #long-tail #graph-representation #zero-shot #metric-learning #federated-learning #weight-matrix #low-rank #vision-transformer #computer-vision #normalizing-flow #invertible-neural-network #super-resolution #image-manipulation #thread-summarization #natural-language-processing #domain-adaptation #knowledge-distillation #scene-text #model-compression #semantic-segmentation #instance-segmentation #video-understanding #code-generation #graph-generation #image-translation #data-augmentation #model-pruning #signal-processing #text-generation #text-classification #music-representation #transfer-learning #link-prediction #counterfactual-learning #medical-imaging #acceleration #transformer #style-transfer #novel-view-synthesis #point-cloud #spiking-neural-network #optimization #multi-layer-perceptron #adversarial-training #visual-search #image-retrieval #negative-sampling #action-localization #weakly-supervised #data-compression #hypergraph #adversarial-attack #submodularity #active-learning #deblurring #object-tracking #pyramid-structure #loss-function #gradient-descent #generalization #bug-fix #orthogonality #explainability #saliency-mapping #information-theory #question-answering #knowledge-graph #robustness #limited-data #recommender-system #anomaly-detection #gaussian-discriminant-analysis #molecular-graph #video-processing