A Self-feedback Knowledge Elicitation Approach for Chemical Reaction Predictions

15 Apr 2024  ยท  PengFei Liu, Jun Tao, Zhixiang Ren ยท

The task of chemical reaction predictions (CRPs) plays a pivotal role in advancing drug discovery and material science. However, its effectiveness is constrained by the vast and uncertain chemical reaction space and challenges in capturing reaction selectivity, particularly due to existing methods' limitations in exploiting the data's inherent knowledge. To address these challenges, we introduce a data-curated self-feedback knowledge elicitation approach. This method starts from iterative optimization of molecular representations and facilitates the extraction of knowledge on chemical reaction types (RTs). Then, we employ adaptive prompt learning to infuse the prior knowledge into the large language model (LLM). As a result, we achieve significant enhancements: a 14.2% increase in retrosynthesis prediction accuracy, a 74.2% rise in reagent prediction accuracy, and an expansion in the model's capability for handling multi-task chemical reactions. This research offers a novel paradigm for knowledge elicitation in scientific research and showcases the untapped potential of LLMs in CRPs.

PDF Abstract

Datasets


  Add Datasets introduced or used in this paper
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Chemical Reaction Prediction Mol-Instruction SLM4CRP METEOR 0.901 # 1
Exact 0.674 # 1
Morgan FTS 0.854 # 1
Validity 0.998 # 1
Reagent Prediction Mol-Instruction SLM4CRP METEOR 0.744 # 1
Exact 0.284 # 1
Validity 1 # 1
Morgan FTS 0.649 # 1
Retrosynthesis Mol-Instruction SLM4CRP METEOR 0.95 # 1
Exact 0.757 # 1
Validity 0.994 # 2
Morgan FTS 0.905 # 1
Forward reaction prediction Mol-Instruction SLM4CRP METEOR 0.993 # 1
Exact 0.945 # 1
Validity 0.997 # 2
Morgan FTS 0.986 # 2

Methods


No methods listed for this paper. Add relevant methods here