Benchmarking machine-readable vectors of chemical reactions on computed activation barriers

Dataset

In recent years, there has been a surge of interest in predicting computed activation barriers in order to accelerate the automated exploration of reaction networks. Consequently, various predictive approaches have emerged, ranging from graph-based models to methods based on the three-dimensional structure of reactants and products. In tandem, many representations have been developed to predict experimental targets, which may hold promise for barrier prediction as well. Here, we bring together all of these efforts and benchmark various methods (Morgan fingerprints, the DRFP, the CGR representation-based Chemprop, SLATMd, B²R²l, EquiReact and the language model BERT + RXNFP) for the prediction of computed activation barriers on three diverse datasets. This record includes the data supporting the article "Benchmarking machine-readable vectors of chemical reactions on computed activation barriers". It accompanies the GitHub repository https://github.com/lcmd-epfl/benchmark-barrier-learning, which contains the code and a copy of the data.

Identifier
Source	https://archive.materialscloud.org/record/2024.163
Metadata Access	https://archive.materialscloud.org/xml?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:materialscloud.org:2401
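The Metadata Access link above is a standard OAI-PMH GetRecord request. As a minimal sketch (the helper name `build_getrecord_url` is hypothetical, and the base URL and identifier are simply copied from this record), the query string can be assembled and inspected with the Python standard library, without contacting the server:

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Base endpoint copied from the Metadata Access field of this record.
BASE_URL = "https://archive.materialscloud.org/xml"


def build_getrecord_url(identifier, metadata_prefix="oai_dc", base_url=BASE_URL):
    """Assemble an OAI-PMH GetRecord URL (hypothetical helper, for illustration)."""
    params = {
        "verb": "GetRecord",               # OAI-PMH verb: fetch a single record
        "metadataPrefix": metadata_prefix,  # oai_dc = unqualified Dublin Core
        "identifier": identifier,           # OAI identifier of the record
    }
    # safe=":" keeps the colons in the identifier unescaped, as in the link above.
    return base_url + "?" + urlencode(params, safe=":")


url = build_getrecord_url("oai:materialscloud.org:2401")

# Split the URL back into its query parameters to see what each one carries.
query = parse_qs(urlsplit(url).query)
print(url)
print(query)
```

Fetching the resulting URL (for example with `urllib.request.urlopen`) should return the Dublin Core XML for this record, assuming the endpoint is reachable.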

Provenance
Creator	van Gerwen, Puck; Briling, Ksenia R.; Calvino Alonso, Yannick; Franke, Malte; Corminboeuf, Clemence
Publisher	Materials Cloud
Publication Year	2024
Rights	info:eu-repo/semantics/openAccess; Creative Commons Attribution 4.0 International https://creativecommons.org/licenses/by/4.0/legalcode
OpenAccess	true
Contact	archive(at)materialscloud.org

Representation
Language	English
Resource Type	Dataset
Discipline	Materials Science and Engineering