Data and Code for "Machine Learning for Bridging the Gap between Density Functional Theory and Coupled Cluster Energies"
dc.contributor | Gerbig, Dennis | |
dc.contributor | Schreiner, Peter Richard | |
dc.contributor.author | Ruth, Marcel | |
dc.contributor.other | Institute of Organic Chemistry | de_DE |
dc.date.accessioned | 2023-03-03T08:35:02Z | |
dc.date.available | 2023-03-03T08:35:02Z | |
dc.date.issued | 2023-02-02 | |
dc.description.abstract | The datasets, models, and scripts were created to achieve an accurate prediction of the increment of single-point energies between density functional theory (DFT) and wavefunction-based methods, which led to our submitted article: 'A Machine Learning Approach for Bridging the Gap between Density Functional Theory and Coupled Cluster Energies'. We used the ORCA quantum chemical package to compute the geometries of each species at the B3LYP-D3(BJ)/cc-pVTZ level of theory. The optimized structure was subsequently employed for single-point (SP) computations at the DLPNO-CCSD(T)/cc-pVTZ and CCSD(T)/cc-pVTZ levels of theory. All data were extracted from the calculations and compiled in the provided .csv files. With the datasets and prediction scripts, it is possible to forecast the differences in single-point (SP) energies between the B3LYP-D3(BJ)/cc-pVTZ and DLPNO-CCSD(T)/cc-pVTZ (for monomers and dimers) levels of theory, as well as to the CCSD(T)/cc-pVTZ level of theory for monomers. The datasets can be opened and read with any text editor. The Pytorch models can be loaded and manipulated as usual (https://pytorch.org/tutorials/beginner/saving_loading_models.html). The prediction can be made by installing a suitable Python environment and setting the code line: test_database = f'TestDatabase_{mode}.csv' to the desired dataset for prediction. The format and column names of the file should match the uploaded dataset files. Once the line is modified, a prediction can be generated using the following command, for example, “python gen_predictions_CCSDt.py”. | de_DE |
dc.identifier.uri | https://jlupub.ub.uni-giessen.de//handle/jlupub/10034 | |
dc.identifier.uri | http://dx.doi.org/10.22029/jlupub-9418 | |
dc.language.iso | en | de_DE |
dc.relation | https://doi.org/10.1021/acs.jctc.3c00274 | |
dc.relation | http://dx.doi.org/10.22029/jlupub-18463 | |
dc.rights | CC0 1.0 Universal | * |
dc.rights.uri | http://creativecommons.org/publicdomain/zero/1.0/ | * |
dc.subject.ddc | ddc:540 | de_DE |
dc.title | Data and Code for "Machine Learning for Bridging the Gap between Density Functional Theory and Coupled Cluster Energies" | de_DE |
dc.type | Dataset | de_DE |
local.affiliation | FB 08 - Biologie und Chemie | de_DE |
local.project | SPP 2363, Schr 597/41-1 | de_DE |
Dateien
Originalbündel
1 - 20 von 49
Lade...
- Name:
- DLPNO_Monomers_SI.csv
- Größe:
- 2.04 MB
- Format:
- Unknown data format
- Beschreibung:
- Input features and target value for the DLPNO-CCSD(T)/cc-pVTZ model (monomers).
Lade...
- Name:
- DLPNO_Dimers_SI.csv
- Größe:
- 2.48 MB
- Format:
- Unknown data format
- Beschreibung:
- Input features and target value for the DLPNO-CCSD(T)/cc-pVTZ model (dimers).
Lade...
- Name:
- CCSDt_SI.csv
- Größe:
- 804.23 KB
- Format:
- Unknown data format
- Beschreibung:
- Input features and target value for the CCSD(T)/cc-pVTZ model (monomers).
Lade...
- Name:
- DFT_CCSDt_01.pt
- Größe:
- 91.05 MB
- Format:
- Unknown data format
- Beschreibung:
- The 1. model for prediction of the B3LYP-D3(BJ)/cc-pVTZ to CCSD(T)/cc-pVTZ levels of theory SP energies.
Lade...
- Name:
- DFT_CCSDt_02.pt
- Größe:
- 91.05 MB
- Format:
- Unknown data format
- Beschreibung:
- The 2. model for prediction of B3LYP-D3(BJ)/cc-pVTZ to CCSD(T)/cc-pVTZ levels of theory SP energies.
Lade...
- Name:
- gen_predictions_CCSDt.py
- Größe:
- 6.24 KB
- Format:
- Unknown data format
- Beschreibung:
- Script to predict the increment between SP energies of the B3LYP-D3(BJ)/cc-pVTZ and CCSD(T)/cc-pVTZ levels of theory.
Lade...
- Name:
- gen_predictions_DLPNO.py
- Größe:
- 6.28 KB
- Format:
- Unknown data format
- Beschreibung:
- Script to predict the increment between SP energies of the B3LYP-D3(BJ)/cc-pVTZ and DLPNO-CCSD(T)/cc-pVTZ levels of theory.
Lade...
- Name:
- graph_preprocessing_ccsdt.py
- Größe:
- 10.08 KB
- Format:
- Unknown data format
- Beschreibung:
- Script containing preprocessing steps to generate the molecular graphs.
Lade...
- Name:
- graph_preprocessing.py
- Größe:
- 10.08 KB
- Format:
- Unknown data format
- Beschreibung:
- Script containing preprocessing steps to generate the molecular graphs for DLPNO-CCSD(T).
Lade...
- Name:
- models.py
- Größe:
- 16.83 KB
- Format:
- Unknown data format
- Beschreibung:
- File containing the model class.
Lade...
- Name:
- data_preprocessing.py
- Größe:
- 9.02 KB
- Format:
- Unknown data format
- Beschreibung:
- Script containing preprocessing steps to format the raw data for DLPNO-CCSD(T).
Lade...
- Name:
- data_preprocessing_ccsdt.py
- Größe:
- 9.13 KB
- Format:
- Unknown data format
- Beschreibung:
- Script containing preprocessing steps to format the raw data for CCSD(T).
Lade...
- Name:
- config.py
- Größe:
- 1.71 KB
- Format:
- Unknown data format
- Beschreibung:
- Hyperparameters of the models.
Lade...
- Name:
- DFT_CCSDt_03.pt
- Größe:
- 91.05 MB
- Format:
- Unknown data format
- Beschreibung:
- The 3. model for prediction of B3LYP-D3(BJ)/cc-pVTZ to CCSD(T)/cc-pVTZ levels of theory SP energies.
Lade...
- Name:
- DFT_CCSDt_05.pt
- Größe:
- 91.05 MB
- Format:
- Unknown data format
- Beschreibung:
- The 5. model for prediction of B3LYP-D3(BJ)/cc-pVTZ to CCSD(T)/cc-pVTZ levels of theory SP energies.
Lade...
- Name:
- DFT_CCSDt_04.pt
- Größe:
- 91.05 MB
- Format:
- Unknown data format
- Beschreibung:
- The 4. model for prediction of B3LYP-D3(BJ)/cc-pVTZ to CCSD(T)/cc-pVTZ levels of theory SP energies.
Lade...
- Name:
- DFT_CCSDt_06.pt
- Größe:
- 91.05 MB
- Format:
- Unknown data format
- Beschreibung:
- The 6. model for prediction of B3LYP-D3(BJ)/cc-pVTZ to CCSD(T)/cc-pVTZ levels of theory SP energies.
Lade...
- Name:
- DFT_CCSDt_07.pt
- Größe:
- 91.05 MB
- Format:
- Unknown data format
- Beschreibung:
- The 7. model for prediction of B3LYP-D3(BJ)/cc-pVTZ to CCSD(T)/cc-pVTZ levels of theory SP energies.
Lade...
- Name:
- DFT_CCSDt_08.pt
- Größe:
- 91.05 MB
- Format:
- Unknown data format
- Beschreibung:
- The 8. model for prediction of B3LYP-D3(BJ)/cc-pVTZ to CCSD(T)/cc-pVTZ levels of theory SP energies.
Lade...
- Name:
- DFT_CCSDt_09.pt
- Größe:
- 91.05 MB
- Format:
- Unknown data format
- Beschreibung:
- The 9. model for prediction of B3LYP-D3(BJ)/cc-pVTZ to CCSD(T)/cc-pVTZ levels of theory SP energies.
Lizenzbündel
1 - 1 von 1
Lade...
- Name:
- license.txt
- Größe:
- 7.58 KB
- Format:
- Item-specific license agreed upon to submission
- Beschreibung: