Q-DFTNet: A chemistry-informed neural network framework for predicting molecular dipole moments via DFT-Driven QM9 data

This study presents Q-DFTNet, a chemistry-informed neural network (ChINN) framework designed to benchmark graph neural networks (GNNs) for dipole moment prediction using the QM9 dataset. Seven GNN architectures, GCN, GIN, GraphConv, GATConv, GATNet, SAGEConv, and GIN+EdgeConv, were trained for 100 e...

Full description

Saved in:
Bibliographic Details
Main Authors: Wayo, Dennis Delali Kwesi, Mohd Zulkifli, Mohamad Noor, Ganji, Masoud Darvish, Saporetti, Camila M., Goliatt, Leonardo
Format: Article
Language:en
Published: John Wiley and Sons Inc. 2025
Subjects:
Online Access:https://umpir.ump.edu.my/id/eprint/47149/1/Q-DFTNet_A%20chemistry-informed%20neural%20network%20framework.pdf
https://doi.org/10.1002/jcc.70206
https://umpir.ump.edu.my/id/eprint/47149/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1858062138545799168
author Wayo, Dennis Delali Kwesi
Mohd Zulkifli, Mohamad Noor
Ganji, Masoud Darvish
Saporetti, Camila M.
Goliatt, Leonardo
author_facet Wayo, Dennis Delali Kwesi
Mohd Zulkifli, Mohamad Noor
Ganji, Masoud Darvish
Saporetti, Camila M.
Goliatt, Leonardo
author_sort Wayo, Dennis Delali Kwesi
building UMPSA Library
collection Institutional Repository
content_provider Universiti Malaysia Pahang Al-Sultan Abdullah
content_source UMPSA Institutional Repository
continent Asia
country Malaysia
description This study presents Q-DFTNet, a chemistry-informed neural network (ChINN) framework designed to benchmark graph neural networks (GNNs) for dipole moment prediction using the QM9 dataset. Seven GNN architectures, GCN, GIN, GraphConv, GATConv, GATNet, SAGEConv, and GIN+EdgeConv, were trained for 100 epochs and evaluated across performance and interpretability metrics. GraphConv achieved the lowest test MSE (0.7054), MAE (0.6196), and the highest R2 (0.6513) with only 16.5k trainable parameters, confirming its optimal accuracy-complexity trade-off. GIN+EdgeConv followed closely with MSE of 0.7386, MAE of 0.6332, and R2 of 0.6349, leveraging edge-awareness for enhanced expressivity. In contrast, attention-based models like GATConv and GATNet underperformed, with test MSEs of 0.9667 and 1.0096, and R2 values of 0.5221 and 0.5009, despite their higher complexity (43.5k and 37.3k parameters). Latent space analysis via t-SNE, PCA, and UMAP showed superior cluster separability for GraphConv, GIN+EdgeConv, and GCN. Clustering metrics corroborated these observations: GraphConv achieved a Silhouette Score of 0.4665, a Davies–Bouldin Index of 0.7111, and a Calinski–Harabasz Score of 1278.40. Cluster-wise molecular dipole means for GIN+EdgeConv ranged from 2.6221 to 2.9606 Debye, reflecting high semantic coherence. Residual analysis and QQ plots confirmed that models with lower MSEs also had near-Gaussian error distributions, enhancing interpretability.
format Article
id my.ump.umpir-47149
institution Universiti Malaysia Pahang
language en
publishDate 2025
publisher John Wiley and Sons Inc.
record_format eprints
spelling my.ump.umpir-471492026-02-11T06:42:41Z https://umpir.ump.edu.my/id/eprint/47149/ Q-DFTNet: A chemistry-informed neural network framework for predicting molecular dipole moments via DFT-Driven QM9 data Wayo, Dennis Delali Kwesi Mohd Zulkifli, Mohamad Noor Ganji, Masoud Darvish Saporetti, Camila M. Goliatt, Leonardo TP Chemical technology This study presents Q-DFTNet, a chemistry-informed neural network (ChINN) framework designed to benchmark graph neural networks (GNNs) for dipole moment prediction using the QM9 dataset. Seven GNN architectures, GCN, GIN, GraphConv, GATConv, GATNet, SAGEConv, and GIN+EdgeConv, were trained for 100 epochs and evaluated across performance and interpretability metrics. GraphConv achieved the lowest test MSE (0.7054), MAE (0.6196), and the highest R2 (0.6513) with only 16.5k trainable parameters, confirming its optimal accuracy-complexity trade-off. GIN+EdgeConv followed closely with MSE of 0.7386, MAE of 0.6332, and R2 of 0.6349, leveraging edge-awareness for enhanced expressivity. In contrast, attention-based models like GATConv and GATNet underperformed, with test MSEs of 0.9667 and 1.0096, and R2 values of 0.5221 and 0.5009, despite their higher complexity (43.5k and 37.3k parameters). Latent space analysis via t-SNE, PCA, and UMAP showed superior cluster separability for GraphConv, GIN+EdgeConv, and GCN. Clustering metrics corroborated these observations: GraphConv achieved a Silhouette Score of 0.4665, a Davies–Bouldin Index of 0.7111, and a Calinski–Harabasz Score of 1278.40. Cluster-wise molecular dipole means for GIN+EdgeConv ranged from 2.6221 to 2.9606 Debye, reflecting high semantic coherence. Residual analysis and QQ plots confirmed that models with lower MSEs also had near-Gaussian error distributions, enhancing interpretability. John Wiley and Sons Inc. 2025-08-13 Article PeerReviewed pdf en https://umpir.ump.edu.my/id/eprint/47149/1/Q-DFTNet_A%20chemistry-informed%20neural%20network%20framework.pdf Wayo, Dennis Delali Kwesi and Mohd Zulkifli, Mohamad Noor and Ganji, Masoud Darvish and Saporetti, Camila M. and Goliatt, Leonardo (2025) Q-DFTNet: A chemistry-informed neural network framework for predicting molecular dipole moments via DFT-Driven QM9 data. Journal of Computational Chemistry, 46 (22). pp. 1-19. ISSN 0192-8651. (Published) https://doi.org/10.1002/jcc.70206 https://doi.org/10.1002/jcc.70206 https://doi.org/10.1002/jcc.70206
spellingShingle TP Chemical technology
Wayo, Dennis Delali Kwesi
Mohd Zulkifli, Mohamad Noor
Ganji, Masoud Darvish
Saporetti, Camila M.
Goliatt, Leonardo
Q-DFTNet: A chemistry-informed neural network framework for predicting molecular dipole moments via DFT-Driven QM9 data
title Q-DFTNet: A chemistry-informed neural network framework for predicting molecular dipole moments via DFT-Driven QM9 data
title_full Q-DFTNet: A chemistry-informed neural network framework for predicting molecular dipole moments via DFT-Driven QM9 data
title_fullStr Q-DFTNet: A chemistry-informed neural network framework for predicting molecular dipole moments via DFT-Driven QM9 data
title_full_unstemmed Q-DFTNet: A chemistry-informed neural network framework for predicting molecular dipole moments via DFT-Driven QM9 data
title_short Q-DFTNet: A chemistry-informed neural network framework for predicting molecular dipole moments via DFT-Driven QM9 data
title_sort q-dftnet: a chemistry-informed neural network framework for predicting molecular dipole moments via dft-driven qm9 data
topic TP Chemical technology
url https://umpir.ump.edu.my/id/eprint/47149/1/Q-DFTNet_A%20chemistry-informed%20neural%20network%20framework.pdf
https://doi.org/10.1002/jcc.70206
https://umpir.ump.edu.my/id/eprint/47149/
https://doi.org/10.1002/jcc.70206
https://doi.org/10.1002/jcc.70206
url_provider http://umpir.ump.edu.my/