by Vijay U. Rathod, Siddhesh Sanjay Amrutkar, Kirti A.
Patil, Aoudumber D. Londhe, Sandip Y.
Bobade, Virendrakumar A. Dhotre, Melkamu Workie Kebede Accurate disease prediction using clinical datasets is essential for improving early diagnosis and clinical decision-support systems; however, many existing deep learning approaches are disease-specific, computationally intensive, and difficult to generalize across heterogeneous biomedical datasets.
This study addresses this challenge by proposing a unified and dataset-aware deep learning framework that enables accurate and interpretable disease prediction across diverse clinical datasets. The framework adopts a modular architecture that selects appropriate models based on dataset characteristics such as feature dimensionality, sample size, and class imbalance.
It integrates multiple deep learning architectures, including MLP, one-dimensional CNN, FT-Transformer, autoencoder-based classifiers, and ensemble strategies. Robust preprocessing, fold-safe feature selection, and nested cross-validation are incorporated to ensure reliable performance evaluation.
The framework is evaluated on three heterogeneous benchmark datasets: the UCI Heart Disease dataset (303 samples, 13 clinical features), the PIMA Indians Diabetes dataset (768 samples, 8 metabolic features), and the Parkinson’s disease voice dataset (195 recordings, 22 acoustic features).
PLOS ONE (Medicine) published a clinical update in Research Highlights on 08 May 2026.
The item focuses on A modular deep learning architecture for interpretable disease prediction across tabular clinical and biometric datasets.
Review the original article for the full source wording and details.