Research – Open Access
Xu-Wen Wang, Tong Wang, Darius P. Schaub, Can Chen, Zheng Sun, Shanlin Ke, Julian Hecker, Anna Maaser-Hecker, Oana A. Zeleznik, Roman Zeleznik, Augusto A. Litonjua, Dawn L. DeMeo, Jessica Lasky-Su, Edwin K. Silverman, Yang-Yu Liu & Scott T. Weiss
Asthma is a heterogeneous disease with high morbidity. Advancement in high-throughput multi-omics approaches has enabled the collection of molecular assessments at different layers, providing a complementary perspective of complex diseases. Numerous computational methods have been developed for the omics-based patient classification or disease outcome prediction. Yet, a systematic benchmarking of those methods using various combinations of omics data for the prediction of asthma development is still lacking.
We aimed to investigate the computational methods in disease status prediction using multi-omics data.
We systematically benchmarked 18 computational methods using all the 63 combinations of six omics data (GWAS, miRNA, mRNA, microbiome, metabolome, DNA methylation) collected in The Vitamin D Antenatal Asthma Reduction Trial (VDAART) cohort. We evaluated each method using standard performance metrics for each of the 63 omics combinations.
Our results indicate that overall Logistic Regression, Multi-Layer Perceptron, and MOGONET display superior performance, and the combination of transcriptional, genomic and microbiome data achieves the best prediction. Moreover, we find that including the clinical data can further improve the prediction performance for some but not all the omics combinations.
Specific omics combinations can reach the optimal prediction of asthma development in children. And certain computational methods showed superior performance than other methods.