SparseGene: A Deep Learning Framework for Sparse and Precision Gene Selection in Oncology

Authors

  • Somaiya Rahman Liya Department of Dental Surgery, Dhaka Dental College (University of Dhaka), Dhaka, Bangladesh
  • Mehedi Hasan Pritom School of Business, International American University, Los Angeles, CA 90010, USA
  • Sakera Begum School of IT, Washington University of Science and Technology, Alexandria, VA 22314, USA
  • Md Ismail Jobiullah School of IT, Washington University of Science and Technology, Alexandria, VA 22314, USA

Keywords:

Gene Selection, Deep Learning, Sparsity, Non-Small Cell Lung Cancer, RNA-Seq, Precision Oncology

Abstract

In precision oncology, paths of the decision genes of interest are especially crucial to the advancement of stable diagnosis, individual therapeutic interventions. The problem of interpretation or overfitting is one of the inadequacies of the traditional method of LASSO regression, and also the DL model. We redress this through the paradigm of unsupervised representation learning through sparse supervised classification by a newly designed deep learning strategy, SparseGene, which investigates biologically meaningful possibilities on a subset of the genes in high-dimensional RNA-Seq data. SparseGene fits a deep autoencoder to the dimension reduction and a layer constrained in sparsity of classification, and detects a small, precise predictive set of genes. On a non-small cell lung cancer (NSCLC) dataset (GSE89843), SparseGene is able to distinguish between predictive models at 94% accuracy, 0.93 F1-score, with only 28 key genes selected, many of which correspond to other known predictive models in NSCLC that belong to other well-discovered oncogenic circuits such as PI3K-Akt and p53 signaling, as well as EGFR resistance. The robustness of the applicability and translatability to clinical practice, as well as its understandability, is proven by the further pathway enrichment and ROC analysis. The SparseGene provides a very good path to scalable, interpretable, and biologically-aware marker discovery of cancer genomics.

Published

28-08-2025

How to Cite

Somaiya Rahman Liya, Mehedi Hasan Pritom, Sakera Begum, & Md Ismail Jobiullah. (2025). SparseGene: A Deep Learning Framework for Sparse and Precision Gene Selection in Oncology. Well Testing Journal, 34(S3), 450–468. Retrieved from https://welltestingjournal.com/index.php/WT/article/view/208

Issue

Section

Original Research Articles

Similar Articles

1 2 3 4 5 6 7 8 > >> 

You may also start an advanced similarity search for this article.