1 Department of Information Technology Management, Cumberland University, Lebanon Tennessee United States.
2 Department of Project Management, University of Law, Birmingham, United Kingdom.
International Journal of Science and Research Archive, 2025, 16(01), 628-639
Article DOI: 10.30574/ijsra.2025.16.1.2022
Received on 31 May 2025; revised on 05 July 2025; accepted on 08 July 2025
The growing use of machine learning (ML) in healthcare is constrained by data scarcity, privacy regulations, fragmented data systems, and demographic imbalances. These limitations reduce model accuracy, hinder generalizability, and contribute to algorithmic bias, particularly affecting minority populations and underrepresented disease categories. Generative Adversarial Networks (GANs) have emerged as a promising solution by enabling the creation of synthetic datasets that preserve data utility while enhancing privacy and fairness. This paper explores the use of GAN-based synthetic data in addressing data limitations within healthcare ML pipelines. It examines key GAN architectures suited for structured clinical data, electronic health records (EHRs), and medical imaging, highlighting their training processes and privacy-preserving capabilities. Applications across clinical research, epidemiology, rare disease modeling, and privacy-conscious data sharing are reviewed. The paper further evaluates synthetic data quality using utility metrics, privacy risk assessments, and fidelity–privacy tradeoffs. While synthetic data offers transformative potential, challenges remain in GAN stability, ethical governance, and validation standards. Future directions include integrating federated learning, enhancing explainability, and advancing differential privacy to ensure ethical and inclusive AI development in healthcare.
Synthetic Data; GANs (Generative Adversarial Networks); Healthcare; Machine Learning; Data Scarcity
Preview Article PDF
Muhammad Faheem and Aqib Iqbal. Synthetic data generation in healthcare: Using GANs to overcome data scarcity and bias in machine learning. International Journal of Science and Research Archive, 2025, 16(01), 628-639. Article DOI: https://doi.org/10.30574/ijsra.2025.16.1.2022.
Copyright © 2025 Author(s) retain the copyright of this article. This article is published under the terms of the Creative Commons Attribution Liscense 4.0







