Synthetic Data Generation in the Medical Domain using Generative Adversarial Networks
Synthetic data generation has become important in the medical field for two reasons. First, medical data are not widely available to researchers due to patient privacy protections. Second, in some cases such as rare diseases, only limited data are available, making diagnosis or treatment difficult even for experts. Synthetic medical data generation can address these issues by providing artificial medical data that resembles real data while not associated with real patients.
The goal of this thesis is to generate synthetic tabular data relevant to the medical field using generative adversarial networks (GANs) and to investigate how synthetic data can improve data analysis (machine learning) performance compared to cases where only real data are considered.
Contact person: Dr. Sina Sadeghi <sina.sadeghi@medizin.uni-leipzig.de>