Download PDFOpen PDF in browser

Data Generation Using Gene Expression Generator

EasyChair Preprint no. 3948

12 pagesDate: July 25, 2020


Generative adversarial networks (GANs) could be used efficiently for image and video generation, where labeled training data are available in bulk. In general, building a good machine learning model requires a reasonable amount of labeled training data. However, there are areas such that the biomedical field where the creation of such a data set is time-consuming and requires expert knowledge. Our goal is to use data augmentation techniques as an alternative to data collection to improve data classification. We propose the use of a modified version of GAN named Gene Expression Generator (GEG) to augment data samples at hand. The proposed approach was used to generate synthetic data for binary biomedical data sets to trains existing supervised machine learning approaches. Experimental results showed that using GEG for data augmentation with a modified version of leave one out cross-validation increased the performance of classification accuracy.

Keyphrases: adversarial network, breast cancer dataset, Cancer Classification, classification accuracy, colon cancer dataset, cross-validation, data augmentation, data augmentation technique, data generation, gene expression data, Generative Adversarial Networks, machine learning, synthetic data

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
  author = {Zakarya Farou and Noureddine Mouhoub and Tomáš Horváth},
  title = {Data Generation Using Gene Expression Generator},
  howpublished = {EasyChair Preprint no. 3948},

  year = {EasyChair, 2020}}
Download PDFOpen PDF in browser