A dataset of deep learning performance from cross-base data encoding on MNIST and MNIST-C

Research output: Contribution to journalArticlepeer-review

Abstract

Effective data representation in machine learning and deep learning is paramount. For an algorithm or neural network to capture patterns in data and be able to make reliable predictions, the data must appropriately describe the problem domain. Although there exists much literature on data preprocessing for machine learning and data science applications, novel data representation methods for enhancing machine learning model performance remain highly absent within the literature. This dataset is a compilation of convolutional neural network model performance trained and tested on a wide range of numerical base representations of the MNIST and MNIST-C datasets. This performance data can be further analysed by the research community to uncover trends in model performance against the numerical base of its data. This dataset can be used to produce more research of the same nature, testing cross-base data encoding on machine learning training and testing data for a wide range of real-world applications.
Original languageEnglish
Article number111194
JournalData in Brief
Volume57
Issue numberIssue
DOIs
StatePublished - Dec 1 2024

Keywords

  • Data abstraction
  • Data representation
  • Image classification
  • Machine learning

Fingerprint

Dive into the research topics of 'A dataset of deep learning performance from cross-base data encoding on MNIST and MNIST-C'. Together they form a unique fingerprint.

Cite this