Big data mining and classification of intelligent material science data using machine learning

Research output: Contribution to journal, Article, peer-review

Abstract

The material science community has a pressing need for a big data repository of material compositions and the derived analytics of metal strength. Currently, many researchers maintain their own Excel sheets, which their teams prepare manually by tabulating experimental data collected from scientific journals, and they analyze the data by performing manual calculations with formulas to determine the strength of the material. In this study, we propose three components: a big data store for material science data and its processing-parameter information, which addresses the laborious process of tabulating data from scientific articles; data mining techniques that retrieve information from databases for big data analytics; and a machine learning prediction model that provides insights into material strength. Three models are proposed, based on the logistic regression, support vector machine (SVM), and random forest algorithms. These models are trained and tested using a 10-fold cross-validation approach. The random forest classification model performed best on the independent dataset, with 87% accuracy, compared with 72% for logistic regression and 78% for SVM.
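The 10-fold cross-validation procedure mentioned in the abstract can be illustrated with a minimal, standard-library-only Python sketch. It is not the authors' pipeline: the nearest-centroid classifier is a simple stand-in for the logistic regression, SVM, and random forest models, the data are synthetic, and all function names (`k_fold_indices`, `cross_validate`, `fit_centroids`, `predict_centroid`) are hypothetical.

```python
import random
from statistics import mean


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def cross_validate(fit, predict, X, y, k=10, seed=0):
    """Return the accuracy on each held-out fold of a k-fold split."""
    scores = []
    for held_out in k_fold_indices(len(X), k, seed):
        held = set(held_out)
        train = [i for i in range(len(X)) if i not in held]
        # Fit on the other k-1 folds, then score on the held-out fold.
        model = fit([X[i] for i in train], [y[i] for i in train])
        correct = sum(predict(model, X[i]) == y[i] for i in held_out)
        scores.append(correct / len(held_out))
    return scores


# Stand-in classifier (nearest centroid); NOT one of the paper's models.
def fit_centroids(X, y):
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, value in enumerate(row):
            acc[j] += value
        counts[label] = counts.get(label, 0) + 1
    return {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}


def predict_centroid(centroids, row):
    def sq_dist(label):
        return sum((a - b) ** 2 for a, b in zip(row, centroids[label]))
    return min(centroids, key=sq_dist)


# Synthetic two-class data (assumption: two numeric features per sample).
rng = random.Random(42)
X = [[rng.gauss(c, 1.0), rng.gauss(c, 1.0)]
     for c in (0.0, 4.0) for _ in range(50)]
y = [label for label in ("low", "high") for _ in range(50)]

scores = cross_validate(fit_centroids, predict_centroid, X, y, k=10)
print(f"mean accuracy over {len(scores)} folds: {mean(scores):.2f}")
```

Each sample is held out exactly once, so the mean of the fold accuracies estimates performance on unseen data. Comparing several models, as the study does, would amount to calling `cross_validate` once per model with the same folds.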

Original language: English
Article number: 8596
Journal: Applied Sciences (Switzerland)
Volume: 11
Issue number: 18
State: Published - Sep 2021
Externally published: Yes

Keywords

  • Classification algorithms
  • Data mining
  • Logistic regression
  • MongoDB
  • NoSQL database
  • Random forest
  • Support vector machine (SVM)
