Development of a Data-Driven Scientific Methodology: From Articles to Chemometric Data Products

Maria Carballo-Meilan, Lewis McDonald, Wanawan Pragot, Lukasz Michal Starnawski, Ali Nauman Saleemi, Waheed Afzal* (Corresponding Author)

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)


Information and data science algorithms were combined to predict the outcome of an experiment in chemical engineering. Using the Scientific Method workflow, we started the journey with the formulation of a specific question. At the research stage, the common process of querying and reading articles on scientific databases was substituted by a systematic review with a built-in recursive data mining method. This procedure identifies a specific community of knowledge with the key concepts and experiments that are necessary to address the formulated question. A small subset of relevant articles from a very specific topic among thousands of papers was identified while assuring the loss of the least amount of information through the process. The secondary dataset was bigger than a common individual study. The process revealed the main ideas currently under study and identified optimal synthesis conditions to produce a chemical substance.
Once the research step was finished, the experimental information was compiled and prepared for meta-analysis using a supervised learning algorithm. This is a hypothesis generation stage whereby the secondary dataset was transformed into experimental knowledge about a particular chemical reaction. Finally, the predicted sets of optimal conditions to produce the desired chemical compound were validated in the laboratory.
Original languageEnglish
Article number104555
Number of pages13
JournalChemometrics and Intelligent Laboratory Systems
Early online date25 Apr 2022
Publication statusPublished - 15 Jun 2022

Bibliographical note

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.


  • Scientific Method
  • Data Mining
  • Meta-Methodology
  • Chemometrics
  • Scientometrics
  • Machine Learning


Dive into the research topics of 'Development of a Data-Driven Scientific Methodology: From Articles to Chemometric Data Products'. Together they form a unique fingerprint.

Cite this