Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/31603
Title: Small data materials design with machine learning: When the average model knows best
Authors: VANPOUCKE, Danny E.P. 
van Knippenberg, Onno S. J.
Hermans, Ko
Bernaerts, Katrien V.
Mehrkanoon, Siamak
Issue Date: 2020
Publisher: AMER INST PHYSICS
Source: Journal of applied physics, 128 (5) (Art N° 054901)
Abstract: Machine learning is quickly becoming an important tool in modern materials design. Where many of its successes are rooted in huge datasets, the most common applications in academic and industrial materials design deal with datasets of at best a few tens of data points. Harnessing the power of machine learning in this context is, therefore, of considerable importance. In this work, we investigate the intricacies introduced by these small datasets. We show that individual data points introduce a significant chance factor in both model training and quality measurement. This chance factor can be mitigated by the introduction of an ensemble-averaged model. This model presents the highest accuracy, while at the same time, it is robust with regard to changing the dataset size. Furthermore, as only a single model instance needs to be stored and evaluated, it provides a highly efficient model for prediction purposes, ideally suited for the practical materials scientist.
Keywords: Multitarget Optimization;Information;Regression;Discovery;Diamond
Document URI: http://hdl.handle.net/1942/31603
ISSN: 0021-8979
e-ISSN: 1089-7550
DOI: 10.1063/5.0012285
ISI #: WOS:000559809200001
Rights: © 2020 Author(s).
Category: A1
Type: Journal Contribution
Validations: ecoom 2021
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
AvgModel_printed.pdf
  Restricted Access
Published version2.62 MBAdobe PDFView/Open    Request a copy
Show full item record

WEB OF SCIENCETM
Citations

16
checked on Apr 23, 2024

Page view(s)

52
checked on Sep 7, 2022

Download(s)

6
checked on Sep 7, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.