Small Data Machine Learning in Materials Science
DOI:
https://doi.org/10.70705/ppp.dsei.2024.v02.i01.pp58-73Keywords:
Machine learning, Small data, Materials databaseAbstract
This review discussed the dilemma of small data faced by materials machine learning. First, we analyzed the limitations brought
by small data. Then, the workflow of materials machine learning has been introduced. Next, the methods of dealing with small
data were introduced, including data extraction from publications, materials database construction, high-throughput computations
and experiments from the data source level; modeling algorithms for small data and imbalanced learning from the algorithm
level; active learning and transfer learning from the machine learning strategy level. Finally, the future directions for small
data machine learning in materials science were proposed.

