HYPOD-X: Foundation Database Released to Advance Data-Driven Research in the Field of Quasicrystals

ISM2024-04
November 14, 2024

A research team, including members from the Institute of Statistical Mathematics, Tokyo University of Science, and the National Institute for Materials Science, has released "HYPOD-X", an extensive database for quasicrystals and their related materials called approximant crystals. Quasicrystals, with their unique ordered yet aperiodic atomic structures, exhibit distinctive physical properties, distinguishing them from conventional crystals. HYPOD-X is the first comprehensive database in this field, compiled through an extensive survey of a broad range of literature. This database is intended to serve as a foundation to advance data-driven research in quasicrystal science.

Background
Quasicrystals are materials with unique, non-periodic symmetry that distinguishes them from conventional crystals. Approximant crystals, often regarded as precursor materials closely related to quasicrystals, share similar compositional and structural features but retain periodic atomic arrangements. These materials exhibit distinct physical properties, such as temperature dependencies of electrical and thermal conductivity that differ from those of conventional metals. However, the lack of a comprehensive database has long been a significant barrier to advancing machine-learning-driven quasicrystal research. Furthermore, there is a growing need for a comprehensive open database, both to deepen our understanding of the relationship between quasicrystal structures and their properties and to stimulate the development of new materials.

Research Content and Results
The research group has developed the world’s first open database for quasicrystals and their approximants, called "HYPOD-X" (Hypermaterials Open Database for X, where X represents a wildcard for application targets, such as machine learning). HYPOD-X provides structured data on the composition, structure, and physical properties of quasicrystals and approximant crystals, extracted from texts and figures in scientific papers and books, in an accessible format for researchers and engineers. This database serves as a foundation for data-driven research in quasicrystal science.

As shown in Figure 1, HYPOD-X comprises three datasets: the composition dataset, the phase diagram dataset, and the property dataset. The data, which have been manually or semi-automatically extracted, undergo rigorous expert review before being added to the database.

Figure 1. Three datasets comprising HYPOD-X and their data collection procedures

The composition dataset serves as a foundational source of information on quasicrystals and approximants. The data, including compositions, structural types, and heat treatment conditions, have been manually collected and entered into the database after rigorous validation by experts. Automated algorithms for detecting erroneous data have also been used to enhance data quality. The data volume is approximately ten times greater than that of a previous study [1] that compiled the compositions of quasicrystals. Using this dataset, the research group successfully discovered new quasicrystals with a machine learning algorithm called TSAI 1.0 [2].

The property dataset includes temperature-dependent data for thermal conductivity, electrical properties, and magnetic properties, extracted from figures and tables in scientific papers and books. Analyzing these data could reveal new patterns that even experts in quasicrystals have overlooked. For instance, quasicrystals tend to exhibit an increase in thermal conductivity at higher temperatures, which is not typically observed in conventional metals or crystals. This unique property could be utilized in the development of thermal rectifying materials that control heat flow in specific directions. Identifying quasicrystals with promising temperature dependencies from this dataset may accelerate the development of new thermal management devices.
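As a rough illustration of how such a screening might look, the following Python sketch fits a least-squares slope to thermal conductivity versus temperature and flags curves that rise at high temperature. The function name, the 300 K cutoff, and the sample curves are all assumptions made for this example; the numbers are invented and are not taken from HYPOD-X, and the actual file layout of the database may differ.

```python
from statistics import mean


def high_temperature_slope(temps_k, kappa, t_min=300.0):
    """Least-squares slope of kappa versus T, using only points with T >= t_min.

    temps_k: temperatures in kelvin; kappa: thermal conductivity values.
    The 300 K default cutoff is an arbitrary choice for illustration.
    """
    pts = [(t, k) for t, k in zip(temps_k, kappa) if t >= t_min]
    if len(pts) < 2:
        raise ValueError("need at least two points at or above t_min")
    t_bar = mean(t for t, _ in pts)
    k_bar = mean(k for _, k in pts)
    num = sum((t - t_bar) * (k - k_bar) for t, k in pts)
    den = sum((t - t_bar) ** 2 for t, _ in pts)
    if den == 0:
        raise ValueError("temperatures must not all be identical")
    return num / den


# Invented illustrative curves (hypothetical samples, not HYPOD-X records).
curves = {
    "sample_A": ([300, 400, 500, 600], [1.0, 1.5, 2.0, 2.5]),
    "sample_B": ([300, 400, 500, 600], [90.0, 88.0, 86.0, 84.0]),
}

# Keep the samples whose conductivity increases with temperature.
rising = [
    name
    for name, (temps, kappa) in curves.items()
    if high_temperature_slope(temps, kappa) > 0
]
print(rising)
```

A simple linear slope is only one possible criterion; a real analysis would likely need to account for measurement noise and the temperature range covered by each source.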

The phase diagram dataset contains digitized data extracted from figures in the extensive literature published to date. Specifically, it stores data quantifying the boundary composition of each phase region, providing compositional ranges and other conditions under which quasicrystals and approximant crystals are thermodynamically stabilized. Applying machine learning to this dataset enables the prediction of new phases for quasicrystals and approximant crystals [2].
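To make the idea of a digitized phase boundary concrete, here is a minimal Python sketch that checks whether a candidate composition lies inside a phase region. It assumes, purely for illustration, that a region is stored as a closed polygon of (x, y) points on a two-dimensional composition plane; the function name, the axes, and the coordinates are hypothetical and do not reflect the actual HYPOD-X schema.

```python
def inside_region(point, boundary):
    """Ray-casting test: return True if `point` lies inside polygon `boundary`.

    point: (x, y) composition coordinates, e.g. at.% of two elements.
    boundary: list of (x, y) vertices of the region, in order around the edge.
    """
    x, y = point
    inside = False
    n = len(boundary)
    for i in range(n):
        x1, y1 = boundary[i]
        x2, y2 = boundary[(i + 1) % n]
        # Does this edge cross the horizontal line through the point?
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            # Count crossings to the right of the point.
            if x < x_cross:
                inside = not inside
    return inside


# Hypothetical rectangular region on a composition plane (invented numbers).
region = [(10.0, 20.0), (20.0, 20.0), (20.0, 30.0), (10.0, 30.0)]

print(inside_region((15.0, 25.0), region))
print(inside_region((25.0, 25.0), region))
```

Points lying exactly on a boundary are not handled specially here, which is usually acceptable given the uncertainty of boundaries digitized from published figures.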

Future prospects
HYPOD-X offers a valuable new resource to advance quasicrystal research. The research group plans to continually expand the database. While data-driven research is becoming popular across various fields of materials science, the limited availability of data has hindered the progress of data-driven quasicrystal research. With the launch of HYPOD-X, a diverse array of data-driven research is expected to emerge. Furthermore, by providing a comprehensive view of extensive data, the database is expected to lead to new insights and scientific principles in quasicrystal science.

Published paper
Title: Comprehensive experimental datasets of quasicrystals and their approximants
Authors: Erina Fujita1,2, Chang Liu1, Asuka Ishikawa3, Tomoya Mato2, Koichi Kitahara4, Ryuji Tamura3, Kaoru Kimura1,2, Ryo Yoshida1,2,5, Yukari Katsura2,6,7
Journal: Scientific Data
DOI: 10.1038/s41597-024-04043-z
Published date: November 13, 2024

1. The Institute of Statistical Mathematics, 2. National Institute for Materials Science, 3. Tokyo University of Science, 4. National Defense Academy, 5. SOKENDAI, 6. Tsukuba University, 7. RIKEN

References
[1] W. Steurer and S. Deloudi, Crystallography of Quasicrystals, Springer Series in Materials Science 126 (Springer, Berlin, Heidelberg, 2009).
[2] C. Liu, K. Kitahara, A. Ishikawa, T. Hiroto, A. Singh, E. Fujita, Y. Katsura, Y. Inada, R. Tamura, K. Kimura, and R. Yoshida, Phys. Rev. Materials 7, 093805 (2023).


Acknowledgements
This work was supported by a MEXT KAKENHI Grant-in-Aid for Scientific Research in Innovative Areas (19H05817, 19H05818, 19H05820) and JST CREST (JPMJCR22O3).

Contact


[Research content]
Erina Fujita, Project Researcher E-mail: fujita-e@ism.ac.jp
Ryo Yoshida, Professor (Director) E-mail: yoshidar@ism.ac.jp
Research Center for Materials Informatics, The Institute of Statistical Mathematics, Research Organization of Information and Systems

[News, public relations]
URA Station, Planning Unit, Administration Planning and Coordination Section
The Institute of Statistical Mathematics, Research Organization of Information and Systems
TEL: +81-50-5533-8580
E-mail: ask-ura@ism.ac.jp
