Search from over 60,000 research works

Advanced Search

Crystal structure generation with autoregressive large language modeling

[thumbnail of Open Access]
Preview
s41467-024-54639-7.pdf - Published Version (2MB) | Preview
Available under license: Creative Commons Attribution
[thumbnail of CrystaLLM_Nature__Final_Revisions_.pdf]
CrystaLLM_Nature__Final_Revisions_.pdf - Accepted Version (8MB)
Restricted to Repository staff only
Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Antunes, L. M., Butler, K. T. and Grau-Crespo, R. orcid id iconORCID: https://orcid.org/0000-0001-8845-1719 (2024) Crystal structure generation with autoregressive large language modeling. Nature Communications, 15. 10570. ISSN 2041-1723 doi: 10.1038/s41467-024-54639-7

Abstract/Summary

The generation of plausible crystal structures is often the first step in predicting the structure and properties of a material from its chemical composition. However, most current methods for crystal structure prediction are computationally expensive, slowing the pace of innovation. Seeding structure prediction algorithms with quality generated candidates can overcome a major bottleneck. Here, we introduce CrystaLLM, a methodology for the versatile generation of crystal structures, based on the autoregressive large language modeling (LLM) of the Crystallographic Information File (CIF) format. Trained on millions of CIF files, CrystaLLM focuses on modeling crystal structures through text. CrystaLLM can produce plausible crystal structures for a wide range of inorganic compounds unseen in training, as demonstrated by ab initio simulations. Our approach challenges conventional representations of crystals, and demonstrates the potential of LLMs for learning effective models of crystal chemistry, which will lead to accelerated discovery and innovation in materials science.

Altmetric Badge

Item Type Article
URI https://reading-clone.eprints-hosting.org/id/eprint/119153
Item Type Article
Refereed Yes
Divisions Life Sciences > School of Chemistry, Food and Pharmacy > Department of Chemistry
Uncontrolled Keywords large language models, crystal structure prediction, materials design
Publisher Nature Publishing Group
Download/View statistics View download statistics for this item

Downloads

Downloads per month over past year

University Staff: Request a correction | Centaur Editors: Update this record

Search Google Scholar