Search papers, labs, and topics across Lattice.
This paper introduces Experimental X-ray Diffraction Integrated Transformer (EXIT), a multimodal transformer that combines MOFid (encoding MOF identity) with X-ray diffraction (XRD) data to enable sample-aware prediction of MOF properties. EXIT is pre-trained on one million hypothetical MOFs with simulated XRD patterns to learn transferable representations. Fine-tuning on experimental datasets demonstrates that incorporating XRD data improves prediction accuracy for surface area and pore volume compared to models using only MOF identity, showcasing the model's ability to differentiate samples of the same MOF with varying XRD patterns.
Experimental data can resolve discrepancies in MOF property predictions, with a multimodal transformer leveraging XRD patterns to distinguish between samples sharing the same framework.
Metal-organic frameworks (MOFs) are a major target of machine-learning-based property prediction, yet most models assume that a single framework representation maps to a single property value. This assumption becomes problematic for experimental MOFs, where samples reported as the same framework can exhibit different properties because of differences in crystallinity, phase purity, defects, and other sample-dependent factors. Here we introduce Experimental X-ray Diffraction Integrated Transformer (EXIT), a multimodal transformer for sample-aware prediction of MOF properties that combines MOFid with X-ray diffraction (XRD). In EXIT, MOFid encodes MOF identity, whereas XRD provides complementary information about the experimentally realized sample state. EXIT is pre-trained on one million hypothetical MOFs with simulated XRD to learn transferable representations, leading to improved downstream performance relative to existing approaches. EXIT is fine-tuned on literature-derived experimental datasets for surface area and pore volume prediction. Incorporating experimental XRD improves predictive performance relative to models without experimental XRD, and attention analysis and sample-level case studies further show that EXIT assigns different predictions to samples sharing the same MOF identity when their XRD patterns differ. These results establish a practical step from framework-aware to sample-aware MOF property prediction and highlight the value of incorporating experimental characterization into porous materials informatics.