From Composition to Structure: The SMACT Approach#
You’ve learnt to find chemically viable compositions through combinatorial screening and intelligent filtering. But here’s the next challenge: How do you predict what crystal structure those compositions will adopt?
This is where structure prediction becomes crucial. Having a composition like “LiCoO₂” tells you the recipe - but will it form a layered structure (good for batteries) or a spinel structure (different properties)? The crystal structure determines the material’s properties.
The Challenge of Structure Prediction#
Traditional structure prediction is quite difficult:
Quantum mechanical calculations are accurate but computationally expensive
Crystal structure databases only contain known materials
Experimental synthesis is time-consuming and may not work
Pure guessing has extremely low success rates
We need smarter approaches that leverage existing knowledge to predict new structures.
SMACT’s Substitution-Based Approach#
SMACT’s structure prediction module takes a data-driven approach based on a simple but powerful idea:
“Similar ions can often substitute for each other in crystal structures”
If we know that Na⁺ and K⁺ behave similarly (both large, singly-charged cations), then we can predict that compounds like KCl might adopt the same structure as NaCl.
This approach is based on the methodology from Hautier et al., who showed that ionic substitution patterns can be learnt from databases and used to predict new materials:
Hautier, G., Fischer, C., Ehrlacher, V., Jain, A., and Ceder, G. (2011)
Data Mined Ionic Substitutions for the Discovery of New Compounds.
Inorganic Chemistry, 50(2), 656-663.
doi:10.1021/ic102031h
How It Works: The Substitution Strategy#
The SMACT approach works in several steps:
1. Build a Structure Database#
Import known crystal structures from Materials Project/ICSD
Create a searchable database of structure-composition relationships
Focus on the most reliable, experimentally-validated structures
2. Learn Substitution Patterns#
Analyse which ionic substitutions occur frequently in nature
Build probability models based on ionic size, charge, and electronegativity
Create “lambda tables” that encode substitution likelihoods
3. Predict New Structures#
Take your target composition (e.g., from previous screening steps)
Find similar compositions in the database
Apply substitution rules to predict likely crystal structures
Rank predictions by probability
4. Validate and Refine#
Check chemical reasonableness of predictions
Calculate formation energies if needed
Provide confidence estimates based on similarity to known materials
What Makes This Useful#
The substitution-based approach offers several advantages:
Speed: Much faster than ab initio crystal structure prediction
Chemical basis: Based on real substitution patterns observed in nature
Reliability: Predictions are grounded in experimentally-validated structures
Integration: Works well with composition screening results
Interpretability: You can understand why a structure was predicted
Scalability: Can screen thousands of compositions quickly
However, this approach also has important limitations:
Conservative: Can only predict structures similar to known materials
Database dependency: Limited by the quality and coverage of your structure database
Incremental discovery: Unlikely to find radically new structural motifs
Substitution rules: Bound by statistical patterns that may not capture all possibilities
Building Toward Chemeleon#
The SMACT structure prediction approach represents the conventional way of doing computational materials discovery:
Start with known structures
Apply chemical rules and statistical analysis
Make incremental improvements through substitution
This approach works well when you want to explore variations of existing materials, but it has clear limitations. You can only predict structures that are chemically similar to what already exists in your database.
What if we could go beyond incremental changes? What if we could generate entirely new crystal structures from scratch, guided only by chemical principles and text descriptions? This is where Chemeleon comes in - offering an AI-driven approach to structure generation that can explore regions of chemical space not accessible through substitution methods.
What You’ll Learn#
In this section, we’ll explore:
Practical Skills#
How to set up and use SMACT’s structure prediction module
Building and querying structure databases
Implementing ionic substitution models
Generating and ranking structure predictions
Conceptual Understanding#
The principles behind substitution-based prediction
How to interpret and validate predictions
When this approach works well (and when it doesn’t)
How traditional methods prepare us for AI approaches
Real Applications#
Predicting structures for battery materials
Finding new phases in known chemical systems
Screening large numbers of compositions for structure types
Preparing inputs for more detailed computational studies
Integration with Previous Learning#
This section builds directly on everything you’ve learnt:
Combinatorial Explosion: Showed us the scale of chemical space
Chemical Filters: Taught us to eliminate impossible combinations
Compositional/Stoichiometry Screening: Generated viable compositions
Structure Prediction: Now predicts crystal structures for those compositions
The Path Forward#
By the end of this section, you’ll have solid grounding in traditional structure prediction methods. This foundation is essential for understanding and appreciating the new approaches that Chemeleon brings to the field.
Let’s begin exploring the world of ionic substitutions and data-driven prediction!