A method is presented for comparing crystal structures to identify similarity in molecular packing environments. The relative position and orientation of molecules is captured using interatomic distances, which provide a representation of structure that avoids the use of space-group and cell information. The method can be used to determine whether two crystal structures are the same to within specified tolerances and can also provide a measure of similarity for structures that do not match exactly, but have structural features in common. Example applications are presented that include the identification of an experimentally observed crystal structure from a list of predicted structures and the process of clustering a list of predicted structures to remove duplicates. Examples are also presented to demonstrate partial matching. Such searches were performed to analyse the results of the third blind test for crystal structure prediction, to identify the frequency of occurrence of a characteristic layer and a characteristic hydrogen-bonded chain.
[1]
James A. Chisholm,et al.
A new algorithm for performing three-dimensional searches of the Cambridge Structural Database
,
2004
.
[2]
G. G. Stokes.
"J."
,
1890,
The New Yale Book of Quotations.
[3]
W. Kabsch.
A solution for the best rotation to relate two sets of vectors
,
1976
.
[4]
W. Kabsch.
A discussion of the solution for the best rotation to relate two sets of vectors
,
1978
.
[5]
B. Rohde,et al.
Continuous similarity measure between nonoverlapping X‐ray powder diagrams of different crystal modifications
,
1993,
J. Comput. Chem..
[6]
Jan Kroon,et al.
Fast clustering of equivalent structures in crystal structure prediction
,
1997,
J. Comput. Chem..
[7]
R. Wehrens,et al.
A generalized expression for the similarity of spectra: application to powder diffraction pattern classification
,
2001,
J. Comput. Chem..