Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Next Article in Journal
Multi-Objective Path-Decision Model of Multimodal Transport Considering Uncertain Conditions and Carbon Emission Policies
Previous Article in Journal
On the Modified Stokes Second Problem for Maxwell Fluids with Linear Dependence of Viscosity on the Pressure
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complexity of Molecular Nets: Topological Approach and Descriptive Statistics

by
Alexander M. Banaru
1,2,* and
Sergey M. Aksenov
2,3,*
1
Faculty of Chemistry, Moscow State University, Vorobievy Hills, 119991 Moscow, Russia
2
Laboratory of Arctic Mineralogy and Materials Sciences, Kola Science Centre, Russian Academy of Sciences, 14 Fersman Str., 184209 Apatity, Russia
3
Geological Institute, Kola Science Centre, Russian Academy of Sciences, 14 Fersman Str., 184209 Apatity, Russia
*
Authors to whom correspondence should be addressed.
Symmetry 2022, 14(2), 220; https://doi.org/10.3390/sym14020220
Submission received: 5 November 2021 / Revised: 13 December 2021 / Accepted: 12 January 2022 / Published: 24 January 2022
(This article belongs to the Special Issue Mathematical Crystallography 2021)

Abstract

:
The molecular net complexity (HmolNet) is an extension of the combinatorial complexity (Hmol) of a crystal structure introduced by Krivovichev. It was calculated for a set of 4152 molecular crystal structures with the composition of CxHyOz characterized by the structural class P21/c, Z = 4 (1). The molecular nets were derived from the molecular Voronoi–Dirichlet Polyhedra (VDPmol). The values of the molecular coordination number (CNmol) and critical coordination number (CNcrit) are discussed in relation with the complexity of the crystal structures. A statistical distribution of the set of molecular crystals based on the values of CNmol, CNcrit, and the complexity parameters is obtained. More than a half of the considered structures has CNmol = 14 and CNmol′ = 9 with the Wyckoff set of edges e5dcba. The average multiplicity of intermolecular contacts statistically significantly decreases from 1.58 to 1.51 upon excluding all contacts except those bearing the molecular net. The normalized value of HmolNet is of the logistic distribution type and is distributed near 0.85HmolNet with a small standard deviation. The contribution of Hmol into HmolNet ranges from 35 to 95% (mean 79%, SD 6%), and the subset of bearing intermolecular contacts accounts for 41 to 100% (mean 62%, SD 11%) of the complexity of the full set of intermolecular contacts.

1. Introduction

According to the approach initially developed by Shannon in his theory of communication [1], the complexity of a message consisting of symbols depends on the probability of occurrence of each symbol in the message. In particular, quantifying information content of a message in bits corresponds as the function:
H = i = 1 k L p i
L p i   =   0     p i = 0 , p i log 2 p i   ( p i > 0 )
where pi is the probability of i-th symbol to appear in the given message. Any graph with certain types of vertices may be considered as a message, as well. Finite graphs corresponding to the molecules belong to a wide class of so-called chemical graphs, and the approaches of measuring information content for them were introduced in the 1950s by Trucco [2] and in the early 2000s reviewed by Bonchev [3]. Commonly, the vertices of a chemical graph G are referred to as equivalent if they belong to the same orbit of the automorphism group of the graph Aut(G), which is isomorphic to the maximal symmetry group of the graph. The information measures of molecules and their ensembles was recently reviewed by Sabirov and Shepelevich [4]. Information content of molecules is of a specific interest due to the studying of chemical reactions [5], molecular aggregation processes [6], searching the reason for the first bioorganic molecules to appear [7], etc.
A crystal structure can also be represented by a finite graph called quotient graph introduced by Chung et al. in the 1980s [8]. In fact, the quotient graph is a “molecule” of a non-molecular crystal. A quotient graph of the crystal structure maps atoms onto the vertices and chemical bonds onto the edges or loops and reflects the connectivity of the reduced unit cell of the structure. The quotient graph is a useful tool to enumerate nets occurring in crystal structures [9] and perform a topological analysis of underlying nets [10,11]. The cyclomatic number of the quotient graph equals the dimensionality n of the Euclidean space En in which the net derived from the quotient graph may be embedded being periodic in the same number of linearly independent directions. In such a space, the deletion of any edge lattice of the net leads to a disconnected net, and the net is referred to as minimal [12,13,14]. For instance, the diamondoid net is minimal in E3, while the quartz net is minimal in E4. Embeddings of some typical nets in E3 were enumerated in Reticular Chemistry Structure Resource (RCSR) [15], where each net is characterized by the maximal possible symmetry achieved by a barycentric placement of the vertices [16,17].
The amount of information stored by the quotient graph of the crystal structure was introduced by Krivovichev [18,19] to quantify the information content of the crystal. In this case, the probability pi from (2) is calculated as pi = mi/v, where mi is the multiplicity of the i-th crystallographic orbit occupied by atoms; v—the number of atoms in the reduced unit cell. Later, Hornfeck [20] complemented this measure by terms considering the degrees of freedom associated with a translational motion of an atom along the Wyckoff position and Kaußler and Kieslich [21] adapted this measure to positionally disordered crystals. However, for molecular crystals the information content calculated using this approach indicates the complexity of the molecule itself instead of the crystal structure. The possible scheme of avoiding this problem was proposed in [22]:
H molNet = H 2 N , CN mol   + 2 N 2 N + CN mol H mol + CN mol 2 N + CN mol H edge ;
H 2 N , CN mol   = 2 N 2 N + CN mol log 2 2 N 2 N + CN mol CN mol 2 N + CN mol log 2 CN mol 2 N + CN mol ;
H molNet ,   tot =   N +   CN mol / 2 H molNet
where N is the number of atoms in the molecule, CNmol—the molecular coordination number, Hedge—the information content of the molecule, Hedge—the information content of the edge net of the molecular net, HmolNet—the combination of Hmol and Hedge with the property of strong additivity [20,23]. The value of HmolNet is meaningful even for high symmetric molecular structures with the only orbit occupied by the atoms (i.e., I2, S6, and α-N2) [22]. It should be noted that the molecule in the crystal structure is commonly distorted, and the only symmetry operation retained in a molecule (in more than 90% cases) is the inversion center [24], which requires for the preserving of dense packing according to Kitaigorodskii [25]. Generally, a more conformational lability of the molecule promotes a more diverse set of contacts in the coordination shell and should result in the increasing of the molecular net complexity. On the other hand, certain molecular fragments have the opportunity to form a specific intermolecular interaction, such as H-bonds, π … π interactions, Hal…Hal, etc. In such a case the small subset of interactions often predominates in the crystal structure and, in fact, is bearing the entire net of the intermolecular contacts. However, the subset of bearing contacts may include excessive interactions and thus be redundant. The portion of the bearing subnet complexity attributable to the target engineered interactions may serve an indicator of effectiveness of a crystal engineering technique, as the latter aims to reproduce targeted bearing contacts.
In this work the formalism (3)–(5) previously discussed in [26] is tested for the set of more than 4000 homomolecular crystals with the general formula CxHyOz of a structural class P21/c, Z = 4(1) (such notation indicates that there is exactly one symmetrically unique molecule occupying a general orbit in the space group P21/c). This structural class is of the special interest as the most widespread among organic crystals and corresponding to ~1/3 of all homomolecular structures and more than 1/2 of homomolecular racemates [27]. The aim of this work is to investigate the partitioning of intermolecular contacts from the coordination shell of the molecule into equivalence classes and to obtain the distribution type and the descriptive statistics of HmolNet.

2. Methods

The initial set of the crystal structures was extracted from Cambridge Structural Database (CSD) [28] using the following restrains: the presence of atomic coordinates, the absence of errors and/or disorder, and R-factor < 5%. Out of 4249 high-quality molecular crystal structures [26] selected from CSD ver. 5.41 (with updates), the set of 4152 structures without duplicates was retained for further investigation. The criteria of considering a structure as a duplicate were the same cell dimensions (with the tolerance of 2σ), the same chemical composition, space group and Wyckoff sequence.
The construction of molecular nets was carried out using the ToposPro program [29] by calculating the solid angles of the molecular Voronoi-Dirichlet polyhedron (VDPmol). According to Blatov [30], VDPmol is the superposition of atomic VDPs in a molecule, and the solid angle (Ω) corresponding to an intermolecular contact arises from interatomic contacts as:
Ω   = Ω i j Ω Σ × 100 %
where Ωij is the solid angle for the intermolecular contact ij, and ΩΣ—the sum of solid angles for all the interatomic contacts for the given molecule with the adjacent ones. Interatomic contacts with Ωij < 1.5% of 4π steradians are omitted. In the same way, intermolecular contacts with Ω < 1.5% in this work have been omitted, while the adjacent molecules with Ω ≥ 1.5% are considered as the coordination shell of the initial molecule (Figure 1). As a rule, for non-specific van der Waals interactions the descending order of Ω corresponds to the decrease of interaction energy, allowing to avoid energy calculations for the assessment of supramolecular arrangement [26].
To derive the molecular net, the atoms were pulled to the mass center of the molecule. The molecular coordination number (CNmol), which includes only symmetrically independent intermolecular contacts, is marked by a prime (CNmol′), i.e., acrylic acid (ACRLAC04 [31]) has CNmol = 12 (cuboctahedron) and CNmol′ = 8. The subset of bearing contacts generating so-called critical net for a given molecule was defined in [32]. In a monosystemic crystal structure the center of gravity of each molecule is connected with the centers of gravity of CNmol adjacent molecules, and VDP faces have the following order of the solid angles: Ω1 > Ω2 > Ω3 > … > Ωn (symmetrically equivalent contacts have the same Ω). For any value of n, there is 1 ≤ kn such that if all edges corresponding to the solid angles Ωk, Ωk+1, …, Ωn are removed from the net, the resulted net becomes disconnected. The value max(k) is called a “critical coordination number with a prime” (CNcrit′). If all symmetrically equivalent contacts are considered, the corresponding value is called a “critical coordination number without a prime” (CNmol). For instance, acrylic acid (ACRLAC04) has CNcrit = 5 (square pyramid) and CNmol′ = 4. To derive a CNcrit, firstly, the edges of the net of intermolecular contacts, for which Ω > 15%, were removed from the net. In all cases, this led to reduction of the net’s dimensionality from 3D to 2D, 1D, or 0D. Then the contacts with Ω = 14.5–15.0% were returned to the adjacency matrix of the centers of gravity of the molecules, and a check was performed to establish the dimensionality of the net again. If the dimensionality was 3D, the returned contacts were referred as Ωcrit = Ωmax(k), and the constructed 3D net was considered a net of bearing contacts. If the dimensionality of the net did not increase to 3D, then the contacts with Ω = 14.0–14.5% were added to the adjacency matrix, and the dimensionality of the net was checked again. This procedure was repeated with the step of 0.5% until Ωcrit was found for all the structures. The less step values are not reliable since the measurement error is about 0.5%. The obtained distribution of the crystal structures of the considered set is close to normal (Figure 2).
The nets of intermolecular contacts in the most symmetrical embedding in E3 are classified either in accordance with RCSR [15] or TopCryst database [33] (when RCSR classification is lacking). The nets those remain unclassified in RCSR and TopCryst database up to date are characterized by a point symbol. The net for the crystal structure of acrylic acid has the RCSR code fcu (cubic closest packing), while the net of bearing contacts—sqp (Figure 3). For a CN-coordinated net there are CN(CN–1)/2 angles. The shortest cycle in each angle should be identified. The point symbol in the form Aa.BbCc indicates that there are a angles that are A-cycles, b angles that are B-cycles, etc. (A < B < … < C) [34]. For instance, the fcu net has 12∙11/2 = 66 angles in each vertex, and its point symbol is 324.436.56, while the sqp net has 5∙4/2 = 10 angles in each vertex, and its point symbol is 44.66.
If there are p sorts of vertices and q sorts of edges in the net, then the net is called p,q-transitive. For instance, the fcu and sqp nets are 1,1 and 1,2-transitive, respectively. In fact, p and q denote the minimal number of orbits occupied by the molecular centers of gravity and the contacts, respectively, and interrelate with the molecular net complexity for its most symmetric embedding in E3.
The complexity of a molecular net was calculated using (3)–(5). The structural information content (SIC = 0–1) [4] meaning the same as a normalized informational complexity [19] and was calculated as follows:
SIC = H/max(H)
where max(H) is the maximal possible value of H, when each vertex constitutes its own equivalence class: max(Hmol) = log2N; max(Hedge) = log2CNmol; max(HmolNet) = log2(2N + CNmol).
The molecule of acrylic acid has N = 9 atoms and all of them are symmetrically unique (the Wyckoff set e9 in the space group P21/c), mi = 4, v = 36. Consequently, Hmol = −9∙4/36∙log2(4/36) = 3.170 bits/atom. The edge net of the CNmol-coordinated molecular net is generated by the midpoints added to each edge of the molecular net. Two midpoints are connected if and only if they are adjacent to the same vertex, and the vertices of the initial net are removed. The final net (edge net) is 2(CNmol − 1) = CNedge-connected. The edge net for acrylic acid is 22-connected and contains 8 symmetrically independent vertices with the Wyckoff sequence e4dcba, v = 24, Hedge = −16/24∙log2(16/24) − 4∙2/24∙log2(2/24) = 2.918 bits/contact; H(2N, CNmol) = H(18, 12) = 0.971 bits/d.f. (per a degree of freedom), HmolNet = 4.040 bits/d.f., SICmolNet = 0.823, HmolNet,tot = 4.040∙15 = 60.60 bits/molecule. Note that if just bearing contacts are included in the net, then the edge net would be 8-connected and contain only 4 of 8 independent vertices with the Wyckoff sequence edca, v = 10, Hedge = −4/10∙log2(4/10) − 3∙2/10∙log2(2/10) = 1.922 bits/contact. This net is characterized by the unknown topological type.
The discriminatory power of H, based on the probability of two unrelated objects being characterized as the same type, was calculated according to the following equation [35]:
D = 1 1 N N 1 j = 1 s x j x j 1
where N is the number of the tested crystal structures, s the number of different types of structures with respect to H, and xj the number of objects belonging to the j-th type. The correlations between calculated values were sought in the Mathematica software ver. 11.0 [36].

3. Results and Discussion

Crystal structures of the analyzed set are distributed over CNmol, generally, in accordance with the earlier results obtained by Carugo et al. [37]. More than a half of the crystal structures have CNmol = 14, and the second ranked value CNmol = 16. The most frequent values of CNcrit are 5, 4, and 6, but unlike CNmol there is no sharp peak on any of the values (Figure 4). More than a half of the structures is characterized by CNmol′ = 9 (with e5—2355 structures; with e6—84 structures; with e4—13 structures; e7ba—1 structure with refcode HINSOM [38]), and most abundant CNcrit′ is its least value 3 (eba, ecb, e—856 structures; e2a, e2b, e2—737 structures; e3—81 structures).
In the structural class P21/c, Z = 4(1) each molecule can form contacts with a multiplicity 1 or 2. The former corresponds to a so-called involution, a symmetry element of the order 2 (the midpoint of a contact occupies the Wyckoff position e). The only involution presence in the space group P21/c is the inversion center 1 ¯ (the midpoint of a contact occupies the Wyckoff position a, b, c or d). All other contacts are formed via a screw axis 21, or a glide plane c, or a translation along some direction. It is easy to show that the average multiplicity in between 1 and 2 equals to v/2CN′. According to two-sample t-test, the difference of the mean values for all contacts and for those bearing the net is statistically significant (p-value < 0.001). Moreover, the minimal multiplicity is 1.375 (3 structures) for the hole net of molecular contacts unlike 1.000 (1 structure with refcode KOLRAF [39]) for the critical subnet (Table 1). That is why the subnet of bearing contacts is, in average, more enriched by the inversion centers than the hole molecular net. As shown above, 2355 structures have the Wyckoff sequence e5dcba (or similar) for the edge net, and mean multiplicity in this case is (5∙2 + 4)/9 = 1.556. Motherwell [40] previously studied the projection patterns formed by projecting coordination shell of a molecule into 2D in different space groups with none of the special positions occupied. The majority of projection patterns in P21/c contained at least one contact via an inversion center.
The distribution of molecular nets in the considered series over the topological types is, generally, in accordance with the trend previously found by Carugo et al. for 105 549 packings of small molecules [37]. The most widespread topological type is bcu-x, a type derived from the body-centered cubic lattice where the coordination shell of the atom is extended by the second coordination shell (CNmol = 8 + 6). This topological type has the least topological density TD10 that reflects the total number of vertices in the first 10 of coordination shells, among all 14-coordinated nets reported for centrosymmetric [41] and non-centrosymmetric [32] crystalline hydrocarbons, some inorganic molecular crystals [22] (i.e., 14T191 in the orthorhombic sulfur, α-S8), and those with the most popularity amongst all small molecular crystals [37]. Recently studied crystal structure of 2-(tert-butyl)-4-chloro-6-phenyl-1,3,5-triazine with 2 symmetrically independent molecules [42] is characterized by the 14T319 type topology (after neglecting contacts with Ω ≤ 2%), which occupies the opposite side of 14-coordinated molecular nets with respect to TD10 (Table 2). The more 2nd or 3rd CN does not mean the more 4th and 5th CNs. For instance, in the 2nd coordination sphere there are 54 vertices in 14T134 topological type and 53 in 14T10; nevertheless, TD10 for 14T10 is slightly higher. Remark that the TopCryst database has been extended last years by many new topological types with large CNs, including CN = 14. Thus, the previously found in 2019 a 14T134 topological type [32] in the crystal structure of spiropentane (refcode VAJGOC [43]) has no reference code in the TopCryst database. The corresponding molecular net in the most symmetric embedding in E3 is 1,6-transitive and has the space group R 3 ¯ c with the only general position occupied by centers of gravity of molecules.
Consider three typical examples of molecular nets realized in α-methyl-trans-cinnamic acid (refcode: BEJVOB [45]), 5-methoxyindan-1-one (refcode: KACSOX01 [46]), which are both isomers with the chemical formula C10H10O2 (Figure 5), and (1RS,3SR,4SR)-trispiro(2.0.0.2.1.1)nonane-1-carboxylic acid with the chemical formula C10H12O2 (refcode: FAFDEW [47]). The Wyckoff sequences for the molecules are: e22 for BEJVOB and KACSOX01, and e24 for FAFDEW. This leads to a slightly different values of Hmol: Hmol = 4.459 bits/atom for BEJVOB and KACSOX01, and Hmol = 4.585 bits/atom for FAFDEW. All other structures from the set of 4152 structural files show exactly the same distribution of atoms over general positions, i.e., they have the maximal Hmol for the given N (SIC = 1). Indeed, if a molecular center of gravity occupies a general position, then no atom is able to occupy an inversion center, otherwise the other atoms should be related by the inversion center and the molecule would either occupy the special position or have a symmetry-induced disorder (the latter was restricted by the structure selection). The linear correlation coefficient between N and the molecular mass in the analyzed set is 0.936, between N and Hmol − 0.959, and between the molecular mass and Hmol − 0.889. Consequently, there is a strong positive linear correlation between the molecular mass, N, and Hmol.
There are three crystal structures with CNmol = 14, but characterized by the different topological types (Table 3): bcu-x, gpu-x, and tcg-x. Furthermore, all the structures have CNmol′ = 9 and the same Wyckoff sequence for the midpoints of intermolecular contacts (e5dcba). This means the same Hedge = 3.093 as for the other 2352 structures of the Wyckoff sequence which contains e5, including e5dcba (2324 structures).
The critical nets for BEJVOB, KACSOX01, and FAFDEW are of different topological type. It was shown in [26] that the value Δ = CNcrit′ − minCNcrit′ adopts almost normal discrete distribution, where 92% of structures demonstrate Δ ≤ 2 (for the set of crystalline hydrocarbons this portion was even more 95% [48]). In the space group P21/c there are 3 generators in a minimal generating subset [49]. If a molecule occupies some special position of the space group with a site-symmetry group containing a generating element of the space group ( 1 ¯ in P21/c), then a fewer number of intermolecular contacts along the other symmetry elements could be sufficient for generating of a molecular net. However, for the structural class P21/c, Z = 4(1) the value minCNcrit′ = 3. For KACSOX01 and FAFDEW the critical molecular nets are parsimonic (CNcrit′ = 3), while for BEJVOB the net is not parsimonic (CNcrit′ = 5). The last one contains two redundant contacts via the inversion centers. Any pair of two inversion centers separated by a translation generate this translation; however, if it is accompanied by a contact with the multiplicity 2 along the same direction, the pair of inversion centers becomes redundant. Conversely, a sole contact with the multiplicity 2 in the critical net cannot be redundant because a triplet of inversion centers would never generate a 3D-space group instead of a plane group with the triplet belonging to the plane. As a result of the redundancy the critical net in BEJVOB is more complex than in KACSOX01 and FAFDEW (Hedge,crit = 2.252, 1.500 and 1.522 bits/atoms, respectively), i.e., about a half of the molecular net information content for KACSOX01 and FAFDEW, and more than 2/3 of that for BEJVOB. The nets are shown in Figure 6.
The topological types of the molecular and critical nets, which are subnets of the former, are shown in Figure 7. Surprisingly, for BEJVOB the prototype molecular net bcu-x has 2 kinds of edges, while the prototype critical net sxa has three kinds of edges because some Wyckoff positions are split when the symmetry group descends from I m 3 ¯ m (bcu-x) to Cmme (sxa). The group Cmme has five elements in a minimal generating set [49], and there are Z = 4 (mm2) equivalent vertices in sxa. As the point group mm2 has two generators, the vertex configuration of sxa can be generated by 5 − 2 = 3 “contacts” of the vertices, therefore, the net sxa could be realized even for CNcrit′ = 3. On the contrary, another similar 6-coordinated net sxb of the strucutral class Cccm, Z = 4(2/m) could not be realized in any space group at CNcrit′ = minCNcrit′, since Cccm is generated by just a pair of elements. Recently, sxb was found in a metal-organic framework (MOF) [Mg3(btdc)3(dmf)4] [50], which was synthesized by a topotactic reaction from [Mg3(btdc)3(dmf)4]∙DMF of the pcu type upon heating, thus, the former MOF is not parsimonic in principle.
The set of the combinatorically distinctive critical nets depends on the topology of the initial molecular nets. For bcu-x, gpu-x, and tcg-x in the three above mentioned crystal structures, all subsets of edges, which may correspond to a CNcrit′ = minCNcrit′ = 3, were enumerated (Table 4). As all the initial nets have edges with the Wyckoff sequence e5dcba, there are 4 involutions and 5 contacts with the multiplicity 2. In BEJVOB (bcu-x), KACSOX01 (gpu-x) and FAFDEW (tcg-x) there are four contacts along the pair of screw axes 21, four contacts along the pair of glide c-planes, and two contacts along the translation vector, but their combination with the four involutions in different topological types is different. This leads to a different number and types of the critical subnets.
Apparently, the complexity of the partition of subnets into the combinatorically distinctive Wyckoff sequences (e, e2, and e3), as well as into the topological types (dia, cds, dmp, etc.), can be easily measured in terms of (1) and (2), but this is out of the topic of this work. In fact, the coordination shell of a molecule may be referred as fuzzy [51], because upon the crystallization different subsets of bearing contacts arise simultaneously. In summary, the subnets of gpu-x are obviously more diverse and include such exotic topological types as 4-coordinated 4T19 (2 subnets) and 5-coordinated 5T12 (2 subnets). Meanwhile, the leading topological type of the subnet in all cases is the diamondoid type dia. In the structural class P21/c, Z = 4(1) dia, as any other 4-coordinated subnet, is formed by two involutions and two contacts with the multiplicity 2. The formation of dia is limited by two combinatorically different options [48]. In the first one, the generators are the glide c-plane and the inversion centers located at a distance of b/4 from each other along Y. The second option entails the screw axis 21 located at the distance of c/4 from one of the inversion centers (Figure 8, top). If one of the inversion centers in the first dia subtype is shifted by b/2, the subnet transforms into the cds type, for the second option it transforms into dmp. The bnn subnet, as dia, exists in two different subtypes (Figure 8, bottom). Each subtype has the only contact via the inversion center and 2 contacts along the translation a. The only difference is the last contact with the multiplicity 2, either along the glide c-plane or along the screw axis 21. However, if the contacts along a are replaced by the contacts along the 21 axis located at a distance of (a/2 + c/4) from the initial inversion center, then the bnn subtypes are transformed into nov and sqp, respectively. Finally, the pcu subnet of each initial net is generated by three contacts with the multiplicity 2.
Of course, among the critical nets in P21/c, Z = 4(1) there are those having CNcrit′ > minCNcrit′, for instance, noz (5-coordinated), acs, bsn, sxd (6-coordinated), the net of simple hexagonal packing hex (8-coordinated), the body-centered cubic net with unextended coordination shell bcu (8-coordinated), etc. Nevertheless, these topological types may be represented as an extension of some of the 5 minimal nets in E3 without collisions and with equal vertex degrees (CNs): dia, cds, ths, pcu, and srs [13]. The last minimal net of this kind, the 3-coordinated srs, was not observed in any crystal structure for the bearing contacts so far. Similarly, the 3-coordinated net ths was not observed in P21/c, Z = 4(1), but it is possible in some other monoclinic structural classes such as C2/c, Z = 8(1). Up to date, in the TopCryst database it was exemplified not by a molecular crystal, but by a MOF of the crystal structure with the refcode RAGFAJ [52]. The quotient graph of any critical net, including a redundant one, may be derived by an addition of an edge to the undirected quotient graph of some minimal net (Figure 9). Remark that Δ = CNcrit′ − minCNcrit′ = 0 does not necessarily corresponds to a minimal net, because the deletion of an edge lattice and the deletion of the symmetrically equivalent edges are not exactly the same processes. The deletion of all equivalent edges implies the deletion of translationally equivalent edges, but the converse is not true. As a result, a series of not minimal nets such as bnn, sqp, dmp, nov (Figure 8) also corresponds to Δ = 0.
The contribution of Hedge,crit into Hedge varies from 33.9 to 100% (Table 5). Indeed, there are 4 crystal structures with CNcrit = CNmol and Hedge,crit = Hedge, these are extremal cases with the most redundant critical net. The contribution of Hmol, Hedge and H(2N, CNmol) into HmolNet, is, on the average, is 78.9, 9.5 and 11.6%, respectively, with the value of σ being a few percent, i.e., the complexity of the molecular net is substantially defined by the value of Hmol, but the impact of Hedge and H(2N, CNmol) is meaningful. The differences of min and max values for the contributions of Hedge and H(2N, CNmol) into HmolNet are much more than σ, that means the outliers being not numerous. The values of SIC, calculated using (7), also show different variances. As it was mentioned above, since there are no atoms in a special position, SICmol = 1 for all the structures. As the maximal multiplicity of a contact is 2, theoretically, the minimal SICedge, crit = − CNcrit/2∙2/CNcrit∙log2(2/CNcrit)/log2CNcrit = 1 − 1/log2CNcrit. All the structures with average multiplicity 2 in the set have CNcrit = 6, consequently, the minimal SICedge,crit = 1 − 1/log26 = 0.613. The maximal SICedge, crit = 1 corresponds to the average multiplicity 1 in the structure of (6-methoxycarbonylmethoxynaphthalen-1-yloxy)acetic acid methyl ester (refcode: KOLRAF) [39] with the Wyckoff sequence of edges dcba and the critical net dia. The values of SICedge and SICmolNet have much smaller σ than SICedge,crit.
The distribution of the crystal structures by Hmol and HmolNet is shown in Figure 10. Both values are best approximated by a logistic distribution applicable to the modeling of the degrees of pneumoconiosis in coal miners, chronic obstructive respiratory disease prevalence on smoking, survival time of diagnosed leukemia patients, etc. [53]. Generally, it has the probability density function:
f x ; μ ; β   = e x μ / β β 1 + e x μ / β 2
For Hmol μ ≈ 5.252, β ≈ 0.30; for HmolNet μ ≈ 5.572, β ≈ 0.25. Thus, the difference of the expected values μ is about 0.320 bits/d.f., and the variance for Hmol is greater than for HmolNet.
The discriminatory powers D for Hmol, Hedge,crit, Hedge, H(2N, CNmol), and HmolNet are listed in Table 6. The simple combinatorial complexity Hmol distinguishes only 99 values, whereas HmolNet—531 values. Surprisingly, Hedge has the least D = 0.6372 and distinguishes only 26 values, while Hedge,crit has even greater D = 0.8762 and s = 28. The reason of such substantial difference of D at a small difference of s is the abnormality of distribution. As shown above, 2355 structures have e5dcba or similar Wyckoff sequence for the edge net (in this case Hedge = 1.556 bits/contact). Meanwhile, the most widespread Wyckoff sequence for the critical net is eba (or similar)—856 structures (Hedge,crit = 1.500 bits/contact), i.e., with about three times less probability. The H(2N; CNmol) has a remarkably high value of s and D in comparison with Hmol, because CNmol may vary at equal N, i.e., at equal Hmol.

4. Conclusions

For molecular crystals, unlike those with infinite chains, layers, or frameworks, the simple combinatorial information content is of limited usefulness. When each atom occupies its own crystallographic orbit, the value of Hmol reflects only the number of atoms in a molecule. On the contrary, the information content of the molecular net Hedge combined with Hmol gives a hybrid function HmolNet dependent not only on the number of atoms in a molecule, but on the molecular coordination number CNmol and the number of orbits occupied by the midpoints of the molecular contacts CNmol′. In comparison with Hmol, this hybrid function has a greater discriminatory power and is more favorable for molecular crystals. The edge net complexity Hedge and that originated from mixing two sources of information H(2N; CNmol) add, on the average, a little more than 10% Hmol each. The normalized values of Hedge and HmolNet (SICedge and SICmolNet) are distributed near 0.80–0.85 with a small standard deviation. The distribution of both Hmol and HmolNet is approximately logistic.
More than a half of 4152 considered structures have CNmol = 14 and CNmol′ = 9 with the Wyckoff set of edges e5dcba. The average multiplicity of intermolecular contacts statistically significantly decreases upon excluding all contacts except those bearing the molecular net, i.e., the critical net is more saturated with involutions (the inversion centers) than the initial net. The critical net contains more than 40% information of the molecular net, and Hedge,crit has a more discriminatory power.
The minimal possible CNcrit′ is the invariant of a structural class. Each molecular coordination shell may be split in the finite number of critical coordination shells, from which the complexity of the fuzzy coordination shell arises.

Author Contributions

Conceptualization, A.M.B.; methodology, A.M.B.; writing—original draft preparation, A.M.B.; writing—review and editing, A.M.B. and S.M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Russian Science Foundation, grant number 20-77-10065.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
  2. Trucco, E. On the information content of graphs: Compound symbols; Different states for each point. Bull. Math. Biophys. 1956, 18, 237–253. [Google Scholar] [CrossRef]
  3. Bonchev, D. Shannon’s Information and Complexity. In Complexity: Introduction and Fundamentals; CRC Press: London, UK, 2003; pp. 157–187. ISBN 978-0-429-16545-0. [Google Scholar]
  4. Sabirov, D.S.; Shepelevich, I.S. Information Entropy in Chemistry: An Overview. Entropy 2021, 23, 1240. [Google Scholar] [CrossRef]
  5. Sabirov, D.S. Information entropy changes in chemical reactions. Comput. Theor. Chem. 2018, 1123, 169–179. [Google Scholar] [CrossRef]
  6. Sabirov, D.S. Information entropy of mixing molecules and its application to molecular ensembles and chemical reactions. Comput. Theor. Chem. 2020, 1187, 112933. [Google Scholar] [CrossRef]
  7. Sabirov, D.S. Information entropy of interstellar and circumstellar carbon-containing molecules: Molecular size against structural complexity. Comput. Theor. Chem. 2016, 1097, 83–91. [Google Scholar] [CrossRef]
  8. Chung, S.J.; Hahn, T.; Klee, W.E. Nomenclature and generation of three-periodic nets: The vector method. Acta Crystallogr. Sect. A 1984, 40, 42–50. [Google Scholar] [CrossRef] [Green Version]
  9. Klee, W.E. Crystallographic nets and their quotient graphs. Cryst. Res. Technol. 2004, 39, 959–968. [Google Scholar] [CrossRef]
  10. Eon, J.-G. From symmetry-labeled quotient graphs of crystal nets to coordination sequences. Struct. Chem. 2012, 23, 987–996. [Google Scholar] [CrossRef]
  11. Eon, J.-G. Topological features in crystal structures: A quotient graph assisted analysis of underlying nets and their embeddings. Acta Crystallogr. Sect. A Found. Adv. 2016, 72, 268–293. [Google Scholar] [CrossRef]
  12. Beukemann, A.; Klee, W.E. Minimal nets. Z. Krist.-New Cryst. Struct. 1992, 201, 37–51. [Google Scholar] [CrossRef]
  13. Bonneau, C.; Delgado-Friedrichs, O.; O’Keeffe, M.; Yaghi, O.M. Three-periodic nets and tilings: Minimal nets. Acta Crystallogr. Sect. A Found. Crystallogr. 2004, 60, 517–520. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Eon, J.G. Euclidian embeddings of periodic nets: Definition of a topologically induced complete set of geometric descriptors for crystal structures. Acta Crystallogr. Sect. A Found. Crystallogr. 2011, 67, 68–86. [Google Scholar] [CrossRef] [PubMed]
  15. O’Keeffe, M.; Peskov, M.A.; Ramsden, S.J.; Yaghi, O.M. The Reticular Chemistry Structure Resource (RCSR) Database of, and Symbols for, Crystal Nets. Acc. Chem. Res. 2008, 41, 1782–1789. [Google Scholar] [CrossRef]
  16. Delgado-Friedrichs, O.; O’Keeffe, M. Identification of and symmetry computation for crystal nets. Acta Crystallogr. Sect. A 2003, 59, 351–360. [Google Scholar] [CrossRef]
  17. Sunada, T. Lecture on topological crystallography. Jpn. J. Math. 2012, 39, 1–39. [Google Scholar] [CrossRef]
  18. Krivovichev, S.V. Topological complexity of crystal structures: Quantitative approach. Acta Crystallogr. Sect. A Found. Crystallogr. 2012, 68, 393–398. [Google Scholar] [CrossRef] [PubMed]
  19. Krivovichev, S.V. Which inorganic structures are the most complex? Angew. Chem.-Int. Ed. 2014, 53, 654–661. [Google Scholar] [CrossRef]
  20. Hornfeck, W. On an extension of Krivovichev’ s complexity measures. Acta Crystallogr. Sect. A Found. Adv. 2020, 76, 534–548. [Google Scholar] [CrossRef]
  21. Kaußler, C.; Kieslich, G. crystIT: Complexity and configurational entropy of crystal structures via information theory. J. Appl. Crystallogr. 2021, 54, 306–316. [Google Scholar] [CrossRef] [PubMed]
  22. Banaru, A.M.; Aksenov, S.M.; Krivovichev, S.V. Complexity Parameters for Molecular Solids. Symmetry 2021, 13, 1399. [Google Scholar] [CrossRef]
  23. Csiszár, I. Axiomatic Characterizations of Information Measures. Entropy 2008, 10, 261–273. [Google Scholar] [CrossRef] [Green Version]
  24. Pidcock, E.; Motherwell, W.D.S.; Cole, J.C. A database survey of molecular and crystallographic symmetry. Acta Crystallogr. Sect. B Struct. Sci. 2003, 59, 634–640. [Google Scholar] [CrossRef] [PubMed]
  25. Slovokhotov, Y.L. Organic crystallography: Three decades after Kitaigorodskii. Struct. Chem. 2019, 30, 551–558. [Google Scholar] [CrossRef]
  26. Banaru, A.M.; Aksenov, S.M.; Banaru, D.A. Critical Molecular Coordination Numbers in the Structural Class P2(1)/c, Z = 4(1). Mosc. Univ. Chem. Bull. 2021, 78, 325–333. [Google Scholar] [CrossRef]
  27. Zorky, P.M. Symmetry, pseudosymmetry and hypersymmetry of organic crystals. J. Mol. Struct. 1996, 374, 9–28. [Google Scholar]
  28. Groom, C.R.; Bruno, I.J.; Lightfoot, M.P.; Ward, S.C. The Cambridge Structural Database. Acta Crystallogr. B Struct. Sci. Cryst. Eng. Mater. 2016, 72, 171–179. [Google Scholar] [CrossRef] [PubMed]
  29. Blatov, V.A.; Shevchenko, A.P.; Proserpio, D.M. Applied Topological Analysis of Crystal Structures with the Program Package ToposPro. Cryst. Growth Des. 2014, 14, 3576–3586. [Google Scholar] [CrossRef]
  30. Blatov, V.A. Voronoi–dirichlet polyhedra in crystal chemistry: Theory and applications. Crystallogr. Rev. 2004, 10, 249–318. [Google Scholar] [CrossRef]
  31. Oswald, I.D.H.; Urquhart, A.J. Polymorphism and polymerisation of acrylic and methacrylic acid at high pressure. CrystEngComm 2011, 13, 4503–4507. [Google Scholar] [CrossRef] [Green Version]
  32. Gridin, D.M.; Banaru, A.M. Coordination Numbers and Topology of Crystalline Hydrocarbons. Mosc. Univ. Chem. Bull. 2020, 75, 354–367. [Google Scholar] [CrossRef]
  33. The Samara Topological Data Center TopCryst. Available online: https://topcryst.com/ (accessed on 13 January 2022).
  34. Blatov, V.A.; O’Keeffe, M.; Proserpio, D.M. Vertex-, face-, point-, Schläfli-, and Delaney-symbols in nets, polyhedra and tilings: Recommended terminology. CrystEngComm 2010, 12, 44–48. [Google Scholar] [CrossRef] [Green Version]
  35. Hunter, P.R.; Gaston, M.A. Numerical index of the discriminatory ability of typing systems: An application of Simpson’s index of diversity. J. Clin. Microbiol. 1988, 26, 2465–2466. [Google Scholar] [CrossRef] [Green Version]
  36. Kroll, L.S. Mathematica—A System for Doing Mathematics by Computer. Wolfram Research. Am. Math. Mon. 1989, 96, 855–861. [Google Scholar] [CrossRef]
  37. Carugo, O.; Blatova, O.A.; Medrish, E.O.; Blatov, V.A.; Proserpio, D.M. Packing topology in crystals of proteins and small molecules: A comparison. Sci. Rep. 2017, 7, 13209. [Google Scholar] [CrossRef]
  38. Starck, F.; Jones, P.G.; Herges, R. Synthesis of Photoresponsive Polyethers. Eur. J. Org. Chem. 1998, 1998, 2533–2539. [Google Scholar] [CrossRef]
  39. Mondal, P.; Karmakar, A.; Singh, W.M.; Baruah, J.B. Crystal packing in some flexible carboxylic acids and esters attached to a naphthalene ring. CrystEngComm 2008, 10, 1550–1559. [Google Scholar] [CrossRef]
  40. Motherwell, W.D.S. Architecture of packing in molecular crystals. CrystEngComm 2017, 19, 6869–6882. [Google Scholar] [CrossRef]
  41. Banaru, A.M.; Gridin, D.M. Coordination Numbers and Critical Topology of Centrosymmetric Hydrocarbons. J. Struct. Chem. 2019, 60, 1885–1895. [Google Scholar] [CrossRef]
  42. Song, X.; Tang, Z.; Zuo, Z.; Duan, J. The crystal structure of 2-(tert-butyl)-4-chloro-6-phenyl-1,3,5-triazine, C13H14Cl1N3. Z. Krist.-New Cryst. Struct. 2018, 233, 779–781. [Google Scholar] [CrossRef]
  43. Boese, R.; Blaeser, D.; Gomann, K.; Brinker, U.H. Spiropentane as a tensile spring. J. Am. Chem. Soc. 1989, 111, 1501–1503. [Google Scholar] [CrossRef]
  44. Shirazi, M.; Soltani, M.-R.; Jahanabadi, Z.; Abdollahifar, M.-A.; Tanideh, N.; Noorafshan, A. Stereological comparison of the effects of pentoxifylline, captopril, simvastatin, and tamoxifen on kidney and bladder structure after partial urethral obstruction in rats. Korean J. Urol. 2014, 55, 756–763. [Google Scholar] [CrossRef]
  45. Bryan, R.F.; White, D.H. α-Methyl-trans-cinnamic acid (m.p. 355 K). Acta Crystallogr. Sect. B 1982, 38, 1332–1334. [Google Scholar] [CrossRef]
  46. Abid, O.-R.; Qadeer, G.; Rama, N.H.; Wong, W.-Y. 5-Methoxyindan-1-one. Acta Crystallogr. Sect. E 2007, 63, o165–o166. [Google Scholar] [CrossRef]
  47. de Meijere, A.; Khlebnikov, A.F.; Kozhushkov, S.I.; Kostikov, R.R.; Schreiner, P.R.; Wittkopp, A.; Rinderspacher, C.; Menzel, H.; Yufit, D.S.; Howard, J.A.K. The First Enantiomerically Pure [n]Triangulanes and Analogues: σ-[n]Helicenes with Remarkable Features. Chem.-Eur. J. 2002, 8, 828–842. [Google Scholar] [CrossRef]
  48. Banaru, A.M.; Banaru, D.A. Zorkii structural classes and critical topology of molecular crystals. J. Struct. Chem. 2020, 61, 1485–1502. [Google Scholar] [CrossRef]
  49. Lord, E.A.; Banaru, A.M. Number of generating elements in space group of a crystal. Mosc. Univ. Chem. Bull. 2012, 67, 50–58. [Google Scholar] [CrossRef]
  50. Dubskikh, V.A.; Lysova, A.A.; Samsonenko, D.G.; Dybtsev, D.N.; Fedin, V.P. Topological polymorphism and temperature-driven topotactic transitions of metal–organic coordination polymers. CrystEngComm 2020, 22, 6295–6301. [Google Scholar] [CrossRef]
  51. Banaru, A.M. A Fuzzy Set of Generating Contacts in a Molecular Agglomerate. Mosc. Univ. Chem. Bull. 2019, 74, 101–105. [Google Scholar] [CrossRef]
  52. Li, S.-L.; Wang, J.; Zhang, F.-Q.; Zhang, X.-M. Light and Heat Dually Responsive Luminescence in Organic Templated CdSO4-type Halogeno(cyano)cuprates with Disorder of Halogenide/Cyanide. Cryst. Growth Des. 2017, 17, 746–752. [Google Scholar] [CrossRef]
  53. Nadarajah, S.; Kotz, S. A generalized logistic distribution. Int. J. Math. Math. Sci. 2005, 2005, 894212. [Google Scholar] [CrossRef] [Green Version]
Figure 1. VDPmol as the sum of atomic VDPs (represented by different colors) and the 1-st molecular coordination shell in the crystal structure of acrylic acid (CSD-refcode: ACRLAC04).
Figure 1. VDPmol as the sum of atomic VDPs (represented by different colors) and the 1-st molecular coordination shell in the crystal structure of acrylic acid (CSD-refcode: ACRLAC04).
Symmetry 14 00220 g001
Figure 2. The distribution of the set of crystal structures by Ωcrit rounded to half-integer %.
Figure 2. The distribution of the set of crystal structures by Ωcrit rounded to half-integer %.
Symmetry 14 00220 g002
Figure 3. A fragment of the net of all intermolecular contacts fcu type (left) and that of bearing intermolecular contacts sqp type (right) for the crystal structure of acrylic acid.
Figure 3. A fragment of the net of all intermolecular contacts fcu type (left) and that of bearing intermolecular contacts sqp type (right) for the crystal structure of acrylic acid.
Symmetry 14 00220 g003
Figure 4. The distribution of crystal structures of the analyzed set over CNmol and CNcrit (top), CNmol′ and CNcrit′ (bottom).
Figure 4. The distribution of crystal structures of the analyzed set over CNmol and CNcrit (top), CNmol′ and CNcrit′ (bottom).
Symmetry 14 00220 g004aSymmetry 14 00220 g004b
Figure 5. The structural formulas of isomeric α-methyl-trans-cinnamic acid (refcode: BEJVOB), 5-methoxyindan-1-one (refcode: KACSOX01), and (1RS,3SR,4SR)-trispiro(2.0.0.2.1.1)nonane-1-carboxylic acid (refcode: FAFDEW).
Figure 5. The structural formulas of isomeric α-methyl-trans-cinnamic acid (refcode: BEJVOB), 5-methoxyindan-1-one (refcode: KACSOX01), and (1RS,3SR,4SR)-trispiro(2.0.0.2.1.1)nonane-1-carboxylic acid (refcode: FAFDEW).
Symmetry 14 00220 g005
Figure 6. A fragment of the molecular nets of BEJVOB, KACSOX01, and FAFDEW, view along Y (β-setting). The edges of the critical nets are shown—by black solid lines, those disposable for the molecular net—by blue dashed lines, the molecular centers of gravity—by blue circles.
Figure 6. A fragment of the molecular nets of BEJVOB, KACSOX01, and FAFDEW, view along Y (β-setting). The edges of the critical nets are shown—by black solid lines, those disposable for the molecular net—by blue dashed lines, the molecular centers of gravity—by blue circles.
Symmetry 14 00220 g006
Figure 7. The topological types of BEJVOB (left), KACSOX01 (center), and FAFDEW (right) for the molecular (top) and critical (bottom) nets.
Figure 7. The topological types of BEJVOB (left), KACSOX01 (center), and FAFDEW (right) for the molecular (top) and critical (bottom) nets.
Symmetry 14 00220 g007
Figure 8. A scheme of the critical subnets of the molecular nets in P21/c, Z = 4(1) at CNcrit = 4 (top) and 5 (bottom). Molecules inverted and turned to the viewer by the reverse side, are shaded up. Molecules shifted by ±t/2 towards the viewer, are outlined by a dashed line. Double lines denote contacts between the initial molecule and two adjacent ones located above and below.
Figure 8. A scheme of the critical subnets of the molecular nets in P21/c, Z = 4(1) at CNcrit = 4 (top) and 5 (bottom). Molecules inverted and turned to the viewer by the reverse side, are shaded up. Molecules shifted by ±t/2 towards the viewer, are outlined by a dashed line. Double lines denote contacts between the initial molecule and two adjacent ones located above and below.
Symmetry 14 00220 g008
Figure 9. The types of critical subnets in P21/c, Z = 4(1) derived from minimal nets (shown in red) by an addition of an edge into the quotient graph of the most symmetric embedding in E3.
Figure 9. The types of critical subnets in P21/c, Z = 4(1) derived from minimal nets (shown in red) by an addition of an edge into the quotient graph of the most symmetric embedding in E3.
Symmetry 14 00220 g009
Figure 10. The distributions of 4152 structures by Hmol (top) and HmolNet (bottom) with the step of 0.100 bits/d.f. and their approximation by the logistic distribution.
Figure 10. The distributions of 4152 structures by Hmol (top) and HmolNet (bottom) with the step of 0.100 bits/d.f. and their approximation by the logistic distribution.
Symmetry 14 00220 g010
Table 1. The min, max, mean value, and standard deviation of the multiplicity of the contact in the molecular net of all the intermolecular contacts (molecular) and the bearing ones only (critical) for the set of 4152 crystal structures.
Table 1. The min, max, mean value, and standard deviation of the multiplicity of the contact in the molecular net of all the intermolecular contacts (molecular) and the bearing ones only (critical) for the set of 4152 crystal structures.
The NetMinMaxMeanσ
molecular1.3752.0001.5820.060
critical1.0002.0001.5120.186
Table 2. The topological types found in crystalline hydrocarbons [32,44], some inorganic molecular crystals [22], and 2-(tert-butyl)-4-chloro-6-phenyl-1,3,5-triazine, ordered by the increase of TD10.
Table 2. The topological types found in crystalline hydrocarbons [32,44], some inorganic molecular crystals [22], and 2-(tert-butyl)-4-chloro-6-phenyl-1,3,5-triazine, ordered by the increase of TD10.
Topological TypePoint SymbolCoordination SphereTD10
1st2nd3rd4th5th
bcu-x336.448.5714501101943024641
gpu-x336.446.5914521142023144831
tcg-x336.446.5914521162043184893
14T34333.451.5714531172083244996
14T5336.445.51014531202123325106
14T6336.445.51014531202133355138
14T134334.447.51014541222163385201
14T10336.445.51014531222183395238
14T37333.445.51014531212173395239
14T65333.451.5714541222183425301
14T9336.445.51014531232213445329
14T24336.446.5914521202183445339
14T3336.444.51114541242223485373
14T8336.444.51114541262263545475
14T319330.450.51114581302323625581
14T18336.444.51114541302423825947
14T191333.447.51114591412564026246
Table 3. Topological types and structural characteristics of the molecular net in BEJVOB, KACSOX01, and FAFDEW.
Table 3. Topological types and structural characteristics of the molecular net in BEJVOB, KACSOX01, and FAFDEW.
Refcode in CSDBEJVOB [45]KASSOX01 [46]FAFDEW [47]
FormulaC10H10O2C10H10O2C10H12O2
Nameα-methyl-trans-cinnamic acid5-methoxyindan-1-one(1RS,3SR,4SR)-trispiro(2.0.0.2.1.1)nonane-1-carboxylic acid
Temperatureroomroom100 K
R-factor4.103.903.75
Structural classP21/c, Z = 4(1)P21/c, Z = 4(1)P21/c, Z = 4(1)
The molecular net
CNmol141414
Type (transitivity)bcu-x (1,2)gpu-x (1,4)tcg-x (1,6)
CNmol999
Wyckoff sequence
of intermolecular contacts
e5dcbae5dcbae5dcba
Hedge, bits/contact3.0933.0933.093
The critical net
CNcrit645
Type (transitivity)sxa (1,3)dia (1,1)bnn (1,2)
CNcrit533
Wyckoff sequence
of bearing contacts
edcbaedce2b
Hedge,crit, bits/contact2.2521.5001.522
Table 4. Possible critical subnets of bcu-x, gpu-x, and tcg-x with the Wyckoff sequence of edges e5dcba in P21/c.
Table 4. Possible critical subnets of bcu-x, gpu-x, and tcg-x with the Wyckoff sequence of edges e5dcba in P21/c.
CNcritWyckoff Sequences of EdgesSubnetsNets
Bcu-XGpu-XTcg-X
4e.dia18820
cds2
dmp4
4T192
5e2.sqp448
nov8610
bnn1616
5T122
6e3.pcu728
Total533062
Table 5. The min, max, mean value, and standard deviation of the contributions of H (%) and SIC for the set of 4152 crystal structures.
Table 5. The min, max, mean value, and standard deviation of the contributions of H (%) and SIC for the set of 4152 crystal structures.
ValueMinMaxMeanσ
%
Hmol in HmolNet35.594.378.96.4
Hedge,crit in Hedge41.2100.062.411.7
Hedge in HmolNet1.939.29.53.6
H(2N, CNmol) in HmolNet3.825.811.62.8
SIC
SICmol1.0001.0001.0000.000
SICedge,crit0.6131.0000.7400.057
SICedge0.7590.8640.8110.009
SICmolNet0.8130.8790.8530.008
Table 6. The number s of distinctive values H and the discriminatory power D over the set of 4152 crystal structures.
Table 6. The number s of distinctive values H and the discriminatory power D over the set of 4152 crystal structures.
ValuesD, %
Hmol9997.76
Hedge,crit2887.62
Hegde2663.72
H(2N; CNmol)38998.93
HmolNet53199.13
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Banaru, A.M.; Aksenov, S.M. Complexity of Molecular Nets: Topological Approach and Descriptive Statistics. Symmetry 2022, 14, 220. https://doi.org/10.3390/sym14020220

AMA Style

Banaru AM, Aksenov SM. Complexity of Molecular Nets: Topological Approach and Descriptive Statistics. Symmetry. 2022; 14(2):220. https://doi.org/10.3390/sym14020220

Chicago/Turabian Style

Banaru, Alexander M., and Sergey M. Aksenov. 2022. "Complexity of Molecular Nets: Topological Approach and Descriptive Statistics" Symmetry 14, no. 2: 220. https://doi.org/10.3390/sym14020220

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop