[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Reply to: [list | sender only]
phase identifiers
- To: Multiple recipients of list <phase-identifiers@iucr.org>
- Subject: phase identifiers
- From: "I. David Brown" <idbrown@mcmail.cis.mcmaster.ca>
- Date: Thu, 26 Jun 2003 16:11:57 +0100 (BST)
Dear Colleagues, DEADLINE FOR RESPONSE IS 1 AUGUST. YOUR COMMENTS ARE ESSENTIAL IF WE ARE TO DEVELOP A CONSENSUS. PLEASE RESPOND TO THE PHASE-IDENTIFIERS DISCUSSION LIST. Discussion paper number 5 ------------------------- The principal difficulty facing the phase identifier project at the moment is how to identify the different isomers of organic crystals that have the same sum formula. This requires some way of differentiating between the topologies of different isomers. Organic chemists have devised elaborate procedures for describing the topologies of organic molecules, but we do not need anything as complex for several reasons. Firstly, the chemical formula will provide a first level of discrimination on the basis of the numbers of different atoms present. Secondly, we are not using the phase identifier to locate a particular molecule or fragment in the crystal, and thirdly, we are not interested in how many distinct molecules or moieties the crystal may contain. The only test we are interested in is whether the topology of two crystalline phases is the same or different. Sam Motherwell and I have discussed this problem and looked at some of the methods used by the Cambridge Structural Database. I have also tried out a number of simple descriptors. The one that seems to work best consists of a string of five numbers showing how many carbon atoms in the formula unit of the crystal (given by the sum formula) have 0, 1, 2, 3 and 4 attached hydrogen atoms respectively. There are very few organic compounds where this spectrum of numbers cannot be uniquely determined and, combined with the sum formula, it appears to discriminate remarkably well. The five numbers that make up the spectrum need to be separated, say, by a period since each number may run into more than one digit. The presentation can be simplified by omitting all the numbers that are zero (their place is identified by the periods), and since the only molecule or fragment that has four attached hydrogen atoms is methane, the last number can also be omitted (the presence of methane is indicated by a discrepancy between the carbon content of the sum formula and the sum of the numbers in the spectrum). This results in a symbol that contains three periods and up to four different numbers. The table below shows the full and abbreviated H spectra around C for a number of simple organic molecules. For comparison I also give the spectra using the count of attached C or attached C and H neighbours. The carbon spectrum will not distinguish optical isomers. I am not sufficiently familiar with this problem. Does anyone have any suggestions how optical isomers can be uniquely identified? _chemical_formula_C_spectrum _definition ; A sequence of four numbers separated by periods, indicating the number of C atoms in the formula unit (defined by _chemical_formula_sum) that have zero, one, two, and three attached H atoms. In the abbreviated form of this spectrum, zeros are omitted. Each symbol contains three periods and between zero and four numbers whose total equals the carbon atom count minus the number of methane molecules present. ; _example ; Full Symbol C only C+H H Abbrev total C symbol methane 1.0.0.0.0 0.0.0.0.1 0.0.0.0.1 ... 1 ethane 0.2.0.0.0 0.0.0.0.2 0.0.0.2.0 ...2 2 ethylene 0.2.0.0.0 0.0.0.2.0 0.0.2.0.0 ..2. 2 acetylene 0.2.0.0.0 0.0.2.0.0 0.2.0.0.0 .2.. 2 ethanol 0.2.0.0.0 0.0.0.1.1 0.0.1.1.0 ..1.1 2 1,1-dichloroethane 0.2.0.0.0 0.0.0.2.0 0.1.0.1.0 .1..1 2+Cl 1,2-dichloroethane 0.2.0.0.0 0.0.1.0.1 0.0.2.0.0 ..2. 2+Cl propane 0.2.1.0.0 0.0.0.0.3 0.0.1.2.0 ..1.2 3 n-butane 0.2.2.0.0 0.0.0.0.4 0.0.2.2.0 ..2.2 4 i-butane 0.3.0.1.0 0.0.0.0.4 0.1.0.3.0 .1..3 4 cyclobutane 0.0.4.0.0 0.0.0.0.4 0.0.4.0.0 ..4. 4 pyridine 0.2.3.0.0 0.2.3.0.0 0.5.0.0.0 .5.. 5+N cyclopentane0.0.5.0.0 0.0.0.0.5 0.0.5.0.0 ..5. 5 n-pentane 0.2.3.0.0 0.0.0.0.5 0.0.3.2.0 ..3.2 5 cyclopentadiene 0.0.5.0.0 0.0.0.5.0 0.5.0.0.0 .5.. 5 benzene 0.0.6.0.0 0.0.0.6.0 0.6.0.0.0 .6.. 6 cyclohexane 0.0.6.0.0 0.0.0.0.6 0.0.6.0.0 ..6. 6 n-hexane 0.2.4.0.0 0.0.0.0.6 0.0.4.2.0 ..4.2 6 phenol 0.0.6.0.0 0.0.1.5.0 1.5.0.0.0 1.5.. 6+O toluene 0.1.5.1.0 0.0.0.6.1 1.5.0.1.0 1.5..1 7 anthracene 0.0.8.2.0 0.0.0.10.0 2.8.0.0.0 2.8.. 10 Other keys that we have identified are: --------------------------------------- # Note that all keys are optional and not all are appropriate # (carbon spectra only appear if there is C in the formula), but # the more keys that can be given, the better the chance of # identifying materials with the same phase. _chemical_formula_sum (existing item) # This item identifies the formula unit which is important if the # carbon spectrum is given. # For inorganic compounds, only the relative abundance of # the elements is important since the choice of the size of # the formula unit is not always obvious. _chemical_state loop_ _enumeration gas gas phase liq liquid phase sol solid phase of unknown form xtl crystalline solid am amorphous solid lx liquid crystal _space_group_IT_number (existing item) _definition ; The number of the space group from Int. Tables. ; _crystal_system_code loop_ _enumeration aP, mP, mE, oP, oE, oF, oI, tP, tI, hP, hR, cP, cF, cI # Redundant if the space group is known, but in any case useful # in case comparison is being made with a material whose space # group is not fully determined. This symbol should correspond # to the standard setting given in International Tables. _space_group_Wyckoff_sequence _definition ; The sequence of Wyckoff letters in alphabetical order describing the occupied sites in the space group. Where there is a choice of sequences, the sequence lowest in the alphabetic order should be used. E.g., in P-1 the general position is i, but any of the letters a - h could be chosen for an atom on a center of symmetry, so a-d-i6 should be chosen rather than b-f- i6. ; The following keys are more problematic --------------------------------------- _structure_type_code _definition ; The code used in the Pauling file. This is useful for those structure types which have been assigned a type code in the Pauling file. ; # Struktur Berichte has names for many structure types that have # been often used in the condensed matter physics community. # They have been around for many years and should be # straightforward to apply. _chemical_name_mineral (existing item) _definition ; The name assigned by the International Mineralogical Association(?) for natural minerals. Should these names be used for their synthetic analogues, given that the synthetic analogues often have ideal compositions that are unknown in nature? ; _reduced_crystallographic_cell # This has been recommended as a good key for identifying # identical materials, but it is subject to experimental # uncertainty and so will rarely give an exact match. We should # consider including this as a secondary key that can be tested # to see if two cells lie within prescribed tolerances. The following keys identify the conditions under which the material was characterized ----------------------------------------------------------- _temperature (_diffrn_ambient_temperature) _definition ; The temperature in K at which material was characterized. ; _pressure (_diffrn_ambient_pressure) _definition ; The pressure in kPa at which material was characterized. ; Can you please post your comments to the phase-identifiers discussion group (by replying to this email) by August 1. Best wishes David ***************************************************** Dr.I.David Brown, Professor Emeritus Brockhouse Institute for Materials Research, McMaster University, Hamilton, Ontario, Canada Tel: 1-(905)-525-9140 ext 24710 Fax: 1-(905)-521-2773 idbrown@mcmaster.ca *****************************************************
Reply to: [list | sender only]
- Prev by Date: Phase ID proposal 4
- Next by Date: RE: phase identifiers
- Prev by thread: Phase Identifiers
- Next by thread: RE: phase identifiers
- Index(es):