<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.3.4">Jekyll</generator><link href="https://chem-bla-ics.linkedchemistry.info/feed/by_tag/aromaticity.xml" rel="self" type="application/atom+xml" /><link href="https://chem-bla-ics.linkedchemistry.info/" rel="alternate" type="text/html" /><updated>2026-04-11T11:30:50+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/feed/by_tag/aromaticity.xml</id><title type="html">chem-bla-ics</title><subtitle>Chemblaics (pronounced chem-bla-ics) is the science that uses open science and computers to solve problems in chemistry, biochemistry and related fields.</subtitle><author><name>Egon Willighagen</name></author><entry><title type="html">Evidence of Aromaticity</title><link href="https://chem-bla-ics.linkedchemistry.info/2007/11/06/evidence-of-aromaticity.html" rel="alternate" type="text/html" title="Evidence of Aromaticity" /><published>2007-11-06T00:00:00+00:00</published><updated>2007-11-06T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2007/11/06/evidence-of-aromaticity</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2007/11/06/evidence-of-aromaticity.html"><![CDATA[<p>I have been working on a new atom type perception engine for the <a href="http://cdk.sf.net/">CDK</a>, after having decided that
<a href="https://chem-bla-ics.linkedchemistry.info/2007/07/01/atom-typing-in-cdk.html">the existing atom type lists <i class="fa-solid fa-recycle fa-xs"></i></a> where not sufficient for
the algorithms we have in the CDK. The <a href="http://cdk.svn.sourceforge.net/viewvc/*checkout*/cdk/trunk/cdk/src/org/openscience/cdk/config/data/cdk_atomtypes.xml?revision=9288">new list</a>
is growing in size, and basically contains four properties (besides element and formal charge):</p>

<ol>
  <li>number of bounded neighbors</li>
  <li>number of pi bonds (or double bond equivalents)</li>
  <li>number of lone pairs</li>
  <li>hybridization state</li>
</ol>

<p>This seems to be a minimal and accurate set to cover a rather good deal of chemoinformatics. I have yet to make the mappings
of the new atom type list with existing lists for force fields, and radicals are missing too. However, the following
algorithms in the CDK seem to translate rather well:</p>

<ul>
  <li>hydrogen adding</li>
  <li>aromaticity detection (Hückel rules)</li>
</ul>

<p>I still have to rework the double bond perception.</p>

<h2 id="aromaticity">Aromaticity</h2>

<p>Now, aromaticity is a fuzzy concept, and there is no general agreement on what it is. Some say it is smelly compounds, others
say ring systems which apply to the Hückel rule. Based on the new atom type list, I have rewritten the Hückel aromaticity
detector and it applies these rules:</p>

<ul>
  <li>only single rings and two fused non-spiro rings</li>
  <li>4n+2 electrons</li>
  <li>no ring atoms with double points not in the ring too</li>
</ul>

<p>This approach differs in two ways from the old code: it no longer tries to test all ring systems, which required to use
the <a href="http://cheminfo.informatics.indiana.edu/~rguha/code/java/nightly/api/org/openscience/cdk/ringsearch/AllRingsFinder.html">CDK AllRingsFinder algorithm</a>
which combinatorial generates all possible ring systems. The new code only considers ring systems with up to two single
rings. Aromaticity beyond that is even less well defined than aromaticity in general.</p>

<p>The other difference is that the ring system must not have ring atoms which have a double bond which is not part of the
ring too. The classical example is benzoquinone (InChI=1/C6H4O2/c7-5-1-2-6(8)4-3-5/h1-4H) which is not aromatic, even
though it conforms the 4n+2 rule (image from <a href="http://pubchem.ncbi.nlm.nih.gov/">PubChem</a>):</p>

<p><img src="/assets/images/cid4650.png" alt="" /></p>

<h2 id="evidence-of-aromaticity">Evidence of Aromaticity</h2>

<p>The final rule, of course, is what nature tells us what is aromatic and what is not. There are many other details to
aromaticity than I just covered. For example, take azulene (InChI=1/C10H8/c1-2-5-9-7-4-8-10(9)6-3-1/h1-8H). All
atoms are aromatic, but not all bonds (also <a href="https://pubchem.ncbi.nlm.nih.gov/compound/9231">PubChem</a>):</p>

<p><img src="/assets/images/cid9231.png" alt="" /></p>

<p>These things are complex, but the rise of <a href="http://en.wikipedia.org/wiki/Open_Data">Open Data</a> helps us out, as well
as increasing computing power. <a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/">Peter</a> has been running two rather
projects which may help us out: <a href="http://wwmm.ch.cam.ac.uk/crystaleye/">CrystalEye</a> (Nick: no blog?) and
<a href="https://blogs.ch.cam.ac.uk/pmr/2007/11/02/open-nmr-update-and-requests-for-input/">OpenNMR <i class="fa-solid fa-recycle fa-xs"></i></a>.</p>

<p>NMR shifts will give us experimental backup on our notion of aromaticity, and so do bond lengths. I
<a href="https://blogs.ch.cam.ac.uk/pmr/2007/11/02/open-nmr-update-and-requests-for-input/#comment-1139">asked Peter about this <i class="fa-solid fa-recycle fa-xs"></i></a>, and whether OpenNMR
predicted shifts could indeed confirm aromaticity of compounds, and <a href="https://blogs.ch.cam.ac.uk/pmr/2007/11/05/open-nmr-how-good-is-the-prediction/">he replied <i class="fa-solid fa-recycle fa-xs"></i></a>
and showed that the predicted spectra could be used to distinguish between C-C and C=C bonds.</p>

<p>I commented the following (which was in moderation at the time of writing), and that gets us to experimental
evidence for aromaticity:</p>

<blockquote>
  <p>Thanx for the elaborate answer. What I had in mind was the question whether NMR shift predictions can be
used to tell me if a certain ring system is aromatic or not, and in case of fused rings, which atoms and
which bonds are aromatic and which not. I’m sure the prediction error for 1H NMR shifts is well below 2ppm,
and more in the order of 0.2ppm.</p>
</blockquote>

<blockquote>
  <p>But maybe I should be asking, can I use CrystalEye to decide if ring systems are “aromatic”, and in case
of two rings fused together (non-spiro), which atoms and bonds are aromatic and which not. Aromaticity is
a fuzzy concept, with various definitions. I would be interesting in linking what the expert considers
‘aromatic’ (or SMILES, or the CDK, or …) with what the QM chemistry (via bond lengths or NMR shift
predictions) and crystal structures (via bond lengths) has to teach us. The null hypothesis being that
the bonds are not delocalized (bond length) and that no ring current is found (NMR shifts, 1H in particular).</p>
</blockquote>

<blockquote>
  <p>Regarding those bond lengths, ‘aromatic’ bonds show a bond length in between that of single and double bonds
(e.g. see <a href="http://www.chem.swin.edu.au/modules/mod2/bondlen.html">this random pick</a>). The CrystalEye data
does not reflect that really, and only <a href="http://wwmm.ch.cam.ac.uk/crystaleye/bondlengths/C-C-after-protocol.svg">a trimodal histograms</a>
shows up. Indeed, the C#C peak is <em>very</em> low, around 1.2A :) Apparently, the triple C#C bond order is
underrepresented in nowadays crystallography.</p>
</blockquote>

<blockquote>
  <p>Maybe aromatic C:C bonds are underrepresented too, or can the absence of a peak around 1.40A be explained
otherwise? I would at least have expected a shoulder or deviation in peak shape of the peak at 1.37A.</p>
</blockquote>

<p>This is what the histogram looks like (for archival reasons):</p>

<p><img src="/assets/images/trimodalCC.png" alt="" /></p>]]></content><author><name>Egon Willighagen</name></author><category term="cdk" /><category term="inchikey:CUFNKYGDVFVPHO-UHFFFAOYSA-N" /><category term="inchikey:AZQWKYJCGOJGHM-UHFFFAOYSA-N" /><category term="aromaticity" /><category term="crystal" /><summary type="html"><![CDATA[I have been working on a new atom type perception engine for the CDK, after having decided that the existing atom type lists where not sufficient for the algorithms we have in the CDK. The new list is growing in size, and basically contains four properties (besides element and formal charge):]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/trimodalCC.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/trimodalCC.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry></feed>