<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <generator uri="https://jekyllrb.com/" version="4.3.4">Jekyll</generator>
  <link href="https://chem-bla-ics.linkedchemistry.info/feed.xml" rel="self" type="application/atom+xml"/>
  <link href="https://chem-bla-ics.linkedchemistry.info" rel="alternate" type="text/html"/>
  <updated>2026-06-07T16:43:55+00:00</updated>
  <id>https://chem-bla-ics.linkedchemistry.info/archive.xml</id>
  <title type="html">chem-bla-ics</title>
  <subtitle>Chemblaics (pronounced chem-bla-ics) is the science that uses open science and computers to solve problems in chemistry, biochemistry and related fields.</subtitle>
  <author>
    <name>Egon Willighagen</name>
    <uri>https://orcid.org/0000-0001-7542-0286</uri>
  </author>

  
  <entry>
    <title type="html">The launch of the Virtual Human Platform</title>
    <link href="https://chem-bla-ics.linkedchemistry.info/2026/06/06/the-launch-of-the-virtual-human-platform.html" rel="alternate" type="text/html" title="The launch of the Virtual Human Platform"/>
    <published>2026-06-06T00:00:00+00:00</published>
    <updated>2026-06-06T00:00:00+00:00</updated>
    <id>https://doi.org/10.59350/s784p-s1y68</id>
    <content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/06/06/the-launch-of-the-virtual-human-platform.html">
      <![CDATA[ <p><img src="/assets/images/vhp_platform.png" style="width: 30%; display: block; margin-left: auto; margin-right: auto; float: right" alt="Screenshot of the Virtual Human Platform website, showing a logo, three section panels (Case Studies, Tools, Methods, Data), and a short description. The page is just the top part and includes several menus at the top." />
Nine days ago, the <a href="https://vhp4safety.nl/">VHP4Safety</a> project
(see <a href="https://chem-bla-ics.linkedchemistry.info/tag/vhp4safety">these posts</a>)
held a launch event in Utrecht for the
<a href="https://platform.vhp4safety.nl/">Virtual Human Platform</a> (VHP), a key result of the
<a href="https://www.nwo.nl/en/researchprogrammes/dutch-research-agenda-nwa">Dutch Research Agenda</a> (NWA,
from the Dutch <em>Nationale Wetenschapsagenda</em>). Despite the name, the NWA is just one part
of the NWO funding mechanisms, but like the <a href="https://www.openscience.nl/en/about-us">NWO Open Science programme</a>
it is funding with a specific purpose. And the purpose of the NWA is to answer
research and societal questions that the Dutch people together defined and a public
consultation (many years ago). VHP4Safety is answering to one of those questions.</p>

<h2 id="co-creation">Co-creation</h2>

<p>The project is still running another few months, but the <a href="https://www.sciencrew.com/c/9347/a/335652577?title=Launch_of_the_Virtual_Human_Platform">launch last week</a>
gives us the opportunity to include feedback from the stakeholders from the Dutch
society, many of which have been involved in the project via designathons and
hackathons (see doi:<a href="https://doi.org/10.14573/altex.2407211">10.14573/altex.2407211</a>).</p>

<p>The VHP4Safety platform is a co-creation created by most of the people working
on the VHP4Safety grant. Some people focused on innovation and education (RL3),
others on the regulatory questions (RL2), and some on the development of the
platform (RL1). The research line 1 (RL1) included a work package on the
technological development, work package 1.1, and that was led by Maastricht
University (Ozan and me) and the Applied University of Utrecht (Dr. Marc Teunis).
This project would not be together without the leadership by
Prof. dr. ir. Juliette Legler, Dr. Cyrille Krul, and Prof. dr. Anne Kienhuis
 (see <a href="https://video.edu.nl/w/rvUKc7J4E4HEt2TEbJEQt9">this video</a>).</p>

<h2 id="not-just-technology">Not just technology</h2>

<p>I have to give a huge shout out to Ozan whom had the daunting task
to set up something like OpenRiskNet (doi:<a href="https://doi.org/10.1016/j.toxlet.2018.06.617">10.1016/j.toxlet.2018.06.617</a>),
a project with at least twice as much
funding for operating and documenting just the technical platform, but also
help other partners getting their work on the platform. Also shout outs to 
Luc who in our group first explored how to translate the OpenRiskNet platform
to VHP4Safety with <a href="https://en.wikipedia.org/wiki/Kubernetes">kubernetes</a> from
which we concluded that that was not an option for us. And to Sean in our group
who introduced us to <a href="https://www.geeksforgeeks.org/devops/introduction-to-docker-swarm-mode/">Docker Swarm</a>.</p>

<p>But that is just one aspect of the technological layers. The design outlined
in the original proposal is based on earlier projects, including OpenRiskNet,
eNanoMapper, OpenRiskNet, Open PHACTS, NanoCommons, SbD4Nano and many others.
It is based on open standards developed and/or adopted by
<a href="https://elixir-europe.org/">ELIXIR Europe</a> projects and many other organisations.</p>

<p>And then we have not even covered the content on the platform.</p>

<h2 id="a-virtual-human">A virtual human</h2>

<p>Building full virtual human is an ambition. Many <a href="https://en.wikipedia.org/wiki/Digital_twin">digital twins</a>
capture just one part of human biology. For safety assessment we need many models,
data from experiments, and knowledge bases. And we need a clear narrative that
describes how those isolated solutions are integrated so that regulatory questions
can be answered. That co-created combination is the launched <em>virtual human platform</em>.</p>

<p>Underlying the VHP is a good bit of open science, though it also integrated 
proprietary solutions, currently needed to be able to replace animal testing.
And our modular co-creation resulted in <a href="https://github.com/VHP4Safety">many separate git repositories</a>.
This allows distributed development models and all contributors to take ownership
of the development of their contributions. Marc and Frank introduced a agile computing
approach that we adopted to guide the development of the full platform.</p>

<p>There is so much to write up about the platform (and we will), but for now I want
to highlight a few essential git repositories underlying the 1.0 version of the
platform we launched last week (along with the number of contributors in the past two years):</p>

<ul>
  <li><a href="https://github.com/VHP4Safety/virtual-human-platform">virtual-human-platform</a> (<a href="https://github.com/VHP4Safety/virtual-human-platform/graphs/contributors?from=6%2F1%2F2024">10 contributors</a>): software that provide the platform UX</li>
  <li><a href="https://github.com/VHP4Safety/cloud">cloud</a> (<a href="https://github.com/VHP4Safety/cloud/graphs/contributors?from=6%2F1%2F2024">11 contributors</a>): collects the meta data about (computational) services</li>
  <li><a href="https://github.com/VHP4Safety/ui-casestudy-config">ui-casestudy-config</a> (<a href="https://github.com/VHP4Safety/ui-casestudy-config/graphs/contributors?from=5%2F31%2F2025">7 contributors</a>): collects the details of the narratives of the case studies</li>
</ul>

<p>This includes <a href="https://github.com/aniekdewinter">Aniek</a>, <a href="https://github.com/FW94">Fabian</a>,
<a href="https://github.com/iaortega">Isaac</a>, <a href="https://github.com/senseibelbi">Ivo</a>,
<a href="https://github.com/jmillanacosta">Javier</a>, <a href="https://github.com/johannehouweling">Jente</a>,
<a href="https://github.com/LindeSchoenmaker">Linde</a>, <a href="https://github.com/marvinm2">Marvin</a>,
<a href="https://github.com/mirthhe">Myrthe</a>, <a href="https://github.com/saadlodhi0916">Saad</a>,
<a href="https://github.com/ShakiraPortfolio">Shakira</a>, and <a href="https://github.com/youphendriks">Youp</a>,
in addition to the earlier named <a href="https://github.com/ozancinar">Ozan</a> and
<a href="https://github.com/Maddocent">Marc</a>.</p>

<p>This excludes the many contributions via the designathons and hackathons that are
behind many of the commits to these repositories. And this also excludes the
many <a href="https://github.com/VHP4Safety/">other source code repositories</a> for tools
and services developed and made available on this VHP, with even more researchers.</p>

<p>Very much aware of all the work that is still ahead of us, I am happy with the
important milestone of this release. Thank you to
<a href="https://github.com/orgs/VHP4Safety/people">everyone who contributed to this co-creation</a>,
all the <a href="https://www.sciencrew.com/c/9319/a/329042452?title=VHP4Safety_Partners">involved institutes</a>,
NWO for <a href="https://www.nwo.nl/en/projects/nwa129219272">the funding</a>, and the Dutch
public for the important NWA question.</p>

<p>The project delivered!</p>

      <h4>References</h4>
      <ul>
      
      
      
      
        <li><a href="https://doi.org/10.14573/ALTEX.2407211">10.14573/ALTEX.2407211</a></li>
      
        <li><a href="https://doi.org/10.1016/j.toxlet.2018.06.617">10.1016/j.toxlet.2018.06.617</a></li>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Egon Willighagen</name><uri>https://orcid.org/0000-0001-7542-0286</uri></author>
    
    <category term="vhp4safety"/><category term="bioschemas"/><category term="openscience"/><category term="elixir"/><category term="doi:10.14573/ALTEX.2407211"/><category term="justdoi:10.1016/j.toxlet.2018.06.617"/>
    
    <summary type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/06/06/the-launch-of-the-virtual-human-platform.html">
      <![CDATA[ Nine days ago, the VHP4Safety project (see these posts) held a launch event in Utrecht for the Virtual Human Platform (VHP), a key result of the Dutch Research Agenda (NWA, from the Dutch Nationale Wetenschapsagenda). Despite the name, the NWA is just one part of the NWO funding mechanisms, but like the NWO Open Science programme it is funding with a specific purpose. And the purpose of the NWA is to answer research and societal questions that the Dutch people together defined and a public consultation (many years ago). VHP4Safety is answering to one of those questions. ]]>
    </summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/vhp_platform.png"/>
    <media:content xmlns:media="http://search.yahoo.com/mrss/" medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/vhp_platform.png"/></entry>
  
  <entry>
    <title type="html">New paper: pyBiodatafuse: Extending interoperability of data using modular queries across biomedical resources</title>
    <link href="https://chem-bla-ics.linkedchemistry.info/2026/05/30/new-paper-pybiodatafuse.html" rel="alternate" type="text/html" title="New paper: pyBiodatafuse: Extending interoperability of data using modular queries across biomedical resources"/>
    <published>2026-05-30T00:00:00+00:00</published>
    <updated>2026-05-30T00:00:00+00:00</updated>
    <id>https://doi.org/10.59350/7n2bs-zsm80</id>
    <content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/05/30/new-paper-pybiodatafuse.html">
      <![CDATA[ <p>The number of data and knowledge source relevant to your biological or chemical question
increases every year. They all come with different API and different data models. These
need to be documented and mapped. What better way to do that than actually do that and
then use that. I never asked, but I can imagine that was the original idea of Tooba
and Yojana. At the very least, it demonstrates the level of interoperability we need
in the life sciences.</p>

<p>In a recent paper, <a href="https://orcid.org/0000-0002-7683-0452">Yojana Gadiya</a>,
<a href="https://orcid.org/0000-0002-4166-7093">Javier Millán Acosta</a>, and
<a href="https://orcid.org/0000-0002-4904-3269">Tooba Abbassi-Daloii</a> led a project called
BioDataFuse (worked on at the biohackathons of ELIXIR in <a href="https://doi.org/10.37044/osf.io/mhsqp">2023</a>
and <a href="https://doi.org/10.37044/osf.io/ptmg5_v1">2024</a>
and of SWAT4HCLS in <a href="https://ceur-ws.org/Vol-3890/paper-23.pdf">2024</a>
and <a href="https://ceur-ws.org/Vol-4196/paper_71.pdf">2025</a>) and the matching Python package,
<a href="https://github.com/BioDataFuse/pyBiodatafuse">pyBiodatafuse</a>
(doi:<a href="https://doi.org/10.1093/bioinformatics/btag064">10.1093/bioinformatics/btag064</a>).</p>

<p>With a group of researchers from The Netherlands, Switzerland, Czech Republic, and
the USA, multiple databases are wrapped in a uniform data model. The package
allows the generation of a graph across the imported databases which can then
be further analyzed and visualized. This is an example (RDF) graph that was generated:</p>

<p><img src="/assets/images/pyBiodatafuseGraph.png" alt="" /></p>

<p>Seeing this kind of interoperability brings back <a href="https://chem-bla-ics.linkedchemistry.info/2010/03/04/rdf-jena-bioclipse-eclipse-zest-2-icons.html">good memories</a>.</p>

<p>Congrats to all authors!</p>

      <h4>References</h4>
      <ul>
      
      
        <li><a href="https://doi.org/10.1093/BIOINFORMATICS/BTAG064">10.1093/BIOINFORMATICS/BTAG064</a></li>
      
        <li><a href="https://doi.org/10.37044/OSF.IO/MHSQP">10.37044/OSF.IO/MHSQP</a></li>
      
        <li><a href="https://doi.org/10.37044/OSF.IO/PTMG5_V1">10.37044/OSF.IO/PTMG5_V1</a></li>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Egon Willighagen</name><uri>https://orcid.org/0000-0001-7542-0286</uri></author>
    
    <category term="python"/><category term="data"/><category term="doi:10.1093/BIOINFORMATICS/BTAG064"/><category term="doi:10.37044/OSF.IO/MHSQP"/><category term="justdoi:10.37044/OSF.IO/PTMG5_V1"/>
    
    <summary type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/05/30/new-paper-pybiodatafuse.html">
      <![CDATA[ The number of data and knowledge source relevant to your biological or chemical question increases every year. They all come with different API and different data models. These need to be documented and mapped. What better way to do that than actually do that and then use that. I never asked, but I can imagine that was the original idea of Tooba and Yojana. At the very least, it demonstrates the level of interoperability we need in the life sciences. ]]>
    </summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/pyBiodatafuseGraph.png"/>
    <media:content xmlns:media="http://search.yahoo.com/mrss/" medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/pyBiodatafuseGraph.png"/></entry>
  
  <entry>
    <title type="html">FAIR Implementation Profiles: Chemistry</title>
    <link href="https://chem-bla-ics.linkedchemistry.info/2026/05/05/fair-implementation-profiles.html" rel="alternate" type="text/html" title="FAIR Implementation Profiles: Chemistry"/>
    <published>2026-05-05T00:00:00+00:00</published>
    <updated>2026-05-05T00:00:00+00:00</updated>
    <id>https://doi.org/10.59350/x115q-m5f95</id>
    <content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/05/05/fair-implementation-profiles.html">
      <![CDATA[ <p>I have had this on my todo list for way too long: writing about <a href="https://www.go-fair.org/how-to-go-fair/fair-implementation-profile/">FAIR Implementation Profiles</a>,
or FIPs for short (see also doi:<a href="https://doi.org/10.1007/978-3-030-65847-2_13">10.1007/978-3-030-65847-2_13</a>):</p>

<blockquote>
  <p>The FIP is a collection of FAIR implementation choices made by a community of practice for each of the FAIR Principles.</p>
</blockquote>

<p>In the early GO FAIR days, people referred to the <em>challenges and choices</em>. FIPs are a formal approach to report the choices.
The days of the <em>implementation networks</em>, like the Chemistry Implementation Network (doi:<a href="https://doi.org/10.1162/dint_a_00035">10.1162/dint_a_00035</a>),
or the AdvancedNano network (doi:<a href="https://doi.org/10.1016/j.impact.2024.100513">10.1016/j.impact.2024.100513</a>). For the first,
we absolutely agreed on the InChI, for example, but we never wrote that down as a FIP. But even without FIPs, the
idea was that communities could learn from each other, could converge. This is where the term
<a href="https://www.go-fair.org/today/fair-matrix/">FAIR Convergence Matrix</a> comes from. (I had forgotten I was actually
part of the matching Working Group. If only I had dedicated funding for it at the time.)</p>

<p>Now, I had on my wishlist to write up a list over FIPs around chemistry. This is relevant for the
<a href="https://video.edu.nl/w/pjf5vFFU287AGfYA2SXpE5">FAIR4ChemNL</a> project, but obviously beyond that. And while we did
a lot of FAIRification work in, particularly, the nanosafety cluster projects (eNanoMapper, NanoCommons, NanoSolveIT,
RiskGONE, and SbD4Nano), a lot of this never was formalized as FIPs. Second, various relevant standards have been
proposed in chemistry, for <a href="https://doi.org/10.1186/s13321-021-00520-4">chemical compounds</a> and
<a href="https://doi.org/10.1021/acsenvironau.5c00314">transformation products</a>, among plenty of other things.
But during the first <a href="https://elixir-europe.org/communities/toxicology">ELIXIR Toxicology Community</a>
workshop in Utrecht, we had FIPs on the agenda too (see <a href="https://doi.org/10.37044/osf.io/un2rw">this report</a>).</p>

<h2 id="fips-in-chemistry">FIPs in Chemistry</h2>

<p>But while I had already several browser tabs open for months, I never could find the courage to start making the list. Silly.
Therefore, this list should be considered a start. I don’t think it is exhaustive. I know there is an index somewhere,
but I cannot find back that browser tab. Additions are most welcome: I will update this post (thanks to git and Rogue Scholar).
In brackets I list (a selection of) the FAIR enabling resources mentioned in that FIP.</p>

<ul>
  <li><a href="https://fip-wizard.ds-wizard.org/wizard/projects/2f1c0e80-3c9d-4967-9dc6-fd05fac96269/metrics">Adverse Outcome Pathways FIP</a> (DOI, AOP Wiki ID)</li>
  <li><a href="https://docs.google.com/spreadsheets/d/1yNEYzJRbx10RkuqJmLcO7RPOq-nrimUKQLQW6uWzfho/edit?gid=127295437#gid=127295437">Toxicogenomics</a> (ISA-Tab, BioStudies, ArrayExpress)</li>
  <li><a href="https://fip-wizard.ds-wizard.org/wizard/projects/b0d84171-4556-46ed-9ace-c1c58db38092">WorldFAIR WP04 NANOMATERIALS FIP01</a> (DOI, ORCID, UUID, ROR, InChI, DataCite, QMRF, RDF/JSON-LD, OWL; doi<a href="https://doi.org/10.5281/zenodo.7378109">10.5281/zenodo.7378109</a>)</li>
</ul>

<p>Is that all? No, I do not think so, but this is all I can easily find (ironically). Well, I guess you now see why I have had a so much trouble starting to write down this post…
Someone will surely give me some pointers…</p>

<h2 id="ackknowledgments">Ackknowledgments</h2>

<p>I also like to thank all the people involved in these FIPs. I strongly encourage you to look up all the people who
wrote them down. I also like to thank the INTOXICOM workshop FIP experts Iseult Lynch and Gerhard Burger for
sharing their knowledge.</p>

<p>Finally, I recommend checking out the <a href="https://fip.fair-wizard.com/">FIP Wizard</a>.</p>

      <h4>References</h4>
      <ul>
      
        <li><a href="https://doi.org/10.1162/dint_a_00035">10.1162/dint_a_00035</a></li>
      
        <li><a href="https://doi.org/10.1016/J.IMPACT.2024.100513">10.1016/J.IMPACT.2024.100513</a></li>
      
        
      
        
      
        <li><a href="https://doi.org/10.37044/OSF.IO/UN2RW">10.37044/OSF.IO/UN2RW</a></li>
      
        
      
        <li><a href="https://doi.org/10.5281/zenodo.7378109">10.5281/zenodo.7378109</a></li>
      
      </ul>
      ]]>
    </content>
    
    
      <author><name>Egon Willighagen</name><uri>https://orcid.org/0000-0001-7542-0286</uri></author>
    
    <category term="fair"/><category term="doi:10.1162/dint_a_00035"/><category term="doi:10.1016/J.IMPACT.2024.100513"/><category term="cito:citesAsPotentialSolution:10.1186/s13321-021-00520-4"/><category term="cito:citesAsPotentialSolution:10.1021/acsenvironau.5c00314"/><category term="doi:10.37044/OSF.IO/UN2RW"/><category term="cito:citesForInformation:10.1007/978-3-030-65847-2_13"/><category term="justdoi:10.5281/zenodo.7378109"/><category term="fair4chemnl"/>
    
    <summary type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/05/05/fair-implementation-profiles.html">
      <![CDATA[ I have had this on my todo list for way too long: writing about FAIR Implementation Profiles, or FIPs for short (see also doi:10.1007/978-3-030-65847-2_13): ]]>
    </summary></entry>
  
  <entry>
    <title type="html">One Million IUPAC names #5: a new approach and 400k names</title>
    <link href="https://chem-bla-ics.linkedchemistry.info/2026/05/02/one-million-iupac-names-5-a-new-approach.html" rel="alternate" type="text/html" title="One Million IUPAC names #5: a new approach and 400k names"/>
    <published>2026-05-02T00:00:00+00:00</published>
    <updated>2026-05-02T00:00:00+00:00</updated>
    <id>https://doi.org/10.59350/gqtbx-jta57</id>
    <content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/05/02/one-million-iupac-names-5-a-new-approach.html">
      <![CDATA[ <p>About fifteen months ago a new project started: <a href="https://chem-bla-ics.linkedchemistry.info/2025/03/08/iupac-names.html">One Million IUPAC names</a>:</p>

<blockquote>
  <p>Thus, the idea came up, can we create a set of 1 million unique IUPAC names found in literature?</p>
</blockquote>

<p>We started out with using <a href="https://europepmc.org/">Europe PMC</a> to get JATS XML files for the full texts of open access articles.
Parsing the XML is easy and the text paragraphs are passed through OSCAR and OPSIN. That has not changed.</p>

<p>What did change last weekend is something I had long on my todo list (but life interfered). The first approach
was to ask for named entities using the Europe PMC APIs. But I quickly realized that with OSCAR and OPSIN we could
get more names out of the articles. The next step was to move from Google Colab to a command line script.
That gave another boost, as explained in <a href="http://localhost:4000/2025/04/27/one-million-iupac-names-2-the-100-thousand-milestone.html">this second post in the series</a>.
We reached 200 thousand names in <a href="https://chem-bla-ics.linkedchemistry.info/2025/06/09/one-million-iupac-names.html">june 2025</a>
but then things slowed down again in the growth. <a href="https://chem-bla-ics.linkedchemistry.info/2025/08/09/one-million-iupac-names-4.html">Two months later</a>
we only had 75 thousand more. However, plenty of discussion was happening and there turned out to be
other, larger collections of IUPAC names under an open license. Millions of names, actually.</p>

<p>But another problem emerged. We were still using the Europe PMC API and were basically asking for open access
articles between two dates. Practically, the API could answer requests between 1 and max 3 days. Beyond that,
times outs and 404s became an issue. Moreover, because these dates are publications dates and not the dates
on which the JATS were deposited, I had to got back to previous months and redo the queries. That gave another
5 thousand names since last August. Something had to change.</p>

<h2 id="the-new-approach">The new Approach</h2>

<p>Europe PMC, however, also provides the JATS XML files as download on <a href="https://europepmc.org/ftp/oa/">their FTP site</a>.
Already that <a href="https://chem-bla-ics.linkedchemistry.info/2025/08/09/one-million-iupac-names-4.html">august 2025</a> I had
a prototype and knew it would change the game. These gzipped XML files are about 150 to 250 MB. Unzipped, about 1 GB each.
Better, these files are based on Europe PMC identifiers, hopefully resolving the issue with using dates in the queries.</p>

<p>Now, parsing a 1 GB XML files is a total non-issue. I have done it plenty of times before. Just use a
<a href="https://en.wikipedia.org/wiki/Simple_API_for_XML">Simple API for XML</a> (SAX) parser. This is a streaming parser
giving you full control of how to parse things. It is ideal for this siutation: you just keep the current
paragraph of text in memory and release that when done with that paragraph. That is, you do not have to read
the full file in memory, just the bits you are interested in. I used this for my Chemical Markup Language
patches for Jmol and JChemPaint back in the nineties.</p>

<p>Last weekend I finally made the jump. Use SAX to extract the <code class="language-plaintext highlighter-rouge">&lt;p&gt;</code> elements one by one, running OSCAR on
them, filter with OPSIN, output that name, and clear the memory. Effectively, each gzipped file processes
with a Groovy script in about 1 to 2 hours.</p>

<p>The output is a mesmerizing stream of scientific literature (which I will use until someone points me to a Java
CLI library that creates a Matrix-style falling letters equivalent), tho less so as a static image:</p>

<p><img src="/assets/images/jats_analysis.png" alt="" /></p>

<p>In this plot, an <code class="language-plaintext highlighter-rouge">x</code> means a new article to be processed. Each <code class="language-plaintext highlighter-rouge">.</code> and <code class="language-plaintext highlighter-rouge">o</code> that follows is a single <code class="language-plaintext highlighter-rouge">&lt;p&gt;</code>
element and the difference is that an <code class="language-plaintext highlighter-rouge">o</code> means at least one IUPAC name was detected in the paragraph.</p>

<p>Each gzipped file gives 400 to 500 new IUPAC names. Indeed, going from 288 thousand to 300 thousand
was a matter of a day and a half. And earlier this afternoon we passed the 400 thousand IUPAC names.
With about 230 gzipped files. Now, I am going back in time, and the sizes of these files are shrinking:
Another 500 files and the size has dropped to around 125 MB, so a rough estimate suggests that
we will end up with 650 to 700 thousand names this way. This will be completed in a few weeks (and mostly
because I need to focus first on other things again, because I can use our computing cluster do this).</p>

<p>Regarding the original goal, fortunately, we are still publishing at a higher rate every year, and
more and more articles are available as open access. So, I still have good hopes we will reach the
<em>1 million IUPAC names</em>. Also, keep in mind, we know how to boost this by simple name variations to
several millions, even with the <a href="https://codeberg.org/BlueObelisk/iupac-names/commit/30ddfd96c3ec6e6a5840be0ada1bdbd40972490e">400 thousand</a>
we have today.</p>

<p>Oh, and <a href="https://github.com/BlueObelisk/iupac-names/issues/4">our next milestone</a> will be in the pocket
before I visit <a href="https://cheminf.uni-jena.de/">Christoph Steinbeck’s cheminformatics team</a> in Jena!</p>

      <h4>References</h4>
      <ul>
      
      
      
      </ul>
      ]]>
    </content>
    
    
      <author><name>Egon Willighagen</name><uri>https://orcid.org/0000-0001-7542-0286</uri></author>
    
    <category term="iupac"/><category term="textmining"/><category term="xml"/><category term="europepmc"/>
    
    <summary type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/05/02/one-million-iupac-names-5-a-new-approach.html">
      <![CDATA[ About fifteen months ago a new project started: One Million IUPAC names: ]]>
    </summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/jats_analysis.png"/>
    <media:content xmlns:media="http://search.yahoo.com/mrss/" medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/jats_analysis.png"/></entry>
  
  <entry>
    <title type="html">Open Science Festival Limburg</title>
    <link href="https://chem-bla-ics.linkedchemistry.info/2026/04/27/open-science-festival-limburg.html" rel="alternate" type="text/html" title="Open Science Festival Limburg"/>
    <published>2026-04-27T00:00:00+00:00</published>
    <updated>2026-04-27T00:00:00+00:00</updated>
    <id>https://doi.org/10.59350/0qjrj-59n30</id>
    <content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/04/27/open-science-festival-limburg.html">
      <![CDATA[ <p>One of the things I have been busy with in the past weeks (besides contributing to grant proposals) is the organization
of the June 11-12 <a href="https://www.openscience-maastricht.nl/events/open-science-festival-2026/">Open Science Festival Limburg</a>!
Open Science Festivals provide a great platform to talk science in an open way, with people that all find reuse of
science more important. The creativity present at such events is just so energizing.</p>

<p>This is the second time a local Open Science Festival is organized in the area, with an
<a href="https://www.openscience-maastricht.nl/events/open-science-festival-2023/">Open Science Festival Maastricht</a> in 2023.
Of course, a year later Maastricht also hosted the <a href="https://opensciencefestival.nl/open-science-festival-2024">2024 national Open Science Festival</a>.</p>

<p>This year’s event is co-organized by the (new) Open Science Community Parkstad and the
<a href="https://opensciencefestival.nl/open-science-festival-2024">Open Science Community Maastricht</a>,
with interdisciplinarity as main theme. The event will take day over two days, <a href="https://www.openscience-maastricht.nl/11-june-2026/">June 11 in Heerlen</a>,
and <a href="https://www.openscience-maastricht.nl/12-june-2026/">June 12 in Maastricht</a>. You can <a href="https://www.openscience-maastricht.nl/events/open-science-festival-2026/">register</a>
for either or for both days. Each days have plenary sessions and several parallel workshops,
where you will dive into one of the many aspects of open science.</p>

<p>Looking forward to welcoming you in Limburg!</p>

      <h4>References</h4>
      <ul>
      
      </ul>
      ]]>
    </content>
    
    
      <author><name>Egon Willighagen</name><uri>https://orcid.org/0000-0001-7542-0286</uri></author>
    
    <category term="openscience"/><category term="osflimburg"/>
    
    <summary type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/04/27/open-science-festival-limburg.html">
      <![CDATA[ One of the things I have been busy with in the past weeks (besides contributing to grant proposals) is the organization of the June 11-12 Open Science Festival Limburg! Open Science Festivals provide a great platform to talk science in an open way, with people that all find reuse of science more important. The creativity present at such events is just so energizing. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">CiTO annotations with Zotero 8 and Google Docs</title>
    <link href="https://chem-bla-ics.linkedchemistry.info/2026/04/15/cito-annotations-with-zotero-8-and-google-docs.html" rel="alternate" type="text/html" title="CiTO annotations with Zotero 8 and Google Docs"/>
    <published>2026-04-15T00:00:00+00:00</published>
    <updated>2026-04-15T00:00:00+00:00</updated>
    <id>https://doi.org/10.59350/8zeka-ahq74</id>
    <content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/04/15/cito-annotations-with-zotero-8-and-google-docs.html">
      <![CDATA[ <p>This is a bit of work I did already in March, but with the <a href="https://www.zotero.org/blog/zotero-9/">Zotero 9 release</a> I was reminded
that I wanted to blog this. Ideally, it will trigger some further discussion and maybe a future Zotero/Google Docs version supports
bibliography-level annotations too.</p>

<p>Still, <a href="https://www.zotero.org/blog/zotero-8/">Zotero</a> 8 brought in <em>prefix</em> and <em>suffix</em> support, and I was wondering if this could
be used for CiTO citation intent annotations. And it can. This was the result, and make of it what you want:</p>

<p><img src="/assets/images/zotero_notes.png" alt="" /></p>

      <h4>References</h4>
      <ul>
      
      
      </ul>
      ]]>
    </content>
    
    
      <author><name>Egon Willighagen</name><uri>https://orcid.org/0000-0001-7542-0286</uri></author>
    
    <category term="cito"/><category term="zotero"/><category term="google"/>
    
    <summary type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/04/15/cito-annotations-with-zotero-8-and-google-docs.html">
      <![CDATA[ This is a bit of work I did already in March, but with the Zotero 9 release I was reminded that I wanted to blog this. Ideally, it will trigger some further discussion and maybe a future Zotero/Google Docs version supports bibliography-level annotations too. ]]>
    </summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/zotero_notes.png"/>
    <media:content xmlns:media="http://search.yahoo.com/mrss/" medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/zotero_notes.png"/></entry>
  
  <entry>
    <title type="html">SWAT4HCLS 2026</title>
    <link href="https://chem-bla-ics.linkedchemistry.info/2026/04/04/swat4hcls-2026.html" rel="alternate" type="text/html" title="SWAT4HCLS 2026"/>
    <published>2026-04-04T16:54:00+00:00</published>
    <updated>2026-04-04T16:54:00+00:00</updated>
    <id>https://doi.org/10.59350/bmxve-vry14</id>
    <content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/04/04/swat4hcls-2026.html">
      <![CDATA[ <p>A bit over a week ago, <a href="https://www.swat4ls.org/workshops/amsterdam2026/">SWAT4HCLS 2026</a> took place, with the matching
<a href="https://www.swat4ls.org/workshops/amsterdam2026/swat4hcls-biohackathon-2026/">biohackathon</a> on Thursday (see
<a href="https://chem-bla-ics.linkedchemistry.info/2026/03/22/swat4hcls-2026-amsterdam-this-week.html">this post</a>.
I attempted a bit of live coverage on mastodon: <a href="https://social.edu.nl/@egonw/116285060969709401">day 1</a> and
<a href="https://social.edu.nl/@egonw/116289579219485790">day 2</a>. But it seems the semantic web community interested
in SWAT4HCLS has not found the fediverse yet. So, make sure to check
<a href="https://www.swat4ls.org/workshops/amsterdam2026/programme/accepted-submissions/">this full list of abstracts</a>.</p>

<p>The meeting consisted of <a href="https://www.swat4ls.org/workshops/amsterdam2026/programme/keynotes/">four keynotes</a>, each
one was quite interesting. Cornet gave a nice historic perspective of the venue and of the semantic web field,
which is a great way to welcome the participants to your institute. The talk also touches on the main theme
of the meeting: clinical data. It is a long standing (and important) research field, but progress is slow.
Cornet <a href="https://social.edu.nl/@egonw/116283216644714695">comments</a> along the lines that <em>we have been talking
about reasoning over patient data for more than twenty years, but we still have not solve it</em>.</p>

<p>The problem is really not only privacy, but simple also lack of a common language. As
<a href="https://qlever.scholia.wiki/orcid/0000-0003-3248-7899">Sabine Österle</a> explains
about sharing health/patient data in Switzerland, across 26 kantons and legislations and 4 national languages.
Another issue is more technical, running SPARQL across hospitals involves more than just aligning ontologies,
but also requires (too much) fiddling with SPARQL queries.</p>

<p>There was plenty of other content too, however. For example, I was pleasantly
<a href="https://social.edu.nl/@egonw/116284409447761902">surprised</a> by the
<a href="https://www.swat4ls.org/workshops/amsterdam2026/programme/accepted-submissions/#RDF4RiskAssessment_Toolkit_A_Toolkit_for_Converting_Tabular_Research_Data_to_FAIR_RDF_for_Risk_Assessment_and_Life_Sciences">RDF4RiskAssessment</a>
work, the <a href="https://www.swat4ls.org/workshops/amsterdam2026/programme/accepted-submissions/#RO-Crates_for_BioImaging">RO-Crates for BioImaging</a>,
and <a href="https://www.swat4ls.org/workshops/amsterdam2026/programme/accepted-submissions/#FDPcrawleR_A_Lightweight_R_Framework_for_Auditing_FAIR_Data_Points_and_FAIR_Virtual_Platforms">FDPcrawleR</a>.
All these projects have direct links to research ongoing in <a href="https://www.maastrichtuniversity.nl/research/translational-genomics">our TGX team</a>.</p>

<p><a href="https://qlever.scholia.wiki/orcid/0000-0003-1213-6776">Hanna Bast</a> gave the second keynote of the first day, about <a href="https://qlever.dev/">QLever</a>
(doi:<a href="https://doi.org/10.1145/3132847.3132921">10.1145/3132847.3132921</a>). She talked about some of the recent improvements,
something we really <a href="https://chem-bla-ics.linkedchemistry.info/2026/02/28/rescuing-scholia-3-we-did-it.html">needed for Scholia</a>.
She showed a technical approach to make federated queries faster, tho it currently only works between endpoints
that both run QLever. One thing I am looking forward to, is playing with the notion of
<a href="https://docs.qlever.dev/materialized-views/?h=materialize">materialized views</a>, but the biohackathon
was too short to get around to that during the Thursday.</p>

<p>The second day kicked off with a keynote by <a href="https://qlever.scholia.wiki/orcid/0000-0002-3469-4923">Janna Hastings</a>,
whose work I greatly admire. I was not disappointed today, and she showed the
<a href="https://www.bciontology.org/">Behaviour Change Intervention Ontology</a> and <a href="https://chebifier.hastingslab.org/">Chebifier</a>
(doi:<a href="https://doi.org/10.1039/D3DD00238A">10.1039/D3DD00238A</a>).</p>

<p>The last talk I want to mention in the blog is by two researcher working with Michel Dumontier. They
<a href="https://www.swat4ls.org/workshops/amsterdam2026/programme/accepted-submissions/#Embedding-based_Deduplication_of_Knowledge_Graphs_using_Graph_Neural_Networks">presented</a>
a study about deduplication in/of knowledge graphs. This is something I want to read in more detail.</p>

      <h4>References</h4>
      <ul>
      
      
        <li><a href="https://doi.org/10.1039/D3DD00238A">10.1039/D3DD00238A</a></li>
      
        <li><a href="https://doi.org/10.1145/3132847.3132921">10.1145/3132847.3132921</a></li>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Egon Willighagen</name><uri>https://orcid.org/0000-0001-7542-0286</uri></author>
    
    <category term="swat4ls"/><category term="mastodon"/><category term="justdoi:10.1039/D3DD00238A"/><category term="justdoi:10.1145/3132847.3132921"/>
    
    <summary type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/04/04/swat4hcls-2026.html">
      <![CDATA[ A bit over a week ago, SWAT4HCLS 2026 took place, with the matching biohackathon on Thursday (see this post. I attempted a bit of live coverage on mastodon: day 1 and day 2. But it seems the semantic web community interested in SWAT4HCLS has not found the fediverse yet. So, make sure to check this full list of abstracts. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">Using compact identifiers in project reports</title>
    <link href="https://chem-bla-ics.linkedchemistry.info/2026/03/29/using-compact-identifiers-in-project-reports.html" rel="alternate" type="text/html" title="Using compact identifiers in project reports"/>
    <published>2026-03-29T00:00:00+00:00</published>
    <updated>2026-03-29T00:00:00+00:00</updated>
    <id>https://doi.org/10.59350/re9j2-hk972</id>
    <content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/03/29/using-compact-identifiers-in-project-reports.html">
      <![CDATA[ <p>This document describes how you can improve the FAIR-ness of your project report by using
compact identifiers. Of course, it can be applied to any other document too, and has been used
in, for example, journal articles and online documentation already.</p>

<p>Compact identifiers find a balance between compactness in writing and being a persistent, unique,
and global identifier. It “is a string constructed by concatenating a namespace prefix, a separating colon,
and a locally unique identifier (LUI)” (doi:<a href="https://doi.org/10.1038/sdata.2018.29">10.1038/sdata.2018.29</a>).
For example, for proteins it can represent the PDB structure <a href="https://bioregistry.io/pdb:2gc4">2gc4</a> as
<em>pdb:2gc4</em>. There is a clear similarity with the SciCrunch <a href="https://rrid.site/">Research Resource Identifiers</a>
(RRIDs) as used by several journals, like
<a href="https://elifesciences.org/inside-elife/ff683ecc/rrids-how-did-we-get-here-and-where-are-we-going">eLife</a>
(doi:<a href="https://doi.org/10.1007/s12021-015-9284-3">10.1007/s12021-015-9284-3</a>).</p>

<p>When the prefixes are defined by community standards, then a compact identifier can be resolved.
There currently are multiple providers of prefix files (doi:<a href="https://doi.org/10.1038/sdata.2018.29">10.1038/sdata.2018.29</a>),
including Identifiers.org (doi:<a href="https://doi.org/10.1093/bioinformatics/btaa864">10.1093/bioinformatics/btaa864</a>)
and Bioregistry (doi:<a href="https://doi.org/10.1038/s41597-022-01807-3">10.1038/s41597-022-01807-3</a>).
The Bioregistry has an overview of more than twenty registries of prefixes and their metadata
(doi:<a href="https://doi.org/10.1038/s41597-022-01807-3">10.1038/s41597-022-01807-3</a>). The metadata commonly
includes information on the URL pattern for each identifier. Often this is more than one pattern, as
there may more several databases with information for the same identifier.</p>

<p>It is the URL pattern in the database that allows services to <em>resolve</em> the compact identifier
into a link to a database. The above registries correspond to three existing <em>resolvers</em> that will take a compact
identifier as part of a resolver URL and redirect to the database with the record matching
that identifier:</p>

<ul>
  <li>Name-to-Thing (N2T): <a href="https://n2t.net/">https://n2t.net/</a></li>
  <li>Identifiers.org: <a href="https://identifiers.org/">https://identifiers.org/</a></li>
  <li>The Bioregistry: <a href="https://bioregistry.io/">https://bioregistry.io/</a></li>
</ul>

<p>Each of these URLs can be extended with a compact identifier. For example, a taxon record
from the NCBI databases or the PDB entry mentioned earlier:</p>

<ul>
  <li><a href="https://bioregistry.io/pdb:2gc4">https://bioregistry.io/pdb:2gc4</a></li>
  <li><a href="https://identifiers.org/col:6MB3T">https://identifiers.org/col:6MB3T</a> (<code class="language-plaintext highlighter-rouge">col</code> is the prefix for the Catalogue of Life)</li>
</ul>

<h2 id="why-use-in-reports">Why use in reports?</h2>

<p>Using persistent identifiers is generally accepted as a good practice that benefits science
and has been part of the ideas of FAIR data (doi:<a href="https://doi.org/10.1038/sdata.2016.18">10.1038/sdata.2016.18</a>)
and of Open Science. Compact
identifiers make it easy to be precise in reports about what things the reports talk about: they
are relatively short but very precise at the same time. also, that has the benefit that they
are much easier to reuse than labels of things and concepts that intrinsically have a certain
level of uncertainty; a database entry has commonly a very specific meaning.</p>

<h2 id="examples-uses">Examples uses</h2>

<p>The use of compact identifiers can be used in two ways. The simplest is to just put the
compact identifier as plain text in the document, possibly in parentheses
(with the compact identifier highlighted here in bold):</p>

<ul>
  <i>This report is only about the experimental data of the human (<b>NCBITaxon:9606</b>) cell lines.</i>
</ul>

<p>Or:</p>

<ul>
  <i>We found that BRCA1 (<b>ensembl:ENSG00000012048</b>) played an important role.</i>
</ul>

<p>Alternatively, you can add a hyperlink with one of the resolvers, for example, Identifiers.org:</p>

<ul>
  <i>We found that BRCA1 (<b><a href="https://identifiers.org/ensembl:ENSG00000012048">ensembl:ENSG00000012048</a></b>) played an important role.</i>
</ul>

<h3 id="compact-identifiers-for-material-identifiers">Compact identifiers for material identifiers</h3>

<p>The European Registry of Materials proposes to use the compact identifier for their
ERM identifiers (doi:<a href="https://doi.org/10.1186/s13321-022-00614-7">10.1186/s13321-022-00614-7</a>):</p>

<ul>
  <i>
    For example, the NanoSolveIT project registered a material with the ERM00000001 identifier.
    The full Uniform Resource Identifier (URI) for this compound is
    https://nanocommons.github.io/identifiers/registry#ERM00000001 which is too long to be used
    in documentation. The corresponding compact identifier <b>erm:ERM00000001</b> is easy to use in written
    material, analogous to the use of Protein Data Bank (PDB) identifiers for proteins in journals.
  </i>
</ul>

<h3 id="compact-identifiers-for-citation-intent-annotations">Compact identifiers for citation intent annotations</h3>

<p>The compact identifier has also been used as the method to include citation intentions in journal
articles (doi:<a href="https://doi.org/10.1186/s13321-020-00448-1">10.1186/s13321-020-00448-1</a>,
compact identifier here highlighted in bold):</p>

<ul>
  <i>
    We take advantage here of the ability to add notes to full form [..] references in bibliographies.
    These are referred to as bibnotes. The content of the note will be strictly formatted: it will use
    the syntax [<b>cito:usesMethodIn</b>] and formatted in bold. That is, the bibnote starts with the
    [ character, followed by one of the CiTO types, and ends with the ] character. If you wish to
    provide more than one annotation, you can repeat this syntax, separated by one or more spaces,
    for example: [<b>cito:usesMethodIn</b>] [<b>cito:citeAsAuthority</b>].
  </i>
</ul>

<p>Note that in this use, the square brackets and bold typeface are used to make them easier to
be recognized. Also, note that this document uses this approach to indicate the intention of
why the cited articles are cited.</p>

<h2 id="conclusion">Conclusion</h2>

<p>This document described what the compact identifier is, how it helps linking to online
databases, and how they can be used in written reports as plain text, optionally
hyperlinked with one of the compact identifier resolvers.</p>

<h3 id="acknowledgments">Acknowledgments</h3>

<p>I thank <a href="https://n2t.net/github:tabbassidaloii">github:tabbassidaloii</a>,
<a href="https://n2t.net/github:cthoyt">github:cthoyt</a>, and
<a href="https://n2t.net/github:larsgw">github:larsgw</a> for their comment on
<a href="https://github.com/egonw/compact-ids-in-reports">this GitHub repo</a>.</p>

      <h4>References</h4>
      <ul>
      
      
      
        
      
        
      
        
      
        
      
        
      
        
      
        
      </ul>
      ]]>
    </content>
    
    
      <author><name>Egon Willighagen</name><uri>https://orcid.org/0000-0001-7542-0286</uri></author>
    
    <category term="identifier"/><category term="semweb"/><category term="cito"/><category term="cito:usesMethodIn,includesQuotationFrom:10.1038/sdata.2018.29"/><category term="cito:obtainsBackgroundFrom:10.1007/s12021-015-9284-3"/><category term="cito:usesMethodIn:10.1093/bioinformatics/btaa864"/><category term="mycito:usesMethodIn:10.1038/S41597-022-01807-3"/><category term="cito:obtainsBackgroundFrom:10.1038/sdata.2016.18"/><category term="mycito:includesQuotationFrom:10.1186/S13321-022-00614-7"/><category term="mycito:includesQuotationFrom:10.1186/S13321-020-00448-1"/>
    
    <summary type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/03/29/using-compact-identifiers-in-project-reports.html">
      <![CDATA[ This document describes how you can improve the FAIR-ness of your project report by using compact identifiers. Of course, it can be applied to any other document too, and has been used in, for example, journal articles and online documentation already. ]]>
    </summary></entry>
  
  <entry>
    <title type="html">SWAT4HCLS 2026 Amsterdam this week</title>
    <link href="https://chem-bla-ics.linkedchemistry.info/2026/03/22/swat4hcls-2026-amsterdam-this-week.html" rel="alternate" type="text/html" title="SWAT4HCLS 2026 Amsterdam this week"/>
    <published>2026-03-22T00:00:00+00:00</published>
    <updated>2026-03-22T00:00:00+00:00</updated>
    <id>https://doi.org/10.59350/mztnx-y1770</id>
    <content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/03/22/swat4hcls-2026-amsterdam-this-week.html">
      <![CDATA[ <p>Tomorrow, <a href="https://www.swat4ls.org/workshops/amsterdam2026/">SWAT4HCLS 2026</a> will start, again in Amsterdam.
The first SWAT4LS I attended <a href="https://chem-bla-ics.linkedchemistry.info/2009/11/21/swat4ls-linking-open-drug-data-to.html">was also in Amsterdam</a>, and the second meeting in Amsterdam I was <a href="https://chem-bla-ics.linkedchemistry.info/2016/12/18/my-swat4ls-poster-about-enanomapper.html">also there</a>. And I was in <a href="https://www.swat4ls.org/workshops/cambridge2015/index.php">Cambridge</a> (see
<a href="https://chem-bla-ics.blogspot.com/2015/12/swat4ls-in-cambridge.html">this post</a>),
<a href="https://www.swat4ls.org/workshops/antwerp2018/">Antwerp</a>  (no post), and at least to one of the two
<a href="https://www.swat4ls.org/workshops/leiden2024/">Leiden</a> meetings (also no posts, it seems).</p>

<p>I am looking forward to meet old friends, new friends (some whom I never met in person), and
recent collaborators (that I never met in person).
For those who will not be in Amsterdam, you can follow the meeting on social media with
the <a href="https://hashtags-hub.toolforge.org/swat4hcls">hashtag #swat4hcls</a>. And there is also
<a href="https://fediwall.biohackrxiv.org/">this BioHackrXiv Fediwall</a>, for those in the
<a href="https://en.wikipedia.org/wiki/Fediverse">fediverse</a>.</p>

<h3 id="scholia-demo">Scholia demo</h3>

<p>I will give a demo to update people on the work in the <a href="https://github.com/wdscholia/scholia">Scholia</a> project with
Daniel Mietchen, Peter Patel-Schneider, Konrad Linden, Johannes Kalmbach,
Lars Willighagen, Wolfgang Fahl, and Hannah Bast (also keynote in Amsterdam)
to <a href="https://chem-bla-ics.linkedchemistry.info/2026/02/28/rescuing-scholia-3-we-did-it.html">update the SPARQL queries</a>
we use to visualize data in <a href="https://www.wikidata.org/">Wikidata</a> to SPARQL 1.1 so that it can run on
<a href="https://qlever.dev/">Qlever</a>.
The abstract can be <a href="https://commons.wikimedia.org/wiki/File:Scholia_2026_Compliance_with_SPARQL_1.1.pdf">found in Wikimedia Commons</a>.</p>

<p>This was the outcome of many years figuring how to ensure Scholia could remain working. The
<a href="https://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2026-02-17/Technology_report">Wikidata RDF graph split</a>
has given us many headaches, so many that just before christmas it became it could
be possible to survive the split, I was so happy, I realize I want to share this news. So, we teamed
up and wrote this demonstration contribution abstract. Thanks to everyone who made this happen!
Just to be clear, we are not done yet. The system is not running outside the Wikimedia Foundation
platforms.</p>

<p>One of the reviewer comments requested <a href="https://qlever.scholia.wiki/event/Q138033585">a Scholia page for the meeting</a>.
It has not been updated for the accepted speakers, but you can look at <a href="https://qlever.scholia.wiki/event-series/Q56846035">pages for past meetings</a>
to get an idea what you will find.</p>

<h3 id="swat4hcls-biohackathon-2026">SWAT4HCLS Biohackathon 2026</h3>

<p>There will also be <a href="https://www.swat4ls.org/workshops/amsterdam2026/swat4hcls-biohackathon-2026/">a biohackathon again</a>,
of course, with the <a href="https://index.biohackrxiv.org/tag/SWAT4HCLS26">option for BioHackRxiv reports</a>.
There are already <a href="https://www.swat4ls.org/workshops/amsterdam2026/swat4hcls-biohackathon-2026/">several pitches</a>,
including one that I submitted about Scholia.</p>

      <h4>References</h4>
      <ul>
      
      
      
      </ul>
      ]]>
    </content>
    
    
      <author><name>Egon Willighagen</name><uri>https://orcid.org/0000-0001-7542-0286</uri></author>
    
    <category term="rdf"/><category term="sparql"/><category term="swat4ls"/><category term="wikidata"/>
    
    <summary type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/03/22/swat4hcls-2026-amsterdam-this-week.html">
      <![CDATA[ Tomorrow, SWAT4HCLS 2026 will start, again in Amsterdam. The first SWAT4LS I attended was also in Amsterdam, and the second meeting in Amsterdam I was also there. And I was in Cambridge (see this post), Antwerp (no post), and at least to one of the two Leiden meetings (also no posts, it seems). ]]>
    </summary></entry>
  
  <entry>
    <title type="html">CDK 2.12</title>
    <link href="https://chem-bla-ics.linkedchemistry.info/2026/03/08/cdk-2.12.html" rel="alternate" type="text/html" title="CDK 2.12"/>
    <published>2026-03-08T00:00:00+00:00</published>
    <updated>2026-03-08T00:00:00+00:00</updated>
    <id>https://doi.org/10.59350/gw9at-srp84</id>
    <content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/03/08/cdk-2.12.html">
      <![CDATA[ <p><a href="https://github.com/cdk/cdk/releases/tag/cdk-2.12">Version 2.12</a> of the <a href="https://cdk.github.io/">Chemistry Development Kit</a> has been released.
It is the last release with contributions by <a href="https://www.nwo.nl/en/projects/osf232097">our NWO Open Science grant</a>.
This release adds some nice new APIs:</p>

<ul>
  <li>harmonize hydrogens to various states: depiction, stereo, minimal, and unsafe (useful for depictions)</li>
  <li>generate wedge bonds based on coordinates and stereochemistry</li>
  <li>more Markush / RGroup support</li>
  <li>atropisomers via CXSMILES</li>
  <li>sugar extraction</li>
</ul>

<p>I also update the following libraries/tools to use CDK 2.12:</p>

<ul>
  <li><a href="https://github.com/enanomapper/nanojava/releases/tag/nanojava-2.0.6">NanoJava 2.16</a></li>
  <li><a href="https://github.com/egonw/bacting/releases/tag/bacting-1.0.10">Bacting 1.0.10</a> (and the Python pyBacting will follow asap)</li>
</ul>

      <h4>References</h4>
      <ul>
      
      
        <li><a href="https://doi.org/10.5281/zenodo.18850648">10.5281/zenodo.18850648</a></li>
      </ul>
      ]]>
    </content>
    
    
      <author><name>Egon Willighagen</name><uri>https://orcid.org/0000-0001-7542-0286</uri></author>
    
    <category term="cdk"/><category term="openscience"/><category term="justdoi:10.5281/zenodo.18850648"/>
    
    <summary type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/03/08/cdk-2.12.html">
      <![CDATA[ Version 2.12 of the Chemistry Development Kit has been released. It is the last release with contributions by our NWO Open Science grant. This release adds some nice new APIs: ]]>
    </summary></entry>
  
</feed>
