<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.3.4">Jekyll</generator><link href="https://chem-bla-ics.linkedchemistry.info/feed/by_tag/cito.xml" rel="self" type="application/atom+xml" /><link href="https://chem-bla-ics.linkedchemistry.info/" rel="alternate" type="text/html" /><updated>2026-05-06T08:00:30+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/feed/by_tag/cito.xml</id><title type="html">chem-bla-ics</title><subtitle>Chemblaics (pronounced chem-bla-ics) is the science that uses open science and computers to solve problems in chemistry, biochemistry and related fields.</subtitle><author><name>Egon Willighagen</name></author><entry><title type="html">CiTO annotations with Zotero 8 and Google Docs</title><link href="https://chem-bla-ics.linkedchemistry.info/2026/04/15/cito-annotations-with-zotero-8-and-google-docs.html" rel="alternate" type="text/html" title="CiTO annotations with Zotero 8 and Google Docs" /><published>2026-04-15T00:00:00+00:00</published><updated>2026-04-15T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2026/04/15/cito-annotations-with-zotero-8-and-google-docs</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/04/15/cito-annotations-with-zotero-8-and-google-docs.html"><![CDATA[<p>This is a bit of work I did already in March, but with the <a href="https://www.zotero.org/blog/zotero-9/">Zotero 9 release</a> I was reminded
that I wanted to blog this. Ideally, it will trigger some further discussion and maybe a future Zotero/Google Docs version supports
bibliography-level annotations too.</p>

<p>Still, <a href="https://www.zotero.org/blog/zotero-8/">Zotero</a> 8 brought in <em>prefix</em> and <em>suffix</em> support, and I was wondering if this could
be used for CiTO citation intent annotations. And it can. This was the result, and make of it what you want:</p>

<p><img src="/assets/images/zotero_notes.png" alt="" /></p>]]></content><author><name>Egon Willighagen</name></author><category term="cito" /><category term="zotero" /><category term="google" /><summary type="html"><![CDATA[This is a bit of work I did already in March, but with the Zotero 9 release I was reminded that I wanted to blog this. Ideally, it will trigger some further discussion and maybe a future Zotero/Google Docs version supports bibliography-level annotations too.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/zotero_notes.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/zotero_notes.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Using compact identifiers in project reports</title><link href="https://chem-bla-ics.linkedchemistry.info/2026/03/29/using-compact-identifiers-in-project-reports.html" rel="alternate" type="text/html" title="Using compact identifiers in project reports" /><published>2026-03-29T00:00:00+00:00</published><updated>2026-03-29T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2026/03/29/using-compact-identifiers-in-project-reports</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/03/29/using-compact-identifiers-in-project-reports.html"><![CDATA[<p>This document describes how you can improve the FAIR-ness of your project report by using
compact identifiers. Of course, it can be applied to any other document too, and has been used
in, for example, journal articles and online documentation already.</p>

<p>Compact identifiers find a balance between compactness in writing and being a persistent, unique,
and global identifier. It “is a string constructed by concatenating a namespace prefix, a separating colon,
and a locally unique identifier (LUI)” (doi:<a href="https://doi.org/10.1038/sdata.2018.29">10.1038/sdata.2018.29</a>).
For example, for proteins it can represent the PDB structure <a href="https://bioregistry.io/pdb:2gc4">2gc4</a> as
<em>pdb:2gc4</em>. There is a clear similarity with the SciCrunch <a href="https://rrid.site/">Research Resource Identifiers</a>
(RRIDs) as used by several journals, like
<a href="https://elifesciences.org/inside-elife/ff683ecc/rrids-how-did-we-get-here-and-where-are-we-going">eLife</a>
(doi:<a href="https://doi.org/10.1007/s12021-015-9284-3">10.1007/s12021-015-9284-3</a>).</p>

<p>When the prefixes are defined by community standards, then a compact identifier can be resolved.
There currently are multiple providers of prefix files (doi:<a href="https://doi.org/10.1038/sdata.2018.29">10.1038/sdata.2018.29</a>),
including Identifiers.org (doi:<a href="https://doi.org/10.1093/bioinformatics/btaa864">10.1093/bioinformatics/btaa864</a>)
and Bioregistry (doi:<a href="https://doi.org/10.1038/s41597-022-01807-3">10.1038/s41597-022-01807-3</a>).
The Bioregistry has an overview of more than twenty registries of prefixes and their metadata
(doi:<a href="https://doi.org/10.1038/s41597-022-01807-3">10.1038/s41597-022-01807-3</a>). The metadata commonly
includes information on the URL pattern for each identifier. Often this is more than one pattern, as
there may more several databases with information for the same identifier.</p>

<p>It is the URL pattern in the database that allows services to <em>resolve</em> the compact identifier
into a link to a database. The above registries correspond to three existing <em>resolvers</em> that will take a compact
identifier as part of a resolver URL and redirect to the database with the record matching
that identifier:</p>

<ul>
  <li>Name-to-Thing (N2T): <a href="https://n2t.net/">https://n2t.net/</a></li>
  <li>Identifiers.org: <a href="https://identifiers.org/">https://identifiers.org/</a></li>
  <li>The Bioregistry: <a href="https://bioregistry.io/">https://bioregistry.io/</a></li>
</ul>

<p>Each of these URLs can be extended with a compact identifier. For example, a taxon record
from the NCBI databases or the PDB entry mentioned earlier:</p>

<ul>
  <li><a href="https://bioregistry.io/pdb:2gc4">https://bioregistry.io/pdb:2gc4</a></li>
  <li><a href="https://identifiers.org/col:6MB3T">https://identifiers.org/col:6MB3T</a> (<code class="language-plaintext highlighter-rouge">col</code> is the prefix for the Catalogue of Life)</li>
</ul>

<h2 id="why-use-in-reports">Why use in reports?</h2>

<p>Using persistent identifiers is generally accepted as a good practice that benefits science
and has been part of the ideas of FAIR data (doi:<a href="https://doi.org/10.1038/sdata.2016.18">10.1038/sdata.2016.18</a>)
and of Open Science. Compact
identifiers make it easy to be precise in reports about what things the reports talk about: they
are relatively short but very precise at the same time. also, that has the benefit that they
are much easier to reuse than labels of things and concepts that intrinsically have a certain
level of uncertainty; a database entry has commonly a very specific meaning.</p>

<h2 id="examples-uses">Examples uses</h2>

<p>The use of compact identifiers can be used in two ways. The simplest is to just put the
compact identifier as plain text in the document, possibly in parentheses
(with the compact identifier highlighted here in bold):</p>

<ul>
  <i>This report is only about the experimental data of the human (<b>NCBITaxon:9606</b>) cell lines.</i>
</ul>

<p>Or:</p>

<ul>
  <i>We found that BRCA1 (<b>ensembl:ENSG00000012048</b>) played an important role.</i>
</ul>

<p>Alternatively, you can add a hyperlink with one of the resolvers, for example, Identifiers.org:</p>

<ul>
  <i>We found that BRCA1 (<b><a href="https://identifiers.org/ensembl:ENSG00000012048">ensembl:ENSG00000012048</a></b>) played an important role.</i>
</ul>

<h3 id="compact-identifiers-for-material-identifiers">Compact identifiers for material identifiers</h3>

<p>The European Registry of Materials proposes to use the compact identifier for their
ERM identifiers (doi:<a href="https://doi.org/10.1186/s13321-022-00614-7">10.1186/s13321-022-00614-7</a>):</p>

<ul>
  <i>
    For example, the NanoSolveIT project registered a material with the ERM00000001 identifier.
    The full Uniform Resource Identifier (URI) for this compound is
    https://nanocommons.github.io/identifiers/registry#ERM00000001 which is too long to be used
    in documentation. The corresponding compact identifier <b>erm:ERM00000001</b> is easy to use in written
    material, analogous to the use of Protein Data Bank (PDB) identifiers for proteins in journals.
  </i>
</ul>

<h3 id="compact-identifiers-for-citation-intent-annotations">Compact identifiers for citation intent annotations</h3>

<p>The compact identifier has also been used as the method to include citation intentions in journal
articles (doi:<a href="https://doi.org/10.1186/s13321-020-00448-1">10.1186/s13321-020-00448-1</a>,
compact identifier here highlighted in bold):</p>

<ul>
  <i>
    We take advantage here of the ability to add notes to full form [..] references in bibliographies.
    These are referred to as bibnotes. The content of the note will be strictly formatted: it will use
    the syntax [<b>cito:usesMethodIn</b>] and formatted in bold. That is, the bibnote starts with the
    [ character, followed by one of the CiTO types, and ends with the ] character. If you wish to
    provide more than one annotation, you can repeat this syntax, separated by one or more spaces,
    for example: [<b>cito:usesMethodIn</b>] [<b>cito:citeAsAuthority</b>].
  </i>
</ul>

<p>Note that in this use, the square brackets and bold typeface are used to make them easier to
be recognized. Also, note that this document uses this approach to indicate the intention of
why the cited articles are cited.</p>

<h2 id="conclusion">Conclusion</h2>

<p>This document described what the compact identifier is, how it helps linking to online
databases, and how they can be used in written reports as plain text, optionally
hyperlinked with one of the compact identifier resolvers.</p>

<h3 id="acknowledgments">Acknowledgments</h3>

<p>I thank <a href="https://n2t.net/github:tabbassidaloii">github:tabbassidaloii</a>,
<a href="https://n2t.net/github:cthoyt">github:cthoyt</a>, and
<a href="https://n2t.net/github:larsgw">github:larsgw</a> for their comment on
<a href="https://github.com/egonw/compact-ids-in-reports">this GitHub repo</a>.</p>]]></content><author><name>Egon Willighagen</name></author><category term="identifier" /><category term="semweb" /><category term="cito" /><category term="cito:usesMethodIn,includesQuotationFrom:10.1038/sdata.2018.29" /><category term="cito:obtainsBackgroundFrom:10.1007/s12021-015-9284-3" /><category term="cito:usesMethodIn:10.1093/bioinformatics/btaa864" /><category term="mycito:usesMethodIn:10.1038/S41597-022-01807-3" /><category term="cito:obtainsBackgroundFrom:10.1038/sdata.2016.18" /><category term="mycito:includesQuotationFrom:10.1186/S13321-022-00614-7" /><category term="mycito:includesQuotationFrom:10.1186/S13321-020-00448-1" /><summary type="html"><![CDATA[This document describes how you can improve the FAIR-ness of your project report by using compact identifiers. Of course, it can be applied to any other document too, and has been used in, for example, journal articles and online documentation already.]]></summary></entry><entry><title type="html">CiTO for blog citations</title><link href="https://chem-bla-ics.linkedchemistry.info/2025/02/08/cito-for-blog-citations.html" rel="alternate" type="text/html" title="CiTO for blog citations" /><published>2025-02-08T00:00:00+00:00</published><updated>2025-02-08T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2025/02/08/cito-for-blog-citations</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2025/02/08/cito-for-blog-citations.html"><![CDATA[<p>This is mostly a test, but if it turns out the way I hope it will, likely after a few iterations, it adds
support in my blog for CiTO intent annotations to the DOIs I cite. I
<a href="https://chem-bla-ics.linkedchemistry.info/2024/12/30/fair-blog-to-blog-citations.html">pondered about the earlier</a>.
In the <a href="https://www.jsonfeed.org/version/1.1/">JSON Feed</a> it should, at least for now, show up like this:</p>

<div class="language-json highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nl">"_references"</span><span class="p">:</span><span class="w"> </span><span class="p">[</span><span class="w">
  </span><span class="p">{</span><span class="w">
    </span><span class="nl">"url"</span><span class="p">:</span><span class="w"> </span><span class="s2">"https://doi.org/10.59350/er1mn-m5q69"</span><span class="p">,</span><span class="w">
    </span><span class="nl">"doi"</span><span class="p">:</span><span class="w"> </span><span class="s2">"10.59350/er1mn-m5q69"</span><span class="p">,</span><span class="w">
    </span><span class="nl">"cito"</span><span class="p">:</span><span class="w"> </span><span class="p">[</span><span class="w"> </span><span class="s2">"extends"</span><span class="w"> </span><span class="p">]</span><span class="w">
  </span><span class="p">}</span><span class="w">
</span><span class="p">]</span><span class="w">
</span></code></pre></div></div>]]></content><author><name>Egon Willighagen</name></author><category term="cito" /><category term="blog" /><category term="cito:extends:10.59350/er1mn-m5q69" /><category term="json" /><summary type="html"><![CDATA[This is mostly a test, but if it turns out the way I hope it will, likely after a few iterations, it adds support in my blog for CiTO intent annotations to the DOIs I cite. I pondered about the earlier. In the JSON Feed it should, at least for now, show up like this:]]></summary></entry><entry><title type="html">FAIR blog-to-blog citations</title><link href="https://chem-bla-ics.linkedchemistry.info/2024/12/30/fair-blog-to-blog-citations.html" rel="alternate" type="text/html" title="FAIR blog-to-blog citations" /><published>2024-12-30T00:00:00+00:00</published><updated>2024-12-30T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2024/12/30/fair-blog-to-blog-citations</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2024/12/30/fair-blog-to-blog-citations.html"><![CDATA[<p><a href="https://chem-bla-ics.linkedchemistry.info/2021/08/28/scholarly-journals-should-use-archived.html">Linkrot is real</a> and
<a href="https://doi.org/10.59348/1z1p2-nn569">digital preservation problematic</a>. One reason why I have
<a href="https://chem-bla-ics.linkedchemistry.info/2023/07/27/archiving-and-updating-my-blog.html">started migrating my blog</a>
to a more robust platform. That first step gave me version control. This summer my blog was
<a href="https://chem-bla-ics.linkedchemistry.info/2024/07/21/rogue-scholar-and-more.html">accepted to Rogue Scholar</a>.
That gave me DOIs. And an idea.</p>

<p>Things are coming together, and while commercial publishers (SpringerNature, Elsevier, MDPI, Frontiers, etc)
are focused on profit (“shareholder value”) instead of the community they serve, Open Science is providing
working, real-world, inexpensive, superior FAIR solutions for scientific dissemination. Maybe European
universities are not convinced yet (see <a href="https://doi.org/10.59350/1nmwy-nhk20">Björn’s post</a>), but it is
happening.</p>

<p>Two things that are happening are <a href="https://openalex.org/">OpenAlex</a> and <a href="https://opencitations.net/">OpenCitations</a>.
<a href="https://chem-bla-ics.linkedchemistry.info/tag/cito">CiTO adoption</a> not so much yet, but I am not giving up
yet. Simply because Open Science doesn’t go away and everything can be picked up tomorrow. Each holiday
I am picking up the Citation Typing Ontology and this holiday the
<a href="https://chem-bla-ics.linkedchemistry.info/2024/04/02/open-science-retreat-2.html">use of nanopublications for CiTO intent annotation</a>
of April this year.</p>

<p>Yesterday, I played with the nanopublication templates used by NanoDash, got to the triples of it, and
ended up using the web interface to create <a href="https://w3id.org/np/RA43F9EoOuzF0xoNUnCMNyFsfIqlsuWDdPHCnN0wCdCAw">a derived template</a>
from <a href="https://w3id.org/np/RAX_4tWTyjFpO6nz63s14ucuejd64t2mK3IBlkwZ7jjLo">Tobias’ template from April</a>.
What makes this nanopublication template special is that it uses <a href="https://github.com/SPAROntologies/cito">the CiTO ontology</a>.</p>

<p>The difference is that the original template used <code class="language-plaintext highlighter-rouge">ScholarlyWork</code> as type for the citing resource,
while the derivative uses <code class="language-plaintext highlighter-rouge">CreativeWork</code> from the schema.org namespace, allowing things like this:</p>

<ul>
  <li>article to software release: <a href="https://w3id.org/np/RAzmTPPM7v5Ilgvo-3aFRRZgdD3ImaUB434NtGlfI0G90">example nanopub</a></li>
  <li>article to blog: <a href="https://w3id.org/np/RAaRH1WhRgirso3JiTUJJ0XcBaRyHI6G4OZPdWBoIf17U">example nanopub</a></li>
  <li>blog to article: <a href="https://w3id.org/np/RAXL9q3jakrpaDh8oyVaNS1Y7JowmZm4tx4WcdIFMmg8g">example nanopub</a></li>
  <li>blog to blog: <a href="https://w3id.org/np/RAJOwolZUwUxuvPEhMFiQYHywJdWMWTlt_gnXoUbUBaYY">example nanopub</a></li>
</ul>

<p>The last three are possible because of the Rogue Scholar DOIs. Let’s continue with the fourth example,
the blog to blog citation. While an URL is a unique, global identifier, the digital preservation depends
on a lot of things. On the other hand, a DOI with the associated metadata is easier to preserve. For example,
because it can be spread more easily than the digital object itself.</p>

<p>So, when <a href="https://blog.front-matter.io/author/martin/">Martin Fenner</a> and
<a href="https://chem-bla-ics.linkedchemistry.info/2024/12/08/rich-l-apodaca.html">I</a>
<a href="https://chem-bla-ics.linkedchemistry.info/2024/12/27/archiving_blogs.html">started</a>
<a href="https://doi.org/10.53731/3c6pm-xbp04">archiving</a>
the <a href="https://depth-first.com/">Depth-First blog of Rich Apodaca</a> to digitally preserve his blog, <!-- keep link -->
it also automatically gave the blog posts DOIs. This makes the blog more FAIR, just like it does
for my blog. And being more FAIR, we can use the DOIs for other things too, like blog to blog
citations with CiTO intent annotation, as nanopublications.
(Technically, any Springer Nature journal can do this, but they found reasons to not do it.)</p>

<p>So, let’s take <a href="https://chem-bla-ics.linkedchemistry.info/2024/12/08/rich-l-apodaca.html">this blog post</a>.
I have today updated this to not use <code class="language-plaintext highlighter-rouge">depth-first.com</code> URLs but, following Martin’s example, use the DOIs <!-- keep link -->
for those posts instead.</p>

<p>And when I make a nanopublication out of this, I can add the citation intent, and then it looks like
<a href="https://w3id.org/np/RAmETOQXyoS5dYeP8yhJscOrAIimf1RHFnzG2GtziqIQ8">this</a>:</p>

<p><img src="/assets/images/nanopub1.png" alt="" /></p>

<p>For some reason, the DOIs do not show up as references as they do for this post for the DOIs of the
posts of Martin Paul Eve, Björn Brembs, and Martin Fenner. It does for
<a href="https://chem-bla-ics.linkedchemistry.info/2009/11/19/chempedia-rdf-1-sparql-end-point.html">this post citing Depth-First</a>.</p>

<p>So, from now on, I will use DOIs when citing other blog posts, and I hope many other blogs will
start using Rogue Scholar or some other service to generate DOIs for single blog posts.
I also have to figure out if I want to use DOIs to link to posts in my own blog.
And hopefully, OpenCitations will soon accept citations provided by nanopublications.
With or without CiTO intent annotations, whatever comes first. Oh, and I cannot wait to see
the citations who up in <a href="https://www.altmetric.com/">Altmetric.com</a> :)</p>

<p>Let’s see where this is going.</p>]]></content><author><name>Egon Willighagen</name></author><category term="cito" /><category term="blog" /><category term="publishing" /><category term="justdoi:10.59348/1z1p2-nn569" /><category term="justdoi:10.59350/1nmwy-nhk20" /><category term="justdoi:10.53731/3c6pm-xbp04" /><summary type="html"><![CDATA[Linkrot is real and digital preservation problematic. One reason why I have started migrating my blog to a more robust platform. That first step gave me version control. This summer my blog was accepted to Rogue Scholar. That gave me DOIs. And an idea.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/nanopub1.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/nanopub1.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Scholarly discussions through the eyes of CiTO (and Wikidata)</title><link href="https://chem-bla-ics.linkedchemistry.info/2024/08/11/scholarly-discussions.html" rel="alternate" type="text/html" title="Scholarly discussions through the eyes of CiTO (and Wikidata)" /><published>2024-08-11T00:00:00+00:00</published><updated>2024-08-11T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2024/08/11/scholarly-discussions</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2024/08/11/scholarly-discussions.html"><![CDATA[<p>Diabetes was already discussed in literature back in 1838-1839 (doi:<a href="https://doi.org/10.1016/S0140-6736(02)96038-1">10.1016/S0140-6736(02)96038-1</a>,
doi:<a href="10.1016/S0140-6736(02)96066-6">10.1016/S0140-6736(02)96066-6</a>, and doi:<a href="https://doi.org/10.1016/S0140-6736(02)83966-6">10.1016/S0140-6736(02)83966-6</a>).
These three papers show a short discussion. Papers were a lot shorter back in the days, and the discussion actually shows why papers are longer now
(tho I am not sure they really got sufficiently more reproducible, but that’s another discussion).</p>

<p>Traditional citation counts do not make this discussion obvious, but if we make our publishing sufficiently FAIR (it’s far from that, right now),
then we can get a step closer. For example, with the <a href="https://purl.org/spar/cito">Citation Typing Ontology</a>
we can show how the papers relate to each other:</p>

<p><img src="/assets/images/clannyNetwork.png" alt="" /></p>

<p>This network is based on public knowledge in <a href="https://wikidata.org/">Wikidata</a> and actually can be easily reproduced by anyone
with <a href="https://w.wiki/AtV9">this query</a>:</p>

<div class="language-sparql highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">#defaultView:Graph</span><span class="w">
</span><span class="k">SELECT</span><span class="w"> </span><span class="k">DISTINCT</span><span class="w"> </span><span class="nv">?focus1</span><span class="w"> </span><span class="nv">?focus1Label</span><span class="w"> </span><span class="nv">?focus2</span><span class="w"> </span><span class="nv">?focus2Label</span><span class="w"> </span><span class="nv">?edgeLabel</span><span class="w"> </span><span class="k">WHERE</span><span class="w"> </span><span class="p">{</span><span class="w">
  </span><span class="k">VALUES</span><span class="w"> </span><span class="nv">?focus1</span><span class="w"> </span><span class="p">{</span><span class="w"> </span><span class="nn">wd</span><span class="o">:</span><span class="ss">Q124174475</span><span class="w"> </span><span class="nn">wd</span><span class="o">:</span><span class="ss">Q124174776</span><span class="w"> </span><span class="nn">wd</span><span class="o">:</span><span class="ss">Q124174815</span><span class="w"> </span><span class="p">}</span><span class="w">
  </span><span class="k">VALUES</span><span class="w"> </span><span class="nv">?focus2</span><span class="w"> </span><span class="p">{</span><span class="w"> </span><span class="nn">wd</span><span class="o">:</span><span class="ss">Q124174475</span><span class="w"> </span><span class="nn">wd</span><span class="o">:</span><span class="ss">Q124174776</span><span class="w"> </span><span class="nn">wd</span><span class="o">:</span><span class="ss">Q124174815</span><span class="w"> </span><span class="p">}</span><span class="w">
  </span><span class="nv">?focus1</span><span class="w"> </span><span class="nn">p</span><span class="o">:</span><span class="ss">P2860</span><span class="w"> </span><span class="nv">?citation</span><span class="w"> </span><span class="p">.</span><span class="w">
  </span><span class="nv">?citation</span><span class="w"> </span><span class="nn">ps</span><span class="o">:</span><span class="ss">P2860</span><span class="w"> </span><span class="nv">?focus2</span><span class="w"> </span><span class="p">;</span><span class="w"> </span><span class="nn">pq</span><span class="o">:</span><span class="ss">P3712</span><span class="w"> </span><span class="nv">?edge</span><span class="w"> </span><span class="p">.</span><span class="w">
  </span><span class="nv">?edge</span><span class="w"> </span><span class="nn">rdfs</span><span class="o">:</span><span class="ss">label</span><span class="w"> </span><span class="nv">?edgeLabel</span><span class="w"> </span><span class="p">.</span><span class="w"> </span><span class="k">FILTER</span><span class="p">(</span><span class="nb">LANG</span><span class="p">(</span><span class="nv">?edgeLabel</span><span class="p">)</span><span class="w"> </span><span class="p">=</span><span class="w"> </span><span class="s2">"en"</span><span class="p">)</span><span class="w">
  </span><span class="k">SERVICE</span><span class="w"> </span><span class="nn">wikibase</span><span class="o">:</span><span class="ss">label</span><span class="w"> </span><span class="p">{</span><span class="w"> </span><span class="nn">bd</span><span class="o">:</span><span class="ss">serviceParam</span><span class="w"> </span><span class="nn">wikibase</span><span class="o">:</span><span class="ss">language</span><span class="w"> </span><span class="s2">"[AUTO_LANGUAGE],mul,en"</span><span class="p">.</span><span class="w"> </span><span class="p">}</span><span class="w">
</span><span class="p">}</span><span class="w">
</span></code></pre></div></div>

<p>The two “focus” values are an identical list of the articles I want to see. To make sure to get citations between all of them,
I have to give them twice.</p>

<p>In the above example I have used <code class="language-plaintext highlighter-rouge">VALUES</code> for this, but I can also generate the controlled list of items between the citations
I want to visualize with any SPARQL fragment too. <a href="https://edu.nl/y38rg">This query</a> does that (or here as
<a href="https://gist.github.com/egonw/b5fb7ae550c1597ff247f70cee8063c8">GitHub Gist</a>, but something else too: it uses a trick I learned
from <a href="https://scholia.toolforge.org/author/Q20980928">Finn Nielsen</a> from <a href="https://github.com/WDscholia/scholia/commit/d34dee85bc12575e0f1891c4e663ef8e2c450083">this patch</a>
from the <a href="https://scholia.toolforge.org/">Scholia</a> project (doi:<a href="https://doi.org/10.1007/978-3-319-70407-4_36">10.1007/978-3-319-70407-4_36</a>)).</p>

<p>Here, I select the articles by replacing the above <code class="language-plaintext highlighter-rouge">VALUES</code> lines with this fragment (<code class="language-plaintext highlighter-rouge">P50</code> is ‘author’ and <code class="language-plaintext highlighter-rouge">Q20895241</code> is me in Wikidata):</p>

<pre><code class="language-SPARQL">  ?focus1 wdt:P50 wd:Q20895241 .
  ?focus2 wdt:P50 wd:Q20895241 .
</code></pre>

<p>And, to be honest, then I get this network which is much richer than I expected:</p>

<p><img src="/assets/images/willighagen_cito.png" alt="" /></p>

<p>I wonder how far we can push this. Can we also do this for the <a href="https://scholia.toolforge.org/venue/Q6294930">Journal of Cheminformatics</a>?
After all, this journal had a <a href="https://www.biomedcentral.com/collections/cito">CiTO Pilot</a> and, indeed,
<a href="https://edu.nl/hk8xy">the results do not disappoint</a>! All I had to do was replace the focus section:</p>

<pre><code class="language-SPARQL">  ?focus1 wdt:P1433 wd:Q6294930 .
  ?focus2 wdt:P1433 wd:Q6294930 .
</code></pre>]]></content><author><name>Egon Willighagen</name></author><category term="cito" /><category term="wikidata" /><category term="justdoi:10.1016/S0140-6736(02)96038-1" /><category term="justdoi:10.1016/S0140-6736(02)96066-6" /><category term="justdoi:10.1016/S0140-6736(02)83966-6" /><category term="doi:10.1007/978-3-319-70407-4_36" /><summary type="html"><![CDATA[Diabetes was already discussed in literature back in 1838-1839 (doi:10.1016/S0140-6736(02)96038-1, doi:10.1016/S0140-6736(02)96066-6, and doi:10.1016/S0140-6736(02)83966-6). These three papers show a short discussion. Papers were a lot shorter back in the days, and the discussion actually shows why papers are longer now (tho I am not sure they really got sufficiently more reproducible, but that’s another discussion).]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/clannyNetwork.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/clannyNetwork.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">CiTO updates: Wakefield and WikiPathways</title><link href="https://chem-bla-ics.linkedchemistry.info/2024/08/07/cito-updates.html" rel="alternate" type="text/html" title="CiTO updates: Wakefield and WikiPathways" /><published>2024-08-07T00:00:00+00:00</published><updated>2024-08-07T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2024/08/07/cito-updates</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2024/08/07/cito-updates.html"><![CDATA[<p>This summer I am trying to finish up some smaller projects that I did not have time for to finish, with
mixed successes. I am combing this with a nice Dutch staycation, and I already cycled in
<a href="https://en.wikipedia.org/wiki/Overijssel">Overijssel</a> and in south-west <a href="https://en.wikipedia.org/wiki/Friesland">Friesland</a>
and learning about their histories.
But this post is about an update on my Citation Typing Ontology use cases. And I have to say,
a <a href="https://www.youtube.com/watch?v=1kD7jkyDr3s">mention by Silvio Peroni</a> is pretty awesome, thanks!</p>

<p>First, the bad news. I still did not get around to the following to tasks I have. First, I need to write up a
step-by-step guide how to create <a href="https://chem-bla-ics.linkedchemistry.info/2024/04/02/open-science-retreat-2.html">CiTO nanopublications</a>
and matching draft article. Second, I still need to work out how to update the JATS workflow for
<a href="https://chem-bla-ics.linkedchemistry.info/2021/11/15/biohackathon-europe-2021-1-cito.html">CiTO annotation in BioHackrXiv</a>.</p>

<h2 id="wakefield">Wakefield</h2>

<p>Let’s first start with a dataset. Peroni mentioned a study they did (<a href="https://doi.org/10.1007/S11192-021-04097-5">10.1007/S11192-021-04097-5</a>)
into why the famous Wakefield paper
(doi:<a href="https://doi.org/10.1016/S0140-6736(97)11096-0">10.1016/S0140-6736(97)11096-0</a>) is cited. They published
their data set on Zenodo (doi:<a href="https://doi.org/10.5281/zenodo.13166142">10.5281/zenodo.13166142</a>) with CCZero,
so I imported it into <a href="https://wikidata.org/">Wikidata</a>. Well, at least the citations
of articles already in Wikidata. I used a Bacting (doi:<a href="https://doi.org/10.21105/joss.02558">10.21105/joss.02558</a>)
<a href="https://gist.github.com/egonw/379c72a49517716712b70bdee0d845ce">script</a> and it actually was quite short.
In the end, this added some 500 new citation intentions to Wikidata, now at almost <a href="https://scholia.toolforge.org/cito/">2000</a>.
This is also the third dataset with explicit CiTO intention annotations (see also
<a href="https://chem-bla-ics.linkedchemistry.info/2023/04/02/cito-updates-4-annotations-in-datasets.html">this post</a>).</p>

<p>This is what the <a href="https://scholia.toolforge.org/work/Q28264479#cito-incoming">CiTO section of the Wakefield paper</a>
in <a href="https://scholia.toolforge.org/">Scholia</a> (doi:<a href="https://doi.org/10.1007/978-3-319-70407-4_36">10.1007/978-3-319-70407-4_36</a>)
now looks like:</p>

<p><img src="/assets/images/wakefieldCitations.png" alt="" /></p>

<h2 id="wikipathways">WikiPathways</h2>

<p>A second thing I want to show is a potentional CiTO intention annotation dataset. Almost two years ago
<a href="https://qoto.org/@xanderpico">Alex Pico</a> started a new <a href="https://wikipathways.org/">WikiPathways</a>
feature as part of the new website (doi:<a href="https://doi.org/10.1093/NAR/GKAD960">10.1093/NAR/GKAD960</a>)):
<a href="https://github.com/wikipathways/wikipathways-database/commit/97f7df0057d312f0c332a9ff290c11684bf252d5">a list of citations to specific pathways</a>
(in WikiPathways). Alex’ setup is fully automated and using <a href="https://www.ncbi.nlm.nih.gov/pmc/">PubMed Central</a>
and find mentions in figure captions:</p>

<p><em>Beyond citations to previous WikiPathways journal articles, we have identified 1228 mentions of a total of 582
unique WikiPathways pathway model identifiers, e.g. WP4846, in PubMedCentral articles over the past 13 years.</em></p>

<p>The file format is a pretty basic YAML file:</p>

<p><img src="/assets/images/citedin_yaml.png" alt="" /></p>

<p>Additional mentions are found in the main text and tables in the article. These are not always picked up.
These can be added manually. Over the past months and the past two weeks particularly, I have been adding
additional mentions, not listed yet. We now passed 1500 mentions but I cannot easily give the other
statistics.</p>

<p>BTW, anyone can add these citations with the ‘edit’ pencil and some Microsoft GitHub editing (but
as far as I am concerned, please feel free to also just mention the paper on the
<a href="https://github.com/wikipathways/wikipathways-help/discussions">WikiPathways Community Forum</a>):</p>

<p><img src="/assets/images/citedin_website.png" alt="" /></p>

<p>So, in the next few days I plan to do two things: 1. generate RDF for the YAML file and make that part of the
<a href="https://data.wikipathways.org/current/rdf/">monthly WikiPathways RDF release</a>; 2. extract citations and
offer this back to <a href="https://opencitations.net/">the OpenCitations project</a>; and, 3. add the citations
into Wikidata. Of course, all with <code class="language-plaintext highlighter-rouge">cito:usesDataFrom</code> :)</p>

<p>There is a fourth things that I am still thinking about. I can also use the above data the annotation
citations to the WikiPathways papers if they also mention a WikiPathways identifier as <code class="language-plaintext highlighter-rouge">cito:usesDataFrom</code>,
but I cannot fully oversee the implications of that. What do you think?</p>]]></content><author><name>Egon Willighagen</name></author><category term="cito" /><category term="wikipathways" /><category term="wikidata" /><category term="justdoi:10.1016/S0140-6736(97)11096-0" /><category term="doi:10.21105/JOSS.02558" /><category term="doi:10.1007/978-3-319-70407-4_36" /><category term="justdoi:10.5281/ZENODO.13166142" /><category term="doi:10.1093/NAR/GKAD960" /><category term="justdoi:10.1007/S11192-021-04097-5" /><summary type="html"><![CDATA[This summer I am trying to finish up some smaller projects that I did not have time for to finish, with mixed successes. I am combing this with a nice Dutch staycation, and I already cycled in Overijssel and in south-west Friesland and learning about their histories. But this post is about an update on my Citation Typing Ontology use cases. And I have to say, a mention by Silvio Peroni is pretty awesome, thanks!]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/wakefieldCitations.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/wakefieldCitations.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Open Science Retreat #2: CiTO Nanopublications</title><link href="https://chem-bla-ics.linkedchemistry.info/2024/04/02/open-science-retreat-2.html" rel="alternate" type="text/html" title="Open Science Retreat #2: CiTO Nanopublications" /><published>2024-04-02T00:00:00+00:00</published><updated>2024-04-02T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2024/04/02/open-science-retreat-2</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2024/04/02/open-science-retreat-2.html"><![CDATA[<p>During the <a href="http://chem-bla-ics.linkedchemistry.info/2024/03/31/open-science-retreat-1.html">Open Science Retreat</a> I organized
a short session where we looking into typing citation intentions using a new nanopublication template. First, let’s describe
nanopublications (originally used in doi:<a href="https://doi.org/10.3233/ISU-2010-0613">10.3233/ISU-2010-0613</a>) a bit.
Scholia gives <a href="https://qlever.scholia.wiki/topic/Q57814310">a nice overview of (macro?)publications on the topic <i class="fa-solid fa-recycle fa-xs"></i></a>.
The <a href="https://nanopub.net/">nanopub.net</a>
website describes that <em>[a nanopublication is a small knowledge graph snippet with metadata that is treated as an
independent (scientific) publication.]</em>. The knowledge graph, it continues, can be anything from an opinion to the link
between a disease and a gene (doi:<a href="https://doi.org/10.1109/ESCIENCE.2018.00024">10.1109/ESCIENCE.2018.00024</a>).</p>

<p>Now, in this post I will document an update of how we can use nanopublications for citation intention annotation, and
compare this to existing solutions. I have been collecting and indexing the CiTO intention annotations in Wikidata and
visualizing the corpus with Scholia at <a href="https://qlever.scholia.wiki/cito/">qlever.scholia.wiki/cito/ <i class="fa-solid fa-recycle fa-xs"></i></a>. There are
currently 22 journal articles with explicit CiTO annoation, largely thanks to a <a href="https://www.biomedcentral.com/collections/cito">Journal of Cheminformatics pilot</a>
(e.g. see doi:<a href="https://doi.org/10.1186/s13321-023-00683-2">10.1186/s13321-023-00683-2</a>). Recently,
the preprint/report server <a href="https://biohackrxiv.org/discover">BioHackrXiv</a> started
<a href="https://github.com/biohackrxiv/publication-template">CiTO support</a> too, also visible in the statistics
on Scholia with another 17 papers. A third source is data sets from bibliometric-like studies, as explained
in <a href="https://chem-bla-ics.linkedchemistry.info/2023/04/02/cito-updates-4-annotations-in-datasets.html">this post <i class="fa-solid fa-recycle fa-xs"></i></a>. Nanopublications
would be a fourth solution.</p>

<p>So, why another solutions? Like the datasets, assuming DataCite approaches, have clear provenance, but the overhead
of and needed time for creating a dataset with citation intent annotations can be limiting. And because nanopublications
can be linked to ORCID identifiers, we can even discover which citation intent annotations are created by the original
authors of articles. Another advantage is that nanopubs are basically RDF and we can query them easily, allowing
the citation intentions to migrate to Wikidata. Scholia already saw an update to recognize nanopublications as
a unique kind reference (see the new Wikidata property <a href="https://www.wikidata.org/wiki/Property:P12545">Nanopublication identifier (P12545)</a>).</p>

<h1 id="nanodash-template">NanoDash template</h1>

<p>So, if we can make it easy for people to define nanopublications with CiTO citation intent annotations, than we can
start formalizing intent annotations from a much wider range of use cases. For example, we can annotate historically
important discussions. Anyone can retrospectively annotate all their own articles, making them more FAIR. And if we
use DOI links, then it no longer is limited to journal articles, but we can use of for software and data citations too.
This is where <a href="https://w3id.org/np/RAX_4tWTyjFpO6nz63s14ucuejd64t2mK3IBlkwZ7jjLo">a recent template</a> comes in created by
<a href="https://orcid.org/0000-0002-1267-0234">Tobias Kuhn</a>, one of the main nanopub developers:</p>

<p><img src="/assets/images/citoPub.png" alt="" /></p>

<p>This nanopublication template defines the minimal needs of the assumptions, along with useful provenance and nanopub
info. Basically, the assertion defines that one DOI is a ScholarlyWork and using the CiTO, defines that it cites
one or more article works (with DOI). For each citations, one can select any of the known CiTO intent types,
e.g. ‘extends’ or ‘uses method’ in, as in <a href="https://w3id.org/np/RA6Rxk1sSOSWxM7A6gW4SjJZRVt4fbY6nShPTAbQ8kce8">this nanopublication</a>
created with this template:</p>

<p><img src="/assets/images/citoPub2.png" alt="" /></p>

<h2 id="sparql-ing-cito-annotations">SPARQL-ing CiTO annotations</h2>

<p>Besides the template, Tobias also started a SPARQL query to which I added restrictions that the citing and cited
resources needs to have a DOI, giving us <a href="https://query.knowledgepixels.com/tools/type/2c1cce3f3152738c1009d59251409392aaaa3b0324bcb5fdfb4b7b944b8f0c18/yasgui.html#query=prefix+rdfs%3A+%3Chttp%3A%2F%2Fwww.w3.org%2F2000%2F01%2Frdf-schema%23%3E%0Aprefix+np%3A+%3Chttp%3A%2F%2Fwww.nanopub.org%2Fnschema%23%3E%0Aprefix+npa%3A+%3Chttp%3A%2F%2Fpurl.org%2Fnanopub%2Fadmin%2F%3E%0Aprefix+npx%3A+%3Chttp%3A%2F%2Fpurl.org%2Fnanopub%2Fx%2F%3E%0Aprefix+xsd%3A+%3Chttp%3A%2F%2Fwww.w3.org%2F2001%2FXMLSchema%23%3E%0Aprefix+dct%3A+%3Chttp%3A%2F%2Fpurl.org%2Fdc%2Fterms%2F%3E%0A%0Aselect+%3Fnp+%3Flabel+%3Fsubj+%3Fcitationrel+%3Fobj+%3Fdate+where+%7B%0A++graph+npa%3Agraph+%7B%0A++++%3Fnp+npa%3AhasValidSignatureForPublicKey+%3Fpubkey+.%0A++++%3Fnp+dct%3Acreated+%3Fdate+.%0A++++%3Fnp+np%3AhasAssertion+%3Fassertion+.%0A++++optional+%7B+%3Fnp+rdfs%3Alabel+%3Flabel+.+%7D%0A++++filter+not+exists+%7B+%3Fnpx+npx%3Ainvalidates+%3Fnp+%3B+npa%3AhasValidSignatureForPublicKey+%3Fpubkey+.+%7D%0A++++filter+not+exists+%7B+%3Fnp+npx%3AhasNanopubType+npx%3AExampleNanopub+.+%7D%0A++%7D%0A++graph+%3Fassertion+%7B%0A++++%3Fsubj+%3Fcitationrel+%3Fobj+.%0A++++filter(regex(str(%3Fcitationrel)%2C+%22%5Ehttp%3A%2F%2Fpurl.org%2Fspar%2Fcito%2F.*%24%22))%0A++++filter(regex(str(%3Fsubj)%2C+%22doi.org%2F10%22))%0A++++filter(regex(str(%3Fobj)%2C+%22doi.org%2F10%22))%0A++%7D%0A%7D%0A++&amp;contentTypeConstruct=text%2Fturtle&amp;contentTypeSelect=application%2Fsparql-results%2Bjson&amp;endpoint=%2Frepo%2Ftype%2F2c1cce3f3152738c1009d59251409392aaaa3b0324bcb5fdfb4b7b944b8f0c18&amp;requestMethod=POST&amp;tabTitle=Query&amp;headers=%7B%7D&amp;outputFormat=table">this query</a>:</p>

<div class="language-sparql highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">prefix</span><span class="w"> </span><span class="nn">rdfs</span><span class="o">:</span><span class="w"> </span><span class="nn">&lt;http://www.w3.org/2000/01/rdf-schema#&gt;</span><span class="w">
</span><span class="k">prefix</span><span class="w"> </span><span class="nn">np</span><span class="o">:</span><span class="w"> </span><span class="nn">&lt;http://www.nanopub.org/nschema#&gt;</span><span class="w">
</span><span class="k">prefix</span><span class="w"> </span><span class="nn">npa</span><span class="o">:</span><span class="w"> </span><span class="nn">&lt;http://purl.org/nanopub/admin/&gt;</span><span class="w">
</span><span class="k">prefix</span><span class="w"> </span><span class="nn">npx</span><span class="o">:</span><span class="w"> </span><span class="nn">&lt;http://purl.org/nanopub/x/&gt;</span><span class="w">
</span><span class="k">prefix</span><span class="w"> </span><span class="nn">xsd</span><span class="o">:</span><span class="w"> </span><span class="nn">&lt;http://www.w3.org/2001/XMLSchema#&gt;</span><span class="w">
</span><span class="k">prefix</span><span class="w"> </span><span class="nn">dct</span><span class="o">:</span><span class="w"> </span><span class="nn">&lt;http://purl.org/dc/terms/&gt;</span><span class="w">

</span><span class="k">select</span><span class="w"> </span><span class="nv">?np</span><span class="w"> </span><span class="nv">?label</span><span class="w"> </span><span class="nv">?subj</span><span class="w"> </span><span class="nv">?citationrel</span><span class="w"> </span><span class="nv">?obj</span><span class="w"> </span><span class="nv">?date</span><span class="w"> </span><span class="k">where</span><span class="w"> </span><span class="p">{</span><span class="w">
  </span><span class="k">graph</span><span class="w"> </span><span class="nn">npa</span><span class="o">:</span><span class="ss">graph</span><span class="w"> </span><span class="p">{</span><span class="w">
    </span><span class="nv">?np</span><span class="w"> </span><span class="nn">npa</span><span class="o">:</span><span class="ss">hasValidSignatureForPublicKey</span><span class="w"> </span><span class="nv">?pubkey</span><span class="w"> </span><span class="p">.</span><span class="w">
    </span><span class="nv">?np</span><span class="w"> </span><span class="nn">dct</span><span class="o">:</span><span class="ss">created</span><span class="w"> </span><span class="nv">?date</span><span class="w"> </span><span class="p">.</span><span class="w">
    </span><span class="nv">?np</span><span class="w"> </span><span class="nn">np</span><span class="o">:</span><span class="ss">hasAssertion</span><span class="w"> </span><span class="nv">?assertion</span><span class="w"> </span><span class="p">.</span><span class="w">
    </span><span class="k">optional</span><span class="w"> </span><span class="p">{</span><span class="w"> </span><span class="nv">?np</span><span class="w"> </span><span class="nn">rdfs</span><span class="o">:</span><span class="ss">label</span><span class="w"> </span><span class="nv">?label</span><span class="w"> </span><span class="p">.</span><span class="w"> </span><span class="p">}</span><span class="w">
    </span><span class="k">filter</span><span class="w"> </span><span class="k">not</span><span class="w"> </span><span class="k">exists</span><span class="w"> </span><span class="p">{</span><span class="w"> </span><span class="nv">?npx</span><span class="w"> </span><span class="nn">npx</span><span class="o">:</span><span class="ss">invalidates</span><span class="w"> </span><span class="nv">?np</span><span class="w"> </span><span class="p">;</span><span class="w"> </span><span class="nn">npa</span><span class="o">:</span><span class="ss">hasValidSignatureForPublicKey</span><span class="w"> </span><span class="nv">?pubkey</span><span class="w"> </span><span class="p">.</span><span class="w"> </span><span class="p">}</span><span class="w">
    </span><span class="k">filter</span><span class="w"> </span><span class="k">not</span><span class="w"> </span><span class="k">exists</span><span class="w"> </span><span class="p">{</span><span class="w"> </span><span class="nv">?np</span><span class="w"> </span><span class="nn">npx</span><span class="o">:</span><span class="ss">hasNanopubType</span><span class="w"> </span><span class="nn">npx</span><span class="o">:</span><span class="ss">ExampleNanopub</span><span class="w"> </span><span class="p">.</span><span class="w"> </span><span class="p">}</span><span class="w">
  </span><span class="p">}</span><span class="w">
  </span><span class="k">graph</span><span class="w"> </span><span class="nv">?assertion</span><span class="w"> </span><span class="p">{</span><span class="w">
    </span><span class="nv">?subj</span><span class="w"> </span><span class="nv">?citationrel</span><span class="w"> </span><span class="nv">?obj</span><span class="w"> </span><span class="p">.</span><span class="w">
    </span><span class="k">filter</span><span class="p">(</span><span class="nb">regex</span><span class="p">(</span><span class="nb">str</span><span class="p">(</span><span class="nv">?citationrel</span><span class="p">),</span><span class="w"> </span><span class="s2">"^http://purl.org/spar/cito/.*$"</span><span class="p">))</span><span class="w">
    </span><span class="k">filter</span><span class="p">(</span><span class="nb">regex</span><span class="p">(</span><span class="nb">str</span><span class="p">(</span><span class="nv">?subj</span><span class="p">),</span><span class="w"> </span><span class="s2">"doi.org/10"</span><span class="p">))</span><span class="w">
    </span><span class="k">filter</span><span class="p">(</span><span class="nb">regex</span><span class="p">(</span><span class="nb">str</span><span class="p">(</span><span class="nv">?obj</span><span class="p">),</span><span class="w"> </span><span class="s2">"doi.org/10"</span><span class="p">))</span><span class="w">
  </span><span class="p">}</span><span class="w">
</span><span class="p">}</span><span class="w">
</span></code></pre></div></div>

<p>This includes 6 citation intentions defined by 4 nanopublications added during the Open Science Retreat:</p>

<ul>
  <li><a href="https://w3id.org/np/RAUjZE1JMu1GAvUQ_fZ4yc9-7sOSCT9xbeS0wYznkKtYk">RAUjZE1JMu</a> by <a href="https://nanodash.knowledgepixels.com/explore?id=https%3A%2F%2Forcid.org%2F0000-0002-7192-1486">me</a> for a paper by Marija Purgar</li>
  <li><a href="https://nanodash.knowledgepixels.com/explore?id=RAXgI--5gcKskgrnOI1XZoA4b3hu9RbNj3bcc2Zxeos7c">RAXgI–5gc</a> by <a href="https://nanodash.knowledgepixels.com/explore?id=https%3A%2F%2Forcid.org%2F0000-0003-2408-7588">Christian Meesters</a></li>
  <li><a href="https://nanodash.knowledgepixels.com/explore?id=RATZNhd3l_jN0y8GEi8mLIqy-uVV8tiUZIq2RJtkq6G8A">RATZNhd3l_j</a> by <a href="https://nanodash.knowledgepixels.com/explore?id=https%3A%2F%2Forcid.org%2F0000-0003-4285-690X">Taichi Oichi</a></li>
  <li><a href="https://nanodash.knowledgepixels.com/explore?id=RA6Q6wxSYyWfA3XwpOBqSNFKgQpM7ZgdVBoU2kSD-CFjw">RA6Q6wxSYy</a> by <a href="https://nanodash.knowledgepixels.com/explore?id=https%3A%2F%2Forcid.org%2F0000-0003-1559-1838">Niklas Hohmann</a></li>
</ul>

<h1 id="from-nanopublications-to-wikidata">From nanopublications to Wikidata</h1>

<p>Now, this query also provides me with enough information to propagate the citation intent (a fact?) to Wikidata
and cite the original nanopublication as reference. With a variation of the above SPARQL query, I can get the
five most recent new nanopublications, convert them to QuickStatements, and then enjoy them in Wikidata. This
is written up in <a href="https://github.com/egonw/ons-wikidata/blob/main/Nanopubs/createQS.groovy">this Bacting script</a>.</p>

<p>The script needs to handle some situations. For example, it will not add items for DOIs not already in Wikidata.
So, if neither of the two DOIs are known in Wikidata, then nothing gets added. If they both are, then it will
add the citation intent. There are alternative solutions, but in practice that doesn’t matter and the QuickStatements
is in all situations the same, and QuickStatements will only add the new information.</p>

<p>This is what it will <a href="https://www.wikidata.org/wiki/Q113312162#P2860">look like in Wikidata</a>:</p>

<p><img src="/assets/images/citoPub3.png" alt="" /></p>

<p>And this is <a href="https://qlever.scholia.wiki/cito/#articles">what it looks <i class="fa-solid fa-recycle fa-xs"></i></a> (yellow) when we compare the contributions
from nanopublications now with the other sources:</p>

<p><img src="/assets/images/citoPubs4.png" alt="" /></p>]]></content><author><name>Egon Willighagen</name></author><category term="osr24nl" /><category term="openscience" /><category term="cito" /><category term="nanopub" /><category term="justdoi:10.3233/ISU-2010-0613" /><category term="doi:10.1109/ESCIENCE.2018.00024" /><category term="doi:10.1186/S13321-023-00683-2" /><category term="wikidata" /><summary type="html"><![CDATA[During the Open Science Retreat I organized a short session where we looking into typing citation intentions using a new nanopublication template. First, let’s describe nanopublications (originally used in doi:10.3233/ISU-2010-0613) a bit. Scholia gives a nice overview of (macro?)publications on the topic . The nanopub.net website describes that [a nanopublication is a small knowledge graph snippet with metadata that is treated as an independent (scientific) publication.]. The knowledge graph, it continues, can be anything from an opinion to the link between a disease and a gene (doi:10.1109/ESCIENCE.2018.00024).]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/citoPub.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/citoPub.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Open Science Retreat #1: impressions</title><link href="https://chem-bla-ics.linkedchemistry.info/2024/03/31/open-science-retreat-1.html" rel="alternate" type="text/html" title="Open Science Retreat #1: impressions" /><published>2024-03-31T00:00:00+00:00</published><updated>2024-03-31T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2024/03/31/open-science-retreat-1</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2024/03/31/open-science-retreat-1.html"><![CDATA[<p>Last week I attended the <a href="https://openscienceretreat.eu/">Open Science Retreat</a> (<a href="https://hashtags-hub.toolforge.org/osr24nl">#osr24nl</a>)
in a quite and relaxing region in North-Holland. The meeting was how I like all meetings to be (and I count myself lucky many of my meetings
are like this): open, welcoming, constructive, diverse, and intellectually challenging. Not all scientific meetings are like this
and it is easy to end up going to obligatory meetings where the discussions are of a different level. Therefore, great thanks to
the organizers, but also to all participants, that showed not just to have a hearth for open science (getting pretty common),
but also a drive to advocate for open science. Finally, I like to thank the people that joined me in creating nanopublications for
CiTO annotations (will blog about that later), and <a href="https://twitter.com/marija_purgar/status/1773745895508451573">to Sadik and Marija</a>
with whom we worked on exploring using Wikibase for capturing knowledge about research waste in ecology (more about that later too).</p>]]></content><author><name>Egon Willighagen</name></author><category term="osr24nl" /><category term="openscience" /><category term="wikibase" /><category term="cito" /><category term="nanopub" /><summary type="html"><![CDATA[Last week I attended the Open Science Retreat (#osr24nl) in a quite and relaxing region in North-Holland. The meeting was how I like all meetings to be (and I count myself lucky many of my meetings are like this): open, welcoming, constructive, diverse, and intellectually challenging. Not all scientific meetings are like this and it is easy to end up going to obligatory meetings where the discussions are of a different level. Therefore, great thanks to the organizers, but also to all participants, that showed not just to have a hearth for open science (getting pretty common), but also a drive to advocate for open science. Finally, I like to thank the people that joined me in creating nanopublications for CiTO annotations (will blog about that later), and to Sadik and Marija with whom we worked on exploring using Wikibase for capturing knowledge about research waste in ecology (more about that later too).]]></summary></entry><entry><title type="html">CiTO updates #4: annotations in datasets</title><link href="https://chem-bla-ics.linkedchemistry.info/2023/04/02/cito-updates-4-annotations-in-datasets.html" rel="alternate" type="text/html" title="CiTO updates #4: annotations in datasets" /><published>2023-04-02T00:00:00+00:00</published><updated>2023-04-02T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2023/04/02/cito-updates-4-annotations-in-datasets</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2023/04/02/cito-updates-4-annotations-in-datasets.html"><![CDATA[<p>Okay, <a href="https://jcheminf.biomedcentral.com/articles/10.1186/s13321-023-00683-2">the Pilot</a>
<a href="https://jcheminf.biomedcentral.com/articles/10.1186/s13321-023-00684-1">is over</a> ending with 17 papers, 16 of which have CiTO
annotations (and so far 4 J.Cheminform. <a href="https://doi.org/10.1186/s13321-022-00656-x">papers</a>
<a href="https://doi.org/10.1186/s13321-022-00673-w">after</a> <a href="https://doi.org/10.1186/s13321-022-00677-6">the</a>
<a href="https://doi.org/10.1186/s13321-023-00701-3">pilot</a>), but my interest in the
<a href="http://purl.org/spar/cito">Citation Typing Ontology</a> continues and we just need
<a href="https://chem-bla-ics.blogspot.com/2023/02/citation-typing-progress-but-we-need.html">more adoption</a>.</p>

<p><strong>Datasets as source of annotations</strong></p>

<p>So, here’s a quick <a href="https://wikidata.org/">Wikidata</a> update. I have been using Wikidata as infrastructure to collect and share CiTO
annotations (see also the below “Scholia patch” posts). Some time ago I recovered my CiteULike CiTO annotations and made this
<a href="https://scholia.toolforge.org/work/Q115470140">available on Zenodo</a> (doi:<a href="https://doi.org/10.5281/ZENODO.7368209">10.5281/zenodo.7368209</a>).</p>

<p>And while thinking about datasets with CiTO annotations, I found two other datasets. One was from an article in Portuguese and one from an
<a href="https://scholia.toolforge.org/work/Q117369886">article by Peroni et al.</a> with
<a href="https://zenodo.org/record/6885109">this data file</a>. That data file is actually a zip, but inside the zip file is a CSV file with three
interesting columns: <code class="language-plaintext highlighter-rouge">cited_doi</code>, <code class="language-plaintext highlighter-rouge">citing_doi</code>, and <code class="language-plaintext highlighter-rouge">intext_citation.intent</code>. There are many more columns and I can highly recommend browsing
them. But these are the three I need to add data to Wikidata. The third column is free text, but using the CiTO for labels, making it
relatively easy to convert to <a href="https://w.wiki/62sR">citation intentions from Wikidata</a>
(PS, thanks to <a href="https://www.wikidata.org/wiki/User:Fvtvr3r">Fvtvr3r</a> for adding more!).</p>

<p>So, I had a cleaned file and started writing a Groovy Bioclipse script using <a href="https://doi.org/10.21105/joss.02558">Bacting</a>.
It basically does a few things: extract all DOIs, check which ones are in Wikidata, analyze the <code class="language-plaintext highlighter-rouge">intext_citation.intent</code> column content,
and then generate QuickStatements (see <a href="https://gist.github.com/egonw/f74fd3bc1f6361434b042a4cac2a8089">this gist</a>). Out of the 600
lines from the input, it creates some 200 new CiTO-annotated citations in Wikidata between
<a href="https://scholia.toolforge.org/work/Q117357537#statements">some 150 article pairs</a>:</p>

<p><img src="/assets/images/Screenshot_20230402_084711.png" alt="" /></p>

<p>The ability to include CiTO annotations from datasets is another welcome boost for the CiTO statistics in Wikidata.
<a href="https://w.wiki/6XQf">This SPARQL query</a> shows an overview of sources that support the CiTO intention annotation, but note that
a claim with a CiTO intention may also have CrossRef, PubMed, and COCI as reference. In those cases, they are primarily for
the citations and not the intention.</p>

<p>There are <a href="https://scholar.social/@egonw/110124747053293502">now</a> (the <a href="https://scholia.toolforge.org/cito/#statistics">latest stats are here</a>)
<strong>1202 citation intention</strong> annotations in Wikidata for 992 citations from <strong>405 articles in 199 venues</strong>. Of these 27 articles have
explicit annotations in the article itself and are found in 4 venues, two journals and two preprint servers). These annotated citations
are to 510 articles in 190 different venues. <a href="https://github.com/WDscholia/scholia/pull/2271">This Scholia patch</a> will add a new
statistics, the number of datasets providing citation intentions, of which there are (as discussed)
<a href="https://scholia.toolforge.org/topic/Q115470140">currently</a> <a href="https://scholia.toolforge.org/work/Q117357537">two</a> in Wikidata.
The latter two provide intentions for the majority of articles and are depicted in yellow in the below overview.</p>

<p><img src="/assets/images/Screenshot_20230402_085317.png" alt="" /></p>

<p>With an annotation in <a href="https://www.wikidata.org/wiki/Q27638524">an 1938 article by Alan Turing</a>! I ran into this article in November 2011
noting an apparent duplicate title in his article list. I turned out an earlier article had a correction with the same name.
I added <a href="https://www.wikidata.org/w/index.php?title=Q27638524&amp;diff=1527020358&amp;oldid=984628387&amp;diffmode=source">this clarification</a>:</p>

<p><img src="/assets/images/Screenshot_20230402_090600.png" alt="" /></p>

<p>This is very trivial citation intention data that publishers could provide as open data.</p>

<p>Okay, that will do for today. There are actually some really interesting things in the pipeline, but I will have to write about that later. I have some deadlines I should start looking at. Below is some extra reading.
Some more history</p>

<ul>
  <li>2021: <a href="https://chem-bla-ics.linkedchemistry.info/2021/11/15/biohackathon-europe-2021-1-cito.html">BioHackathon Europe 2021 #1: CiTO annotations in BioHackrXiv <i class="fa-solid fa-recycle fa-xs"></i></a></li>
  <li>2021: <a href="https://chem-bla-ics.blogspot.com/2021/03/markdown-template-for-journal-of.html">Markdown template for the Journal of Cheminformatics with CiTO support</a></li>
  <li>2020: <a href="https://chem-bla-ics.linkedchemistry.info/2020/11/30/cito-updates-3-third-paper-in.html">CiTO updates #3: third paper in the collection and updated Scholia patch <i class="fa-solid fa-recycle fa-xs"></i></a></li>
  <li>2020: <a href="https://chem-bla-ics.linkedchemistry.info/2020/11/01/cito-updates-2-annotation-migration-to.html">CiTO updates #2: annotation migration to Wikidata and first Scholia patch <i class="fa-solid fa-recycle fa-xs"></i></a></li>
  <li>2020: <a href="https://chem-bla-ics.linkedchemistry.info/2020/11/01/cito-updates-1-first-research-paper-in.html">CiTO updates #1: first research paper in the Journal of Cheminformatics with CiTO annotation published <i class="fa-solid fa-recycle fa-xs"></i></a></li>
  <li>July 2020: <a href="https://chem-bla-ics.blogspot.com/2020/07/new-editorial-adoption-of-citation.html">New Editorial: “Adoption of the Citation Typing Ontology by the Journal of Cheminformatics”</a></li>
  <li>2015: <a href="https://chem-bla-ics.blogspot.com/2015/03/what-youre-doing-is-rather-desperate.html">“What You’re Doing Is Rather Desperate”</a></li>
  <li>2012: <a href="https://chem-bla-ics.linkedchemistry.info/2012/02/23/cito-citeulike-publishing-innovation.html">CiTO / CiteULike: publishing innovation <i class="fa-solid fa-recycle fa-xs"></i></a></li>
  <li>2010: <a href="https://chem-bla-ics.linkedchemistry.info/2010/10/31/citeulike-cito-use-case-1-wordles.html">CiteULike CiTO Use Case #1: Wordles <i class="fa-solid fa-recycle fa-xs"></i></a></li>
  <li>September 2010: <a href="https://chem-bla-ics.linkedchemistry.info/2010/09/17/list-of-things-i-miss-in-citeulike.html">A list of things I miss in CiteULike <i class="fa-solid fa-recycle fa-xs"></i></a></li>
</ul>]]></content><author><name>Egon Willighagen</name></author><category term="cito" /><category term="data" /><category term="scholia" /><category term="doi:10.1186/s13321-023-00683-2" /><category term="justdoi:10.1186/s13321-023-00684-1" /><category term="justdoi:10.1186/s13321-022-00656-x" /><category term="justdoi:10.1186/s13321-022-00673-w" /><category term="justdoi:10.1186/s13321-022-00677-6" /><category term="doi:10.1186/s13321-023-00701-3" /><category term="justdoi:10.1162/QSS_A_00222" /><category term="justdoi:10.5281/zenodo.5155219" /><category term="doi:10.21105/joss.02558" /><category term="doi:10.5281/ZENODO.7368209" /><summary type="html"><![CDATA[Okay, the Pilot is over ending with 17 papers, 16 of which have CiTO annotations (and so far 4 J.Cheminform. papers after the pilot), but my interest in the Citation Typing Ontology continues and we just need more adoption.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/Screenshot_20230402_085317.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/Screenshot_20230402_085317.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">BioHackathon Europe 2021 #1: CiTO annotations in BioHackrXiv</title><link href="https://chem-bla-ics.linkedchemistry.info/2021/11/15/biohackathon-europe-2021-1-cito.html" rel="alternate" type="text/html" title="BioHackathon Europe 2021 #1: CiTO annotations in BioHackrXiv" /><published>2021-11-15T00:00:00+00:00</published><updated>2021-11-15T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2021/11/15/biohackathon-europe-2021-1-cito</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2021/11/15/biohackathon-europe-2021-1-cito.html"><![CDATA[<p>Serendipity. I did not plan this hack at the <a href="https://biohackathon-europe.org/">BioHackathon Europe 2021</a> but it happened anyway.
Based on earlier work in the <a href="https://www.biomedcentral.com/collections/cito">Journal of Cheminformatics</a>, extending on the
<a href="https://doi.org/10.7717/peerj-cs.112">work by Krewinkel et al.</a> I looked into the idea of using the Lua filter for
<a href="https://biohackrxiv.org/">BioHackrXiv</a>, a preprint server for BioHackathons. Actually, I started by looking at the
Citation Styling Language file used by the BioHackrXiv tools. But that was just wrong.</p>

<p>Long story short: <a href="https://github.com/biohackrxiv/bhxiv-gen-pdf/pull/10">it worked</a>! Thanks to the encouragements from
<a href="https://github.com/pjotrp">Pjotr</a> and <a href="https://github.com/inutano">Tazro</a> and suggestions from
<a href="https://twitter.com/larswillighagen/status/1458059589925187585">Lars</a> and some code on how to
<a href="http://lua-users.org/wiki/TableUtils">dump a Lua data structure to stdout</a>.</p>

<p>In the Markdown/BibTeX combination you would normally write <code class="language-plaintext highlighter-rouge">[@bibtexkey]</code> to add the reference to the article with the given key
in the <code class="language-plaintext highlighter-rouge">.bib</code> file. To type the citation (to state the intention why you cite that source), for example because you use a method
in it, you write <code class="language-plaintext highlighter-rouge">[@usesMethodIn:bibtexkey]</code>. This is different from
<a href="https://github.com/jcheminform/markdown-jcheminf">how it currently works for the Journal of Cheminformatics</a>,
where the intention cannot be given at citation level yet. You can even use more than one intention, e.g. <code class="language-plaintext highlighter-rouge">[@usesMethodIn:extends:bibtexkey]</code>.</p>

<p>If you want to try it, just create a compatible Markdown file with BibTeX file in a new GitHub repository, and post the repository URL on
this <a href="http://preview.biohackrxiv.org/">cool preview website</a>.</p>

<p>Here’s what the created PDF could look like:</p>

<p><img src="/assets/images/citoBioHackrXiv.png" alt="" /></p>]]></content><author><name>Egon Willighagen</name></author><category term="cito" /><category term="biohackrxiv" /><category term="markdown" /><category term="pandoc" /><category term="biohackeu12" /><category term="justdoi:10.7717/peerj-cs.112" /><summary type="html"><![CDATA[Serendipity. I did not plan this hack at the BioHackathon Europe 2021 but it happened anyway. Based on earlier work in the Journal of Cheminformatics, extending on the work by Krewinkel et al. I looked into the idea of using the Lua filter for BioHackrXiv, a preprint server for BioHackathons. Actually, I started by looking at the Citation Styling Language file used by the BioHackrXiv tools. But that was just wrong.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/citoBioHackrXiv.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/citoBioHackrXiv.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry></feed>