<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.3.4">Jekyll</generator><link href="https://chem-bla-ics.linkedchemistry.info/feed/by_tag/fair.xml" rel="self" type="application/atom+xml" /><link href="https://chem-bla-ics.linkedchemistry.info/" rel="alternate" type="text/html" /><updated>2026-04-19T09:50:36+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/feed/by_tag/fair.xml</id><title type="html">chem-bla-ics</title><subtitle>Chemblaics (pronounced chem-bla-ics) is the science that uses open science and computers to solve problems in chemistry, biochemistry and related fields.</subtitle><author><name>Egon Willighagen</name></author><entry><title type="html">The TDCC NES Col-Lab Retreat</title><link href="https://chem-bla-ics.linkedchemistry.info/2026/02/21/the-tdcc-nes-col-lab-retreat.html" rel="alternate" type="text/html" title="The TDCC NES Col-Lab Retreat" /><published>2026-02-21T00:00:00+00:00</published><updated>2026-02-21T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2026/02/21/the-tdcc-nes-col-lab-retreat</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2026/02/21/the-tdcc-nes-col-lab-retreat.html"><![CDATA[<p>Last autumn two TDCC projects started, <em>FAIR4ChemNL</em> (<a href="https://chem-bla-ics.linkedchemistry.info/2026/02/08/open-infrastructures.html">with the PeerTube channel</a>
and doi:<a href="https://doi.org/10.61686/XVYQV45374">10.61686/XVYQV45374</a>) and <em>FAIRify for metabolomics data</em>
(doi:<a href="https://doi.org/10.61686/CSGIP04334">10.61686/CSGIP04334</a>). But I haven’t written much on either yet and what the role is our research group in these projects.</p>

<p>Let’s start with what the TDCC actually are: they are <a href="https://tdcc.nl/">Thematic Digital Competence Centres</a>:</p>

<blockquote>
  <p>The Thematic Digital Competence Centres (TDCCs) are network-based initiatives set up by NWO and the Dutch academic
community to broker investments into research data management projects. The three TDCCs are national and discipline
based, with one pillar each for the Social Sciences &amp; Humanities (SSH), Natural and Engineering Sciences (NES) and
Life Sciences &amp; Health (LSH). The networks will help formulate and facilitate projects designed to promote the adoption
of open data, software and research practices, alongside the development of the necessary expertise.</p>
</blockquote>

<p>So, where initiatives like <a href="https://www.go-fair.org/">GO FAIR</a> had centers of competencies (the implementation networks),
they did not have funding for them. This was a main reason why the <em>Chemistry Implementation Network</em> (ChIN,
doi:<a href="https://doi.org/10.1162/dint_a_00035">10.1162/dint_a_00035</a>) did not take off.
The TDCCs do not provide a lot of money, but enough to support disseminating expertise and promote some key ideas.</p>

<p>The idea is that combined with other efforts, it strengthens the level of FAIR in the Dutch research community.
I have to say, this is much needed, as the level of FAIR data in journal publications is so much to wish for,
and still mostly absent.</p>

<p>The FAIR4ChemNL project already had a networking activity during the writing of the proposal, the workshop already
back in 2024 that I <a href="https://chem-bla-ics.linkedchemistry.info/2024/06/10/two-meetings.html">blogged about earlier</a>
(see also <a href="https://doi.org/10.5281/zenodo.15050550">this report</a>).
The FAIRify project is coordinated by the group that was key in the <em>Netherlands Metabolomics Center</em> (NMC), now the
<a href="https://metabolomicscentre.nl/">BeneLux Metabolomics Center</a>. During a postdoc at the NMC during my Wageningen
days, we already did a lot of FAIR competency building with <a href="https://chem-bla-ics.linkedchemistry.info/tag/metware">the MetWare project</a>.</p>

<h2 id="the-col-lab-retreat">The Col-Lab Retreat</h2>

<p>The <a href="https://tdcc.nl/about-tddc/nes/">TDCC-NES</a> organized a networking event in August last year,
the 2025 <a href="https://nescollab.nl/">TDCC-NES Col-Lab Retreat</a>. I am late with
reporting on it, but there simply was too much project management that took priority. The meeting was in the
wonderful Dutch town Schoorl, and the location is great for collaborative meetings. I had been there a year
earlier for an Open Science Retreat and was happy to go back.</p>

<p>During the unconference-style meeting <a href="https://tdcc.nl/creating-space-for-our-community-the-story-of-our-nes-col-lab-retreat/">various topics were discussed</a>
in breakout groups, and because of the two TDCC projects, I was particularly interested in the <em>Metadata and interoperability</em>
topic. Partly because this is how we can make eletronic lab notebooks automatically push metadata to
registries (and <a href="https://www.linkedin.com/in/rory-macneil-68a80011/">Rory Macneil</a> was also in Schoorl,
of <a href="https://www.researchspace.com/">RSpace/ResearchSpace</a> which already integrated with various open
platforms), and partly because I wanted to continue explore <a href="https://chem-bla-ics.linkedchemistry.info/tag/nanopub">nanopublications</a>
with <a href="https://fediscience.org/@rupdecat">Christian Meesters</a>, which could be the envelope to distribute
the metadata. For the last, I was looking at the Java library for nanopublications
(see <a href="https://github.com/Nanopublication/nanopub-java/pull/52">this PR</a>.</p>

<p>The idea that ELNs automatically share metadata about experiments is something that is attractive.
It would require no involvement from the researcher, would be fully automatic, and drive interest
(users, peer reviewers) to experiments and experimental data. Something that is still absurdly hard
is to do a search for experiments that measured the melting point of some chemical. How
awesome would it be if ELNs would automatically register chemicals from the experiment in,
for example, <a href="https://pubchem.ncbi.nlm.nih.gov/">PubChem</a>.</p>

<p>We had the idea of applying for a Lorentz Workshop, but the earliest deadline was too early, but
maybe it is time to pick up that idea again. Interoperability standards already exist, like
the aforementioned nanopubs, but also <a href="https://www.researchobject.org/ro-crate/">RO-Crates</a> that are also studied by Jente Houweling
in the VHP4Safety project (see <a href="https://platform.vhp4safety.nl/data">this Data tab</a> for a preview).</p>]]></content><author><name>Egon Willighagen</name></author><category term="fair" /><category term="doi:10.1162/DINT_A_00035" /><category term="chemistry" /><category term="metabolomics" /><category term="fair4chemnl" /><category term="fairify" /><category term="cito:citesAsEvidence:10.5281/ZENODO.15050550" /><category term="nanopub" /><category term="crate" /><category term="pubchem" /><summary type="html"><![CDATA[Last autumn two TDCC projects started, FAIR4ChemNL (with the PeerTube channel and doi:10.61686/XVYQV45374) and FAIRify for metabolomics data (doi:10.61686/CSGIP04334). But I haven’t written much on either yet and what the role is our research group in these projects.]]></summary></entry><entry><title type="html">PFAS in the blood of the Dutch population</title><link href="https://chem-bla-ics.linkedchemistry.info/2025/07/06/pfas-in-the-blood-of-the-dutch-population.html" rel="alternate" type="text/html" title="PFAS in the blood of the Dutch population" /><published>2025-07-06T00:00:00+00:00</published><updated>2025-07-06T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2025/07/06/pfas-in-the-blood-of-the-dutch-population</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2025/07/06/pfas-in-the-blood-of-the-dutch-population.html"><![CDATA[<p>A recent report by the Dutch <a href="https://www.rivm.nl/">RIVM</a>, <em>PFAS in the blood of the Dutch population</em>
(doi:<a href="https://www.rivm.nl/bibliotheek/rapporten/2025-0094.pdf">10.21945/RIVM-2025-0094</a>), writes
that seven <a href="https://scholia.toolforge.org/chemical-class/Q648037">PFAS</a> compounds are found in blood samples
of all tested people. Another nine compounds are found in at least 1-in-10 people.
Because there is relevant data in the report on the 28 studied PFAS compound, I wanted to
have the report more FAIR than it is on the website. Why this report? Well, the chemistry and the
history is fascinating and brutal (I like <a href="https://www.youtube.com/watch?v=SC2eSujzrUY">this Veritasium video</a>).</p>

<p>The history tells me that our society may sound woke and leftish, in reality it is a continous fight
for basic human rights. (Something that plenty have been saying for years.)
In this case, a healtht life is the human right.</p>

<p>So, what can I do to make this report more FAIR?</p>

<h2 id="findable-in-wikidata">Findable in Wikidata</h2>

<p>Since this report has been <a href="https://news.google.com/search?q=PFAS%20in%20the%20blood%20of%20the%20Dutch%20population&amp;hl=en-US&amp;gl=US&amp;ceid=US%3Aen">mentioned in the news</a>,
it clearly is notable. The simplest thing to do is thus to just add it <a href="https://www.wikidata.org/wiki/Wikidata:Main_Page">Wikidata</a>.
Because the DOI of the report had not been recorded yet, I could not let <a href="https://scholia.toolforge.org/">Scholia</a>
do it for me. But doing it manually is only a bit more work: <a href="https://www.wikidata.org/wiki/Q135222054">Q135222054</a>.
The provided metadata <a href="https://www.rivm.nl/en/news/first-nationwide-study-into-pfas-in-blood">on the RIVM website</a>
is minimal.</p>

<p>But we can do more. Particularly, because I want people to find this report when they look info knowledge
about the 28 studied chemicals, I added <a href="https://www.wikidata.org/wiki/Q135222054#P921">main subject</a> annotation
using the information in <em>Table 1</em> in the report. Using Scholia and the CAS registry number in the table,
I crosscheck the information in Wikidata is consistent with the report (and visa versa). It was.
I then added the Dutch name and acronym for most of them. Some already had the name as in the Table.
That gives us a nice “Topic scores” plot for <a href="https://scholia.toolforge.org/work/Q135222054">the Scholia page of the report</a>:</p>

<p><img src="/assets/images/pfas_report.png" alt="" /></p>

<p>The central PFAS bubble is also only one <em>main subject</em> but larger because many the specific PFAS compounds
are subclassing PFAS. And you may also note many smaller bubbles. These actually come from <em>main subject</em>
annotations of articles cited from the report. Because I added a few of them too. Not all, because many are
not in Wikidata (yet).</p>

<h2 id="findable-in-wikipathways">Findable in WikiPathways</h2>

<p>But since 16 of these compounds are readily found in human blood samples, that is handy knowledge when
doing metabolomics (on blood samples). Or (and I leave that to later blog post), we can map the experimental
data for Dordrecht versus the rest of The Netherlands to the PFAS compounds. That is relevant to research
by <a href="https://vhp4safety.nl">VHP4Safety</a>. There are many ways to see if you have PFAS in your dataset,
but since we have many controlled lists of genes in metabolites, I added one for common PFAS in human
blood samples. Well, the 16 common in Dutch blood samples:</p>

<p><img src="/assets/images/pfas_wikipathways.png" alt="" /></p>

<p>Each <em>metabolite</em> here is annotated with their Wikidata identifier, allowing us to map experimental
data on top of it. And we get links out to other databases almost for free:</p>

<p><img src="/assets/images/pfas_wikipathways_outlinks.png" alt="" /></p>

<p>And the link to Wikidata actually links to Scholia, so for the PFOA in the above example,
we can quickly see the boiling point, decomposition point, and melting point of this PFAS.
And literature with undoubtedly even more knowledge about this PFAS:</p>

<p><img src="/assets/images/pfas_scholia.png" alt="" /></p>

<p>Now, these two steps were mostly manual: drawing <a href="https://classic.wikipathways.org/index.php/Pathway:WP5579">WP5579</a>
in WikiPathways and adding the report annotations (<em>main subject</em> and <em>cites</em>) in Wikidata.</p>

<h2 id="findable-in-the-vhp4safety-compound-wiki">Findable in the VHP4Safety Compound Wiki</h2>

<p>As part of the VHP4Safety project, I am collecting information on chemicals studied in the context
of toxicology, safety, and risk assessment. Often specific collections of compounds studied as a whole.
This report is such a collection and provides experimental data on these compounds. So, I want this
report to be findable for the <a href="https://compoundcloud.wikibase.cloud/">VHP4Safety Compound Wiki</a> too.
Creating the collection is a manual step: <a href="https://compoundcloud.wikibase.cloud/wiki/Item:Q5145">Q5145</a>.</p>

<p>Now, because both Wikidata and our VHP4Safety Compound Wiki (a Wikibase instance) are semantic and support, I can use SPARQL
to create instructions to link the 28 compounds to the new collection. Now, arguably, that can be
done manually too, and maybe faster, for larger collections this is harder. So, I dug up
<a href="https://compoundcloud.wikibase.cloud/wiki/User:Egonw">my earlier notes</a> and got some useful
things together.</p>

<p>This query lists all 28 PFAS linked to the report <a href="https://w.wiki/Eepm">in Wikidata</a>:</p>

<div class="language-sparql highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">SELECT</span><span class="w"> </span><span class="nv">?pfas</span><span class="w"> </span><span class="nv">?pfasLabel</span><span class="w"> </span><span class="k">WHERE</span><span class="w"> </span><span class="p">{</span><span class="w">
  </span><span class="nn">wd</span><span class="o">:</span><span class="ss">Q135222054</span><span class="w"> </span><span class="nn">wdt</span><span class="o">:</span><span class="ss">P921</span><span class="w"> </span><span class="nv">?pfas</span><span class="w"> </span><span class="p">.</span><span class="w">
  </span><span class="nv">?pfas</span><span class="w"> </span><span class="nn">wdt</span><span class="o">:</span><span class="ss">P31</span><span class="w"> </span><span class="nn">wd</span><span class="o">:</span><span class="ss">Q113145171</span><span class="w"> </span><span class="p">.</span><span class="w">
  </span><span class="k">SERVICE</span><span class="w"> </span><span class="nn">wikibase</span><span class="o">:</span><span class="ss">label</span><span class="w"> </span><span class="p">{</span><span class="w"> </span><span class="nn">bd</span><span class="o">:</span><span class="ss">serviceParam</span><span class="w"> </span><span class="nn">wikibase</span><span class="o">:</span><span class="ss">language</span><span class="w"> </span><span class="s2">"[AUTO_LANGUAGE],mul,en"</span><span class="p">.</span><span class="w"> </span><span class="p">}</span><span class="w">
</span><span class="p">}</span><span class="w">
</span></code></pre></div></div>

<p>Using federation powers, I can use this for <a href="https://edu.nl/ar9wf to match these up with our Wikibase">a SPARQL query</a>,
and return the results in QuickStatements that say <em>this VHP compound is part of the VHP collection</em>:</p>

<div class="language-sparql highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">PREFIX</span><span class="w"> </span><span class="nn">wb</span><span class="o">:</span><span class="w"> </span><span class="nn">&lt;https://compoundcloud.wikibase.cloud/entity/&gt;</span><span class="w">
</span><span class="k">PREFIX</span><span class="w"> </span><span class="nn">wbt</span><span class="o">:</span><span class="w"> </span><span class="nn">&lt;https://compoundcloud.wikibase.cloud/prop/direct/&gt;</span><span class="w">

</span><span class="k">SELECT</span><span class="w"> </span><span class="p">(</span><span class="nb">SUBSTR</span><span class="p">(</span><span class="nb">STR</span><span class="p">(</span><span class="nv">?cmp</span><span class="p">),</span><span class="mi">45</span><span class="p">)</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="nv">?qid</span><span class="p">)</span><span class="w"> </span><span class="nv">?P21</span><span class="w"> </span><span class="k">WHERE</span><span class="w"> </span><span class="p">{</span><span class="w">
  </span><span class="nv">?cmp</span><span class="w"> </span><span class="nn">wbt</span><span class="o">:</span><span class="ss">P5</span><span class="w"> </span><span class="nv">?wikidata</span><span class="w"> </span><span class="p">.</span><span class="w">
  </span><span class="k">SERVICE</span><span class="w"> </span><span class="nn">&lt;https://query.wikidata.org/sparql&gt;</span><span class="w"> </span><span class="p">{</span><span class="w">
    </span><span class="nn">wd</span><span class="o">:</span><span class="ss">Q135222054</span><span class="w"> </span><span class="nn">wdt</span><span class="o">:</span><span class="ss">P921</span><span class="w"> </span><span class="nv">?pfas</span><span class="w"> </span><span class="p">.</span><span class="w">
    </span><span class="nv">?pfas</span><span class="w"> </span><span class="nn">wdt</span><span class="o">:</span><span class="ss">P31</span><span class="w"> </span><span class="nn">wd</span><span class="o">:</span><span class="ss">Q113145171</span><span class="w"> </span><span class="p">.</span><span class="w">
    </span><span class="k">BIND</span><span class="w"> </span><span class="p">(</span><span class="nb">substr</span><span class="p">(</span><span class="nb">str</span><span class="p">(</span><span class="nv">?pfas</span><span class="p">),</span><span class="mi">32</span><span class="p">)</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="nv">?wikidata</span><span class="p">)</span><span class="w">
  </span><span class="p">}</span><span class="w">
  </span><span class="k">BIND</span><span class="w"> </span><span class="p">(</span><span class="s2">"Q5145"</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="nv">?P21</span><span class="p">)</span><span class="w">
</span><span class="p">}</span><span class="w">
</span></code></pre></div></div>

<p>I actually had to add 5 PFAS compounds in the VHP4Safety Compound Wiki first. That follows the
<a href="https://chem-bla-ics.linkedchemistry.info/2016/03/20/adding-disclosures-to-wikidata-with.html">same procedure for how I have been adding chemical compounds to Wikidata</a>
(see also <a href="https://doi.org/10.26434/chemrxiv-2025-53n0w">this preprint</a>).
The input <code class="language-plaintext highlighter-rouge">cas.smi</code> has the (missing) SMILES, Wikidata QID, and English label:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>C(CS(=O)(=O)O)C(C(C(C(C(C(F)(F)F)(F)F)(F)F)(F)F)(F)F)(F)F       Q27063662       6:2 Fluorotelomer sulfonate
CN(CC(=O)O)S(=O)(=O)C(C(C(C(C(C(F)(F)F)(F)F)(F)F)(F)F)(F)F)(F)F Q126605979      MeFHxSAA
CN(CC(=O)O)S(=O)(=O)C(C(C(C(F)(F)F)(F)F)(F)F)(F)F       Q126682412      MeFBSAA
C(=O)(C(C(F)(F)F)(F)OC(C(C(F)(F)F)(F)F)(F)F)O[H]        Q29387971       2,3,3,3-tetrafluoro-2-(heptafluoropropoxy)propanoic acid
C(C(C(=O)O)(F)F)(OC(C(C(OC(F)(F)F)(F)F)(F)F)(F)F)F      Q81981675       4,8-Dioxa-3H-perfluorononanoic acid
</code></pre></div></div>

<p>For reference, this is the command line I used to create QuickStatement instructions:</p>

<div class="language-shell highlighter-rouge"><div class="highlight"><pre class="highlight"><code>groovy createWDitemsFromSMILES.groovy <span class="nt">-w</span> compoundcloud.wikibase.cloud <span class="nt">-c</span> Q2368 <span class="nt">-d</span> P5 <span class="nt">-l</span> <span class="nt">-i</span> wikidata <span class="nt">-a</span> P11
</code></pre></div></div>

<h2 id="final-remark">Final remark</h2>

<p>Are these 16 the only PFAS in our body? With 28 studied out of <a href="https://doi.org/10.1021/acs.est.3c04855">a potential seven million</a>,
I doubt it.</p>]]></content><author><name>Egon Willighagen</name></author><category term="pfas" /><category term="chemistry" /><category term="fair" /><category term="scholia" /><category term="wikidata" /><category term="vhp4safety" /><category term="doi:10.26434/CHEMRXIV-2025-53N0W" /><category term="cito:citesAsRecommendedReading:10.1021/acs.est.3c04855" /><summary type="html"><![CDATA[A recent report by the Dutch RIVM, PFAS in the blood of the Dutch population (doi:10.21945/RIVM-2025-0094), writes that seven PFAS compounds are found in blood samples of all tested people. Another nine compounds are found in at least 1-in-10 people. Because there is relevant data in the report on the 28 studied PFAS compound, I wanted to have the report more FAIR than it is on the website. Why this report? Well, the chemistry and the history is fascinating and brutal (I like this Veritasium video).]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/pfas_report.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/pfas_report.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">New paper: FAIR assessment of nanosafety data reusability with community standards</title><link href="https://chem-bla-ics.linkedchemistry.info/2024/06/10/linking-fair-to-reuse.html" rel="alternate" type="text/html" title="New paper: FAIR assessment of nanosafety data reusability with community standards" /><published>2024-06-10T00:00:00+00:00</published><updated>2024-06-10T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2024/06/10/linking-fair-to-reuse</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2024/06/10/linking-fair-to-reuse.html"><![CDATA[<p><a href="FAIR assessment of nanosafety data reusability with community standards">Ammar</a> is finishing up his PhD thesis with his
research on the use of FAIR towards predictive toxicology. Or, “AI ready”, as the term FAIR is now sometimes explained.
Any computational method needs good data, and just FAIR is not enough. It needs to meet community standards, as formalized
in R1.3. To me, this includes meeting community standards like minimal reporting standards. Indeed, in the
<a href="https://www.nanosafetycluster.eu/">EU NanoSafety Cluster</a> the notion that FAIR data also needs be scientifically
good data is well noted.</p>

<p>In this paper (doi:<a href="https://doi.org/10.1038/s41597-024-03324-x">10.1038/s41597-024-03324-x</a>),
Ammar explores this notion and compiled more than 200 maturity indicators in the category R1.3
resulting from 12 different community standards. For example, this includes minimal reporting standards. There
is overlap in needs, but they often also have a different focus. The conclusion here: different (re)use cases
have different needs, and data not usable to one use case can be sufficiently FAIR for another. Of course, ideally,
it would be FAIR enough for all use cases.</p>

<p>Ammar formalizes the maturity indicators and links the comming maturity indicators to various use cases.
That means that when you determine the indicator values for your data, people can immediately lookup how
this data can be reused. And, the generator of the data can immediately see how the data would need to be
improved to widen the reusability. How FAIR can we get?</p>

<p>His proposal has already been further explored in two other papers, one around data sharing
(doi:<a href="https://doi.org/10.1038/s41596-024-00993-1">10.1038/s41596-024-00993-1</a>, see also
<a href="https://doi.org/10.59350/vfvwq-s0v13">this blog post</a>) and one around QSAR modelling
(doi:<a href="https://doi.org/10.1016/j.impact.2023.100475">10.1016/j.impact.2023.100475</a>,
see also <a href="https://doi.org/10.59350/7zf38-w9670">this blog post</a>).</p>

<p>The below screenshot shows what an analysis using this approach can look like:</p>

<p><img src="/assets/images/41597_2024_3324_Fig3_HTML.png" alt="" /></p>]]></content><author><name>Egon Willighagen</name></author><category term="fair" /><category term="toxicology" /><category term="doi:10.1038/S41597-024-03324-X" /><category term="doi:10.1016/J.IMPACT.2023.100475" /><category term="qsar" /><category term="doi:10.1038/S41596-024-00993-1" /><summary type="html"><![CDATA[Ammar is finishing up his PhD thesis with his research on the use of FAIR towards predictive toxicology. Or, “AI ready”, as the term FAIR is now sometimes explained. Any computational method needs good data, and just FAIR is not enough. It needs to meet community standards, as formalized in R1.3. To me, this includes meeting community standards like minimal reporting standards. Indeed, in the EU NanoSafety Cluster the notion that FAIR data also needs be scientifically good data is well noted.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/41597_2024_3324_Fig3_HTML.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/41597_2024_3324_Fig3_HTML.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Two meetings: ELIXIR Toxicology and FAIR4ChemNL</title><link href="https://chem-bla-ics.linkedchemistry.info/2024/06/10/two-meetings.html" rel="alternate" type="text/html" title="Two meetings: ELIXIR Toxicology and FAIR4ChemNL" /><published>2024-06-10T00:00:00+00:00</published><updated>2024-06-10T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2024/06/10/two-meetings</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2024/06/10/two-meetings.html"><![CDATA[<p>Noting that in the coming week I am not attending the <a href="https://elixir-europe.org/events/elixir-all-hands-2024">ELIXIR All Hands in Uppsala</a>.
Having lived in (and around) Uppsala for more than three years, I am disappointed and with the first stories from colleagues coming
in even more. But it has been a way too busy year, I have much to finish up, and I need to take care of myself too. I am not 32 anymore.</p>

<p>But in the past two weeks I did attend two workshops. The first was a <a href="https://www.aanmelder.nl/intoxicom2024firstworkshop">workshop</a> by the
<a href="https://elixir-europe.org/communities/toxicology">ELIXIR Toxicology Community</a>, which was held in Utrecht/NL. The programme was around
FAIR and included two really nice hands-on sessions where we developed drafts for <a href="https://faircookbook.elixir-europe.org/">FAIR Cookbook</a>
recipes (see also doi:<a href="https://doi.org/10.1038/s41597-023-02166-3">10.1038/s41597-023-02166-3</a>) and for
<a href="https://www.go-fair.org/how-to-go-fair/fair-implementation-profile/">FAIR Implementation Profiles</a>
(doi:<a href="https://doi.org/10.1007/978-3-030-65847-2_13">10.1007/978-3-030-65847-2_13</a>). We will write up a
<a href="https://biohackrxiv.org/discover">BioHackrXiv</a> report.</p>

<p>The second workshop was last week, the <a href="https://tdcc.nl/evenementen/fair4chemnl-workshop/">FAIR4ChemNL workshop</a>, which was also held
in Utrecht/NL. The topic was FAIR in chemistry, and we discussed various aspects. There was a significant participant group from the
German NFDI4Cat project (“Cat” is short for (chemical) catalysis), which recently published a nice analysis of several ontologies
(doi:<a href="https://doi.org/10.1186/s13321-024-00807-2">10.1186/s13321-024-00807-2</a>). And there was also a lot of mention of RDF and SPARQL.</p>

<p>I think it is time for a new special issue around semantic web technologies.</p>]]></content><author><name>Egon Willighagen</name></author><category term="elixir" /><category term="fair" /><category term="chemistry" /><category term="doi:10.1038/S41597-023-02166-3" /><category term="justdoi:10.1007/978-3-030-65847-2_13" /><category term="justdoi:10.1186/S13321-024-00807-2" /><category term="rdf" /><category term="sparql" /><category term="fair4chemnl" /><summary type="html"><![CDATA[Noting that in the coming week I am not attending the ELIXIR All Hands in Uppsala. Having lived in (and around) Uppsala for more than three years, I am disappointed and with the first stories from colleagues coming in even more. But it has been a way too busy year, I have much to finish up, and I need to take care of myself too. I am not 32 anymore.]]></summary></entry><entry><title type="html">New paper: A template wizard for the cocreation of machine-readable data-reporting to harmonize the evaluation of (nano)materials</title><link href="https://chem-bla-ics.linkedchemistry.info/2024/05/27/from-spreadsheets-to-rdf.html" rel="alternate" type="text/html" title="New paper: A template wizard for the cocreation of machine-readable data-reporting to harmonize the evaluation of (nano)materials" /><published>2024-05-27T00:00:00+00:00</published><updated>2024-05-27T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2024/05/27/from-spreadsheets-to-rdf</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2024/05/27/from-spreadsheets-to-rdf.html"><![CDATA[<p>I was about to call this blog post <em>From spreadsheets to RDF</em>, after <a href="https://chem-bla-ics.linkedchemistry.info/2024/05/20/from-papers-to-rdf.html">the post last week</a>.
But then I decided to just use the pattern I typically use. Why I wanted to use that shorter term in the first
place was that one of the thing I like about the <a href="https://sourceforge.net/projects/ambit/">AMBIT software</a>
(of OpenTox and eNanoMapper fame) is its
RDF support (see doi:<a href="https://doi.org/10.1186/1756-0500-4-487">10.1186/1756-0500-4-487</a>). But
<a href="https://chem-bla-ics.linkedchemistry.info/tag/rdf">RDF</a>, ontologies,
those are hard things. And unlike mathematics, we do not have simple objects like integer numbers or simple
operators. Well, I think we do, and we talk about them. But there is no obligatory education. Just like
any biologist needs to know what <em>1 + 2</em> means, I think any biologist needs basic knowledge about how
knowledge graphs work. But sometimes feels like a taboo, like cursing in the life sciences church.</p>

<p>So, there we are. This is where spreadsheets come in. If done well, they combine aspects of knowledge graphs
with usability and can even cover a good bit of the learnability. This is what is described in this new
paper about templates in the <a href="https://www.nanosafetycluster.eu/">EU NanoSafety Cluster</a>: <em>A template wizard
for the cocreation of machine-readable data-reporting to harmonize the evaluation of (nano)materials</em>
(doi:<a href="https://doi.org/10.1038/s41596-024-00993-1">10.1038/s41596-024-00993-1</a>).</p>

<p>The learnability comes in with the spreadsheet templates (“this is how we did it”) and a “wizard” around
it guides the user with the selection of a template but also can provide feedback on the template. The
technical term for that is “validator”, but it can be tought of as a spelling checker. Computers are good at
finding contradictions (the lack of a pattern), though less good at ranking the alternatives (which is
the cause of hallucinations in AI approaches).</p>

<p>And to return to the RDF, software like AMBIT can read these templates, use the semantics linked to the
template, and make the FAIR static spreadsheets (good for archiving on Zenodo!) available as FAIR interactive
data (good for exploration and machine learning), and as RDF (good for data integration).</p>

<p>Congrats to <a href="http://orcid.org/0000-0002-4322-6179">Nina</a> and the various EU NanoSafety Cluster projects!</p>]]></content><author><name>Egon Willighagen</name></author><category term="rdf" /><category term="opentox" /><category term="fair" /><category term="doi:10.1186/1756-0500-4-487" /><category term="doi:10.1038/S41596-024-00993-1" /><summary type="html"><![CDATA[I was about to call this blog post From spreadsheets to RDF, after the post last week. But then I decided to just use the pattern I typically use. Why I wanted to use that shorter term in the first place was that one of the thing I like about the AMBIT software (of OpenTox and eNanoMapper fame) is its RDF support (see doi:10.1186/1756-0500-4-487). But RDF, ontologies, those are hard things. And unlike mathematics, we do not have simple objects like integer numbers or simple operators. Well, I think we do, and we talk about them. But there is no obligatory education. Just like any biologist needs to know what 1 + 2 means, I think any biologist needs basic knowledge about how knowledge graphs work. But sometimes feels like a taboo, like cursing in the life sciences church.]]></summary></entry><entry><title type="html">New paper: From papers to RDF-based integration of physicochemical data and adverse outcome pathways for nanomaterials</title><link href="https://chem-bla-ics.linkedchemistry.info/2024/05/20/from-papers-to-rdf.html" rel="alternate" type="text/html" title="New paper: From papers to RDF-based integration of physicochemical data and adverse outcome pathways for nanomaterials" /><published>2024-05-20T00:00:00+00:00</published><updated>2024-05-20T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2024/05/20/from-papers-to-rdf</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2024/05/20/from-papers-to-rdf.html"><![CDATA[<p>Making something FAIR is hard, particularly when you do more than making something findable. We’ve seen before that
making something usefully findable <a href="https://chem-bla-ics.blogspot.com/2020/10/new-paper-semi-automated-workflow-for.html">requires deep indexing</a>,
and already that continues to be difficult, because we are not seeing it enough.
So, when I thought convert a <a href="https://chem-bla-ics.blogspot.com/2021/05/new-strategy-towards-generation-of.html">paper led by Hoet’s lab in Leuven</a>
into machine-actionable RDF to make it FAIR, I gravely underestimated the amount of work.
<a href="https://scholia.toolforge.org/author/Q99306396">Jeaphianne</a> et al. did an awesome job on this work
(doi:<a href="https://doi.org/10.1186/s13321-024-00833-0">10.1186/s13321-024-00833-0</a>).</p>

<p>The idea was simple: write up which nanomaterial (type) activates which molecular initiating event.
It would simply annotate each material with a unique identifier to link it to databases like
<a href="https://enanomapper.adma.ai/">eNanoMapper</a> and <a href="https://doi.org/10.3389/fphy.2023.1271842">NanoCommons</a>
and it would use unique identifiers for the
<a href="https://chem-bla-ics.blogspot.com/2022/05/new-providing-adverse-outcome-pathways.html">Adverse Outcome Pathway</a>) (AOP) key events.
As such, it would make a direct link in the growing linked open data cloud between the AOPs
and the nanomaterial databases.</p>

<p>Unfortunately, it was quickly discovered that actually reusing this new datasets requires rich annotation (metadata!)
of the materials and the materials from the source paper were not yet in material databases.
And then the cumbersome start was started, resulting in a very rich data model describing the
key events, the materials, the assays used, and the original papers themselves:</p>

<p><img src="/assets/images/13321_2024_833_Fig1_HTML.png" alt="" /></p>

<p>But the work has not finished yet. The paper assigned <a href="https://chem-bla-ics.blogspot.com/2022/09/nanomaterial-identifiers-erm-identifier.html">ERM identifiers</a>
to all included materials, and now these need to be added to new <a href="https://nanocommons.github.io/erm-database/">ERM Identifier Database</a>
under development.</p>]]></content><author><name>Egon Willighagen</name></author><category term="fair" /><category term="rdf" /><category term="doi:10.1186/S13321-024-00833-0" /><category term="doi:10.14573/ALTEX.2102191" /><category term="doi:10.3390/NANO10102068" /><category term="erm" /><category term="doi:10.1186/S13321-022-00614-7" /><category term="doi:10.3389/FPHY.2023.1271842" /><category term="doi:10.3762/BJNANO.6.165" /><category term="doi:10.1089/AIVT.2021.0010" /><summary type="html"><![CDATA[Making something FAIR is hard, particularly when you do more than making something findable. We’ve seen before that making something usefully findable requires deep indexing, and already that continues to be difficult, because we are not seeing it enough. So, when I thought convert a paper led by Hoet’s lab in Leuven into machine-actionable RDF to make it FAIR, I gravely underestimated the amount of work. Jeaphianne et al. did an awesome job on this work (doi:10.1186/s13321-024-00833-0).]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/13321_2024_833_Fig1_HTML.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/13321_2024_833_Fig1_HTML.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Reusing data: two new papers</title><link href="https://chem-bla-ics.linkedchemistry.info/2024/03/17/two-papers.html" rel="alternate" type="text/html" title="Reusing data: two new papers" /><published>2024-03-17T00:00:00+00:00</published><updated>2024-03-17T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2024/03/17/two-papers</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2024/03/17/two-papers.html"><![CDATA[<p>My research is about the interaction of (machine) representation and the impact on the success of
data analysis (matchine learning, chemometrics, AI, etc). See the posts
<a href="https://chem-bla-ics.linkedchemistry.info/2010/08/09/molecular-chemometrics-principles-1.html">about</a>
<a href="https://chem-bla-ics.linkedchemistry.info/2010/08/12/molecular-chemometrics-principles-2-be.html">molecular</a>
<a href="https://chem-bla-ics.linkedchemistry.info/2010/08/14/molecular-chemometrics-principles-3.html">chemometrics</a>.
This got me into <a href="https://chem-bla-ics.linkedchemistry.info/tag/fair">FAIR</a>: making data interoperable
and being able to (really) reuse data is the starting point of doing research.</p>

<p>So, when I get the chance to see something where I worked on to make more FAIR actually being used,
I love to push the boundaries of FAIR a bit extra. The study of representation of molecules and molecular
systems is not quite a popular science, but I find it important. Two new papers got recently published
to which I contributed from this perspective.</p>

<p>The first paper by Anna Niarakis <i>et al.</i> is about using the SARS-CoV-2/COVID-19 knowledge base we
have collected of the past 4 years (doi:<a href="https://doi.org/10.3389/fimmu.2023.1282859">10.3389/fimmu.2023.1282859</a>).
For me, this started with a WikiPathways with early knowledge about the virus proteins. I think
in this and earlier papers, we improved our open science and bioinformatics and are actually
more ready for a next pandemic, which inevitably will come.</p>

<p>The second paper by Alfaro Serrano <i>et al.</i> is about how access to data remains key to many
things, and this, obviously, includes the Sustainable Development Goals (SDGs)
(doi:<a href="https://doi.org/10.1039/D3SU00148B">10.1039/D3SU00148B</a>). When it comes down
to the face/off of FAIR versus Open, I think Open has more impact, hands-down.</p>

<p>About the latter, I recently wrote up ten simple actions you can take to make your
nanosafety research output more FAIR (doi:<a href="https://doi.org/10.5281/zenodo.10533126">10.5281/zenodo.10533126</a>).</p>]]></content><author><name>Egon Willighagen</name></author><category term="covid19" /><category term="fair" /><category term="nanosafety" /><category term="nanocommons" /><category term="doi:10.3389/FIMMU.2023.1282859" /><category term="doi:10.1039/D3SU00148B" /><category term="doi:10.5281/ZENODO.10533126" /><summary type="html"><![CDATA[My research is about the interaction of (machine) representation and the impact on the success of data analysis (matchine learning, chemometrics, AI, etc). See the posts about molecular chemometrics. This got me into FAIR: making data interoperable and being able to (really) reuse data is the starting point of doing research.]]></summary></entry><entry><title type="html">Using FAIR to select data for reuse</title><link href="https://chem-bla-ics.linkedchemistry.info/2023/09/17/using-fair-for-reuse.html" rel="alternate" type="text/html" title="Using FAIR to select data for reuse" /><published>2023-09-17T00:00:00+00:00</published><updated>2023-09-17T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2023/09/17/using-fair-for-reuse</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2023/09/17/using-fair-for-reuse.html"><![CDATA[<p>This paper got published in July already, but I had not had the time yet to blog about this exciting work by
<a href="https://scholia.toolforge.org/author/Q92131000">Irini Furxhi</a> and <a href="https://scholia.toolforge.org/author/Q86442640">Ammar Ammar</a>:
<em>A data reusability assessment in the nanosafety domain based on the NSDRA framework followed by an exploratory
quantitative structure activity relationships (QSAR) modeling targeting cellular viability</em>
(doi:<a href="https://doi.org/10.1016/j.impact.2023.100475">10.1016/j.impact.2023.100475</a>)</p>

<p>The study has two sides to it: first, it looks into how far we are with <a href="https://en.wikipedia.org/wiki/Quantitative_structure%E2%80%93activity_relationship">QSAR</a>
in the field of nanosafety. We have limited data, but this paper got together 34 data sets, and in the model building
many different possible factors are explored. Now, as a scholar, I would really want to know which factors are
really important. We have been studying this for some time, e.g. in the past RRegrs paper
(doi:<a href="https://doi.org/10.1186/S13321-015-0094-2">10.1186/S13321-015-0094-2</a>). Basically, I think we still
don’t really understand the relation between the data characteristics and the modelling options. When is
data rich enough to move from classification to regression? How much (many) exerimental data do we need,
for the model to capture a certain applicability domain sufficiently?</p>

<p>Actually, I think the rise of deep learning approaches shows us a few things: more data actually does help.
But also, with enough data, the representation becomes less important for the overall pattern. There are
even hints that deep learning needs a certain level of noise. Did anyone study that phenomenon yet?</p>

<p>Now, the reader of this paper will not be disappointed. The design is complex and there are many small hints
about what worked and what did not. But this gets us to the other side of this story.</p>

<p>The second side of this paper is the question whether the level of FAIR-ness helps this QSAR modelling.
Earlier, Ammar studied the R1.3 aspects of nanosafety research. The R1.3 guiding principle expects that
<a href="https://www.go-fair.org/fair-principles/r1-3-metadata-meet-domain-relevant-community-standards/">(Meta)data meet domain-relevant community standards</a>.
Ammar’s research (preprint doi:<a href="https://doi.org/10.26434/CHEMRXIV-2022-L8VK8-V2">10.26434/CHEMRXIV-2022-L8VK8-V2</a>)
shows we can link this to actual reuse, where QSAR is one of those use cases.
In their July paper, they show how we can integrate the use of the community standards
in a reproducible way to support nanosafety research.</p>

<p>The following screenshot from the article (Figure 2, CC-BY) shows the relation between R1.3 maturity
indicators and QSAR variables:</p>

<p><img src="/assets/images/qsar-maturity-indicators.jpg" alt="" /></p>

<p>I think Furxhi and Ammar may actually have introduced a new community standard: this is how nanoQSAR
research should be done from now on. Irini and Ammar, thanks for this great collaboration!</p>]]></content><author><name>Egon Willighagen</name></author><category term="fair" /><category term="qsar" /><category term="doi:10.1016/J.IMPACT.2023.100475" /><category term="doi:10.1186/S13321-015-0094-2" /><category term="doi:10.26434/CHEMRXIV-2022-L8VK8-V2" /><summary type="html"><![CDATA[This paper got published in July already, but I had not had the time yet to blog about this exciting work by Irini Furxhi and Ammar Ammar: A data reusability assessment in the nanosafety domain based on the NSDRA framework followed by an exploratory quantitative structure activity relationships (QSAR) modeling targeting cellular viability (doi:10.1016/j.impact.2023.100475)]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/qsar-maturity-indicators.jpg" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/qsar-maturity-indicators.jpg" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Community activity #2: FAIRsharing</title><link href="https://chem-bla-ics.linkedchemistry.info/2023/06/11/community-activity-2-fairsharing.html" rel="alternate" type="text/html" title="Community activity #2: FAIRsharing" /><published>2023-06-11T00:00:00+00:00</published><updated>2023-06-11T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2023/06/11/community-activity-2-fairsharing</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2023/06/11/community-activity-2-fairsharing.html"><![CDATA[<p>Some years ago we started the <a href="https://elixir-europe.org/communities/toxicology">ELIXIR Toxicology Community</a>. It has been an interesting journey,
partly covered in <a href="https://f1000research.com/articles/10-1129/v1">this whitepaper</a>). We started with interaction we had in several projects already,
but particularly the potential. I see this. This series of posts is a number of things toxicology projects can do to benefit from ELIXIR solutions
(“<a href="https://elixir-europe.org/services">services</a>”). The posts have been sent first to the ELIXIR Toxicology Community mailing list (please join!).</p>

<h3 id="history">History</h3>

<p>In this post, let’s look at <a href="https://fairsharing.org/">FAIRsharing</a>. It is “A curated, informative and educational resource on data and metadata standards,
inter-related to databases and data policies” [0,1].</p>

<p>The ELIXIR Toxicology Community (we) maintains the toxicology corner of this database and members of our community have been adding toxicology-related
databases, relevant standards. On the side of the policies we are falling a bit short:
<a href="https://fairsharing.org/Toxicology">fairsharing.org/Toxicology</a>.</p>

<h3 id="why-adopt-fairsharing">Why adopt FAIRsharing</h3>

<p>FAIRsharing is one place where metadata can be shared about your databases. It helps make your resources and research more FAIR and explains people
how your work relates to other work (<a href="https://fairsharing.org/graph/3496">fairsharing.org/graph/3496</a>):</p>

<p><img src="/assets/images/fairsharing_toxicology.png" alt="Screenshot of the 'collects' graph of the FAIRsharing Toxicology Community." /></p>

<h3 id="what-you-can-do">What you can do</h3>

<p>Get an account (with your ORCID or GitHub account) and add resources important to your research, your projects, your work generally. Particularly,
(data) policies and standards you are expected to comply with are useful. Also, links between various resources. For example, if some (project)
database complies with an important policy or standards, this is worth seeing show up.</p>

<p>Alternatively, join the ELIXIR Toxicology Community <a href="https://doi.org/10.1162/dint_r_00024">mailing list</a> and post the missing resource there,
or use our issue tracker at <a href="https://github.com/elixir-europe/toxicology-community/issues/">github.com/elixir-europe/toxicology-community/issues/</a>.</p>

<p>Let’s make toxicology more <a href="https://doi.org/10.1162/dint_r_00024">FAIR</a>.</p>

<p>0.<a href="https://www.nature.com/articles/s41587-019-0080-8">https://www.nature.com/articles/s41587-019-0080-8</a>
1.<a href="https://scholia.toolforge.org/work/Q64084285">https://scholia.toolforge.org/work/Q64084285</a></p>]]></content><author><name>Egon Willighagen</name></author><category term="elixir" /><category term="fair" /><category term="toxicology" /><category term="doi:10.12688/f1000research.74502.1" /><category term="justdoi:10.1038/s41587-019-0080-8" /><category term="doi:10.1162/dint_r_00024" /><summary type="html"><![CDATA[Some years ago we started the ELIXIR Toxicology Community. It has been an interesting journey, partly covered in this whitepaper). We started with interaction we had in several projects already, but particularly the potential. I see this. This series of posts is a number of things toxicology projects can do to benefit from ELIXIR solutions (“services”). The posts have been sent first to the ELIXIR Toxicology Community mailing list (please join!).]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/fairsharing_toxicology.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/fairsharing_toxicology.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Paper: The FAIR Cookbook - the essential resource for and by FAIR doers</title><link href="https://chem-bla-ics.linkedchemistry.info/2023/05/22/paper-fair-cookbook-essential-resource.html" rel="alternate" type="text/html" title="Paper: The FAIR Cookbook - the essential resource for and by FAIR doers" /><published>2023-05-22T00:00:00+00:00</published><updated>2023-05-22T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2023/05/22/paper-fair-cookbook-essential-resource</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2023/05/22/paper-fair-cookbook-essential-resource.html"><![CDATA[<p>I think that if you want to make your knowledge FAIR, you should use an open license and RDF. Simple. Now, not everything is knowledge.
A lot of data is, but a lot more is not, think raw data. Using RDF to explain a protein sequence is still something that makes me feel uneasy.</p>

<p>However, first, you need to make RDF, you need to make assumptions explicit, you need to decide on meaning. Making RDF is not easy.
It’s not hard, just a lot of administration and scientific thinking. What did I measure? What model do I use to describe the chemistry?
You know, my research job.</p>

<p>Moreover, not only data should be FAIR. All research output (worth communicating) should be FAIR.</p>

<p>In the past, Andra Waagmeester invited me to co-author a recipe that explains the
<a href="http://www.openphacts.org/specs/2013/WD-rdfguide-20131007/">general steps of creating RDF</a>. This was during the Open PHACTS project and with Carina Haupt.
Writing recipes is something getting traction. They are a bit like <a href="https://r-pkgs.org/vignettes.html">vignettes from the R world</a>.</p>

<p>In the past few years the <a href="https://cordis.europa.eu/project/id/802750">FAIRplus project</a> created a
<a href="https://faircookbook.elixir-europe.org/">FAIR Cookbook</a> with recipes and I wrote a few. Actually, I still have a few to finish,
for which I cannot find the time. I retrospect, I spent too much time on perfecting the recipe to finish them earlier. The FAIR Cookbook
is now a professional venue with editorial board. It is fully open source and welcomes your recipes. Oh, and it is now hosted as ELIXIR service,
which is great to see!</p>

<p>Finally, the <a href="https://doi.org/10.1038/s41597-023-02166-3">The FAIR Cookbook - the essential resource for and by FAIR doers paper</a>
is out. Go read it :)</p>

<p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs41597-023-02166-3/MediaObjects/41597_2023_2166_Fig2_HTML.png?as=webp" alt="Screenshot of a FAIR Cookbook recipe showing the infobox at the top (with reading time, difficulty indicator (4/5 flames), the audience (PIs, ontologists, data scholars), and the author list with ORCID, affiliation, and CReDIT annotation.)" /></p>
<center>
Figure 2 from the article: 'Citability of recipes and identification of and credit for authors; an example is provided.'
</center>]]></content><author><name>Egon Willighagen</name></author><category term="fair" /><category term="doi:10.1038/s41597-023-02166-3" /><summary type="html"><![CDATA[I think that if you want to make your knowledge FAIR, you should use an open license and RDF. Simple. Now, not everything is knowledge. A lot of data is, but a lot more is not, think raw data. Using RDF to explain a protein sequence is still something that makes me feel uneasy.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs41597-023-02166-3/MediaObjects/41597_2023_2166_Fig2_HTML.png?as=webp" /><media:content medium="image" url="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs41597-023-02166-3/MediaObjects/41597_2023_2166_Fig2_HTML.png?as=webp" xmlns:media="http://search.yahoo.com/mrss/" /></entry></feed>