<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.3.4">Jekyll</generator><link href="https://chem-bla-ics.linkedchemistry.info/feed/by_tag/connotea.xml" rel="self" type="application/atom+xml" /><link href="https://chem-bla-ics.linkedchemistry.info/" rel="alternate" type="text/html" /><updated>2026-04-19T09:50:36+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/feed/by_tag/connotea.xml</id><title type="html">chem-bla-ics</title><subtitle>Chemblaics (pronounced chem-bla-ics) is the science that uses open science and computers to solve problems in chemistry, biochemistry and related fields.</subtitle><author><name>Egon Willighagen</name></author><entry><title type="html">Lunch at Nature HQ (with Euan, Joanna, Ian and Ålf)</title><link href="https://chem-bla-ics.linkedchemistry.info/2007/10/16/lunch-at-nature-hq-with-euan-joanna-ian.html" rel="alternate" type="text/html" title="Lunch at Nature HQ (with Euan, Joanna, Ian and Ålf)" /><published>2007-10-16T00:10:00+00:00</published><updated>2007-10-16T00:10:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2007/10/16/lunch-at-nature-hq-with-euan-joanna-ian</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2007/10/16/lunch-at-nature-hq-with-euan-joanna-ian.html"><![CDATA[<p>On my way back from the <a href="http://chem-bla-ics.blogspot.com/search/label/Taverna0710">Taverna workshop</a> I visited Nature HQ, as
<a href="http://blogs.nature.com/wp/nascent/2007/10/lunch_with_egon_willighagen.html">Ian reported about on Nascent</a>. It was a (too) short
meeting, but very nice to meet <a href="http://network.nature.com/blogs/user/euan">Euan</a> (finally; he wrote the
<a href="http://postgenomic.com/">postgenomic.com</a> software which I use for <a href="http://cb.openmolecules.net/">Chemical blogspace</a>),
<a href="http://network.nature.com/blogs/user/joannascott">Joanna</a> (whom I met in Chicago already, where she had
<a href="https://chem-bla-ics.linkedchemistry.info/2007/03/26/acs-chicago-day-1.html">two <i class="fa-solid fa-recycle fa-xs"></i></a>
<a href="https://chem-bla-ics.linkedchemistry.info/2007/03/29/acs-chicago-day-3.html">presentations <i class="fa-solid fa-recycle fa-xs"></i></a>,
and is responsible for <a href="http://blogs.nature.com/wp/nascent/2007/08/events_on_second_nature.html">Second Nature</a>),
<a href="http://network.nature.com/blogs/user/U3DF456C6">Ian</a> (who works on <a href="http://connotea.org/">Connotea</a>,
and <a href="http://network.nature.com/blogs/user/U3DF456C6/2007/10/08/molecule-tagging-with-connotea">commented</a> on
<a href="https://chem-bla-ics.linkedchemistry.info/2007/09/19/tagging-molecules-mashup-of-connotea.html">my tagging molecule blog <i class="fa-solid fa-recycle fa-xs"></i></a>)
and <a href="http://network.nature.com/profile/alf">Ålf</a> (who works on <a href="http://scintilla.nature.com/">Scintilla</a>) and
briefly <a href="http://network.nature.com/profile/timo">Timo</a> (who rules them all). BTW, I had a simple but delicious pasta.</p>

<p>First, let me note that if I would have to name a favorite molecule, and it was <a href="http://en.wikipedia.org/wiki/Acetic_acid">acetic acid</a>,
not <a href="http://en.wikipedia.org/wiki/Ascorbic_acid">ascorbic acid</a>. Reason why it would be my favorite is acetic acid was the first
organic molecule I put in the <a href="http://www.woc.science.ru.nl/">Woordenboek Organische Chemie</a> in 1995.</p>

<p>We discussed a number of things, regarding the things we do. One of these was tagging molecules. Ian used
<em>http://rdf.openmolecules.net/?info:inchi/InChI=1/CH4/h1H4</em> instead of <em>http://rdf.openmolecules.net/?InChI=1/CH4/h1H4</em>.
The first was not yet picked up by <a href="http://rdf.openmolecules.net/">rdf.openmolecules.net</a> but I fixed that.</p>

<p>We also discussed linking molecular structures with scientific literature. The discussions in blogspace of this week
show that doing that by using computer programs is not appreciated by publishers (see
<a href="http://www.chemspider.com/blog/?p=204">here</a>,
<a href="https://blogs.ch.cam.ac.uk/pmr/2007/10/13/outrage-repurposing-open-access-material-is-allowed-without-explicit-permission/">here <i class="fa-solid fa-recycle fa-xs"></i></a>,
<a href="https://blogs.ch.cam.ac.uk/pmr/2007/10/16/why-green-open-access-does-not-support-text-and-data-mining/">here <i class="fa-solid fa-recycle fa-xs"></i></a>,
<a href="https://blogs.ch.cam.ac.uk/pmr/2007/10/15/indexing-open-access-and-free-access-articles/">here <i class="fa-solid fa-recycle fa-xs"></i></a>,
<a href="http://www.chemspider.com/open-chemistry-web/?p=4">here</a>, and
<a href="https://blogs.ch.cam.ac.uk/pmr/2007/10/14/opsinoscar-you-us-we-please-help/">here <i class="fa-solid fa-recycle fa-xs"></i></a>)
(The publishers seem to prefer to like to send of a PDF to India or China.)</p>

<p>I proposed that the InChI would be part of the publication, for all molecules mentioned in the article. If a
journal can require exact bibliography and experimental section formats, they can certainly require InChIs too.
There are few programs left which cannot autogenerate an InChI, and the chemists draws the structures anyway.
However, the software used in the editorial process does not support linking InChIs with a PDF (if that software
would have been opensource …).</p>

<p>So, the best current option seems to be social tagging mechanisms, and this is what we talked about. Just use
Connotea (or any other service) and tag your molecule with a DOI:</p>

<p><img src="/assets/images/doiTagDelicious.png" alt="" /></p>

<p>and</p>

<p><img src="/assets/images/connoteaTagDelicious1.png" alt="" /></p>

<p>This tagging is done manually. No machines involved in that. Nothing the publishers can do about this. No ChemRefer needed.
But this will allow us to start building a database with links between papers and molecules, which we badly need. BTW, we will
not have to start from scratch. The <a href="http://www.nmrshiftdb.org/">NMRShiftDB</a> already contains many links, which is open data!</p>

<p>Now, you might notice the informal semantics of the <code class="language-plaintext highlighter-rouge">doi:</code> prefix. That’s something I hereby propose, as it allow
services to pick up the content more easily. You might also note the <em>incorrect</em> DOI in Connotea. The reason for
that is that Connotea does not yet support a ‘/’ in a tag. I
<a href="http://sourceforge.net/tracker/index.php?func=detail&amp;aid=1814491&amp;group_id=133040&amp;atid=726030">reported that problem</a>.</p>]]></content><author><name>Egon Willighagen</name></author><category term="connotea" /><category term="chemistry" /><category term="inchi" /><category term="doi" /><category term="nmrshiftdb" /><summary type="html"><![CDATA[On my way back from the Taverna workshop I visited Nature HQ, as Ian reported about on Nascent. It was a (too) short meeting, but very nice to meet Euan (finally; he wrote the postgenomic.com software which I use for Chemical blogspace), Joanna (whom I met in Chicago already, where she had two presentations , and is responsible for Second Nature), Ian (who works on Connotea, and commented on my tagging molecule blog ) and Ålf (who works on Scintilla) and briefly Timo (who rules them all). BTW, I had a simple but delicious pasta.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://chem-bla-ics.linkedchemistry.info/assets/images/doiTagDelicious.png" /><media:content medium="image" url="https://chem-bla-ics.linkedchemistry.info/assets/images/doiTagDelicious.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Tagging Molecules: a mashup of Connotea and RDF</title><link href="https://chem-bla-ics.linkedchemistry.info/2007/09/19/tagging-molecules-mashup-of-connotea.html" rel="alternate" type="text/html" title="Tagging Molecules: a mashup of Connotea and RDF" /><published>2007-09-19T00:00:00+00:00</published><updated>2007-09-19T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2007/09/19/tagging-molecules-mashup-of-connotea</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2007/09/19/tagging-molecules-mashup-of-connotea.html"><![CDATA[<p>Using the InChI and the new <a href="https://chem-bla-ics.linkedchemistry.info/2007/07/31/rdf-ing-molecular-space.html">rdf.openmolecules.net <i class="fa-solid fa-recycle fa-xs"></i></a>
website, it is now possible to tag molecules. And if you use <a href="http://www.connotea.org/">Connotea</a> for that, your tags will
even show up on the rdf.openmolecules.net website. For example, at the time of writing,
<a href="http://en.wikipedia.org/wiki/Methane">methane</a> was tagged with <em>alkanes</em> and <em>gas</em>.</p>

<p>The trick I use, is that the rdf.openmolecules.net gives every molecule a unique HTTP <a href="http://en.wikipedia.org/wiki/URL">URL</a>.
This simply web2.0 approach offers an enormous amount of possibilities. The simplest application is that you can tag your molecules
with a set label, such as <em>my-tomato-set</em>; after all, Connotea is account based. In this way, you can do open notebook QSAR studies (though the activities would still be missing).</p>

<p>The aforementioned example, however, give two classifications. <em>Methane is an alkane</em> and <em>Methane is a gas</em> (at room temperature).
Not very well determined semantics, but it is web2.0, not the semantic web.</p>

<p>Interestingly, given some loosely defined semantics, use Connotea to link a molecule to a certain publication. For example,
I can define <em><a href="http://en.wikipedia.org/wiki/Estrone">Estrone</a> is cited in the article with PubMed ID 15659855</em> using the
tag <em>pmid:15659855</em>. I’m sure using the DOI would work too, using the tag <a href="https://doi.org/10.1107/S0108768104028344">doi:10.1107/S0108768104028344</a>.
I have not used these informal semantics in the rdf.openmolecules.net website yet, but if there is such interest,
I can have such functionality hacked in minutes.</p>

<p>BTW, did anyone see <a href="http://www.geneontology.org/">Gene Ontology</a> terms being used in social bookmarking services?
For example, seeing a link to the PDB database with a tag <em>go:0008152</em>? Would be a bit cryptic, and, really, in this
case rather minimalistic on information.</p>

<h2 id="whats-next">What’s next?</h2>

<p>Now comes the tedious task of converting the QSAR data sets I used in my PhD research with these tags. It’s really
something I wanted to do for a while now. Next on my <a href="http://en.wikipedia.org/wiki/TODO">TODO</a> list is the
Greasemonkey script that adds the tags from Connotea to PubChem.</p>]]></content><author><name>Egon Willighagen</name></author><category term="rdf" /><category term="connotea" /><category term="inchi" /><category term="justdoi:10.1107/S0108768104028344" /><summary type="html"><![CDATA[Using the InChI and the new rdf.openmolecules.net website, it is now possible to tag molecules. And if you use Connotea for that, your tags will even show up on the rdf.openmolecules.net website. For example, at the time of writing, methane was tagged with alkanes and gas.]]></summary></entry><entry><title type="html">Preprint servers: the CPS failed, how will Nature Precedings do?</title><link href="https://chem-bla-ics.linkedchemistry.info/2007/06/09/preprint-servers-cps-failed-how-will.html" rel="alternate" type="text/html" title="Preprint servers: the CPS failed, how will Nature Precedings do?" /><published>2007-06-09T00:00:00+00:00</published><updated>2007-06-09T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2007/06/09/preprint-servers-cps-failed-how-will</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2007/06/09/preprint-servers-cps-failed-how-will.html"><![CDATA[<p>Some 7 years ago, following successes in physics, <a href="http://chemweb.com/">ChemWeb.com</a>
<a href="http://www.prnewswire.co.uk/cgi/news/release?id=10870">launched the Chemistry Preprint Server (CPS)</a>,
and <a href="https://doi.org/10.1021/ci025627a">Warr evaluated</a> it in a JCIM article three years later.
She wrote about ‘lessons learned’, but the only one seemed to have been that chemistry was not
ready for it, as <a href="http://www.iucr.org/iucr-top/lists/epc-l/msg00790.html">the project shutdown in 2004</a>.
The <a href="http://www.sciencedirect.com/preprintarchive?url=/CPS">archives are still available</a>,
fortunately, and you may find it amusing to look up my or some other submission.</p>

<p>Now, <a href="http://blogs.nature.com/wp/nascent/2007/06/coming_soon_nature_precedings.html">Nascent wrote that Nature is setting up</a>
<a href="http://precedings.nature.com/">Nature Precedings</a>, which was earlier
<a href="http://pbeltrao.blogspot.com/2007/06/nature-preceedings-pre-print-server-for.html">noted by Pedro</a>.
The <a href="https://doi.org/10.1038/447614a">official announcement</a> was published as an editorial in
Nature. This being a Nature initiative, and not focused on just chemistry, I am sure it will do
better than CPS. BTW, media coverage is <a href="http://www.connotea.org/user/timo/tag/Precedings">tracked in a social way</a>.</p>

<p>I might <a href="http://network.nature.com/groups/bioinformatics/notice/2007/06/08/nature-precedings-contributors-wanted">request an test account</a>;
I do have an old half-finished manuscript that I never got around to finishing. While still relevant,
it could use some community input; this preprint server would be the perfect tool. That’s how my first
manuscript ended up on CPS too :)</p>]]></content><author><name>Egon Willighagen</name></author><category term="publishing" /><category term="justdoi:10.1021/ci025627a" /><category term="justdoi:10.1038/447614a" /><category term="nature" /><category term="connotea" /><summary type="html"><![CDATA[Some 7 years ago, following successes in physics, ChemWeb.com launched the Chemistry Preprint Server (CPS), and Warr evaluated it in a JCIM article three years later. She wrote about ‘lessons learned’, but the only one seemed to have been that chemistry was not ready for it, as the project shutdown in 2004. The archives are still available, fortunately, and you may find it amusing to look up my or some other submission.]]></summary></entry><entry><title type="html">Scientific Literature: searching, ranking, storage</title><link href="https://chem-bla-ics.linkedchemistry.info/2007/06/08/scientific-literature-searching-ranking.html" rel="alternate" type="text/html" title="Scientific Literature: searching, ranking, storage" /><published>2007-06-08T00:00:00+00:00</published><updated>2007-06-08T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2007/06/08/scientific-literature-searching-ranking</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2007/06/08/scientific-literature-searching-ranking.html"><![CDATA[<p>Dealing with scientific literature has been one important theme in <a href="http://wiki.cubic.uni-koeln.de/cb/">Chemical blogspace</a>.
For example, ranking articles and how to store your personal PDF archive has been topics of discussion. In this blog I will
summarize bits of the discussion, and my personal view on things.</p>

<h2 id="searching">Searching</h2>

<p>Searching literature is traditionally done in systems like Chemical Abstracts and Web-of-Science. The open nature of a
growing number of repositories (e.g. the Dutch <a href="http://www.darenet.nl/en/page/language.view/search.page">DARE</a>) and
indexing facilities like <a href="http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed">PubMed</a> make these proprietary tools
obsolete.</p>

<p>It is incorrect to assume that these payed services are the only trustworthy sources. Even WoS fails to make the all
links between entries in the database. For example, I am aware of two missing citations to articles I have written,
even though both the cited and the citing article is available in the system. One of the citing articles was in the
<a href="http://www3.interscience.wiley.com/cgi-bin/jhome/26737?CRETRY=1&amp;SRETRY=0">Angewandte Chemie</a>!</p>

<p>Additionally, some search services, like <a href="http://scholar.google.com/">Google Scholar</a>, have the advantage that they
find copies and close variants of articles in proprietary articles on home pages and in open repositories. Today,
I learned about <a href="http://en.scientificcommons.org/">Scientific Commons</a> which indexes and links to a staggering
1.5M publications, using, among others, PubMed and university repositories. Where possible it makes direct links
to PDF versions of the article.</p>

<h2 id="ranking">Ranking</h2>

<p><a href="http://www.chemicalforums.com/index.php?topic=17653.msg67580#msg67580">Mitch set up</a> <a href="http://chemrank.com/">ChemRank</a>,
to which <a href="https://blogs.ch.cam.ac.uk/pmr/2007/05/30/ranking-chemistry-and-blogosphere-metrics/">Peter <i class="fa-solid fa-recycle fa-xs"></i></a>, the <a href="http://www.thechemblog.com/?p=552">ChemBlog</a>
and <a href="https://chem-bla-ics.linkedchemistry.info/2007/05/30/chemrank-ranking-scientific-literature.html">I replied <i class="fa-solid fa-recycle fa-xs"></i></a>. Afterwards,
I learned that other services are available too, that allow, in addition to setting up an online personal literature
database, voting and commenting on articles.</p>

<p>Apparently, <a href="http://www.citeulike.org/">CiteULike</a> (CUL) supports this too. In contrast to ChemRank, CUL requires
a login, which I personally see as an advantage, because I can browse literature bookmarked by other accounts I trust.
There is also <a href="http://www.connotea.org/">Connotea</a> but I never liked that site that much (e.g. is allows bookmarking
any web page); <a href="https://doi.org/10.59350/6zgf4-2wb06">Rich has his comments too <i class="fa-solid fa-recycle fa-xs"></i></a>.
I would also like to mention <a href="http://www.biowizard.com/">BioWizard</a> which is based on the PubMed content, which actually
covers a good deal of chemistry literature nowadays too.</p>

<h2 id="local-storage">Local Storage</h2>

<p>These above mentioned systems can be used as alternative to offline bibliographic database systems, like EndNote and
<a href="http://jabref.sf.net/">JabRef</a>. The latter is my favorite, being based on BibTeX which I use for my LaTeX based
publications, and is opensource and contains <a href="http://www.ohloh.net/accounts/2934/contributions/557">a few patches</a>
from yours truly. Jungfreudlich wondered <a href="http://www.jungfreudlich.de/2007/05/20/how-are-your-paper-files-organized/">how people organized their PDF archive</a>
and <a href="http://www.jungfreudlich.de/2007/05/20/how-are-your-paper-files-organized/#comment-3199">I commented how I do it</a>:</p>

<ul>
  <li>a directory hierarchy based on journal name and year</li>
  <li>file names that include last name of the first author and year</li>
  <li>JabRef for the bibiographic database</li>
  <li><a href="http://strigi.sf.net/">Strigi</a> for full text search</li>
</ul>

<p><a href="http://miningdrugs.blogspot.com/2007/05/literature-management.html">Jörg</a> and
<a href="http://www.thepowerofgoo.net/2007/05/20/organizing-pdfs-papers/">the power of goo</a> replied too.</p>

<h2 id="mashups">Mashups</h2>

<p>I have accounts on several online tools now (with some duplication which I don’t like), and I have no idea which of
the options will stay around. Time will learn. Good news is that the open characters of many of these allow making
mashups, and generally integrate tools. For example, JabRef allows downloading citations from PubMed, and Noel
<a href="http://baoilleach.blogspot.com/2007/05/supporting-information-available-as.html">suggested to use Greasemonkey scripts to link to the supplementary information for his articles</a>,
instead of using the mechanisms journals have. I can see the advantage of this, as, for example,
<a href="https://blogs.ch.cam.ac.uk/pmr/2007/05/09/access-to-and-re-use-of-open-data-in-chemistry-impressions/">Wiley takes full copyright of the data in SI material <i class="fa-solid fa-recycle fa-xs"></i></a>,
while Noel’s mechanism would keep the data open.</p>

<p>For now, however, I would very much like to see a meta service where I can query rankings and comment for
articles using any or all of the above tools.</p>]]></content><author><name>Egon Willighagen</name></author><category term="citeulike" /><category term="publishing" /><category term="chemistry" /><category term="justdoi:10.59350/6zgf4-2wb06" /><category term="connotea" /><summary type="html"><![CDATA[Dealing with scientific literature has been one important theme in Chemical blogspace. For example, ranking articles and how to store your personal PDF archive has been topics of discussion. In this blog I will summarize bits of the discussion, and my personal view on things.]]></summary></entry><entry><title type="html">Nature Network v2: cannot create a new group</title><link href="https://chem-bla-ics.linkedchemistry.info/2007/02/23/nature-network-v2-cannot-create-new.html" rel="alternate" type="text/html" title="Nature Network v2: cannot create a new group" /><published>2007-02-23T00:00:00+00:00</published><updated>2007-02-23T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2007/02/23/nature-network-v2-cannot-create-new</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2007/02/23/nature-network-v2-cannot-create-new.html"><![CDATA[<p><a href="http://blogs.nature.com/wp/nascent/2007/02/nature_network_v2_is_live.html">Nascent</a> reported that
<a href="http://network.nature.com/">Nature Network v2</a> has gone life. Never too anxious to try something new,
I created an account and signed in. I even joined two groups: <em>Bioinformatics</em> and <em>Semantic Web for the Life Sciences</em>.</p>

<p>But, when I tried to create a new group, the system fails. I promised me to send me email for confirmation.
Tried it twice via my <a href="http://www.sf.net/">Sourceforge</a> email account. No email. I then changed my email
for my Nature account to my Gmail address. Still no email…</p>

<p>I am not located in Boston or London, is that the problem? Is being ‘global’ not good enough? Is the requirement
to have two ‘o’s in the name? Cologne then, maybe?</p>

<h2 id="missing-features">(Missing) Features</h2>

<p>For the rest, the system seems interesting. I am not too fond of having to create accounts all over the place
(<em>what was the password again???</em>), but looks promising. The thing I missed most when filling out my profile
was a feature to import the list of my publications from <a href="http://www.connotea.org/">Connotea</a>.</p>

<p>Another thing I missed, was the ability to mention my blog(s) in my profile. May I put this in as request too?
BTW, is there a group or forum on Nature Network where I can file these things?</p>]]></content><author><name>Egon Willighagen</name></author><category term="nature" /><category term="connotea" /><summary type="html"><![CDATA[Nascent reported that Nature Network v2 has gone life. Never too anxious to try something new, I created an account and signed in. I even joined two groups: Bioinformatics and Semantic Web for the Life Sciences.]]></summary></entry><entry><title type="html">KDE desktop search: Kat, Strigi and Tenor</title><link href="https://chem-bla-ics.linkedchemistry.info/2006/06/17/kde-desktop-search-kat-strigi-and.html" rel="alternate" type="text/html" title="KDE desktop search: Kat, Strigi and Tenor" /><published>2006-06-17T00:00:00+00:00</published><updated>2006-06-17T00:00:00+00:00</updated><id>https://chem-bla-ics.linkedchemistry.info/2006/06/17/kde-desktop-search-kat-strigi-and</id><content type="html" xml:base="https://chem-bla-ics.linkedchemistry.info/2006/06/17/kde-desktop-search-kat-strigi-and.html"><![CDATA[<p>Desktop searching has become a hot topic (some <a href="https://chem-bla-ics.linkedchemistry.info/2006/05/26/molecular-indexing-on-kde-and-osx.html">earlier <i class="fa-solid fa-recycle fa-xs"></i></a>
<a href="https://chem-bla-ics.linkedchemistry.info/2005/11/07/ubuntu-dapper-will-include-chemistry.html">blogs <i class="fa-solid fa-recycle fa-xs"></i></a>), now that years of data accumulated on ones
hard disk: PDFs, OpenOffice.org documents, Latex manuscripts, old Java source code, digitized music, and a lot of chemical files. Well,
on my hard disk that is. Unlike piles of paper, a computer could search this data, but due to the size an index is required. What’s KDE4
going to offer?</p>

<p>For the <a href="http://www.kde.org/">KDE</a> desktop <a href="http://kat.mandriva.com/">Kat</a> has for more than a year offered this, and latter
<a href="http://www.kde-apps.org/content/show.php?content=36832">Kerry</a> came along as frontend to [Beagle(http://beaglewiki.org/Main_Page)],
though this does not have the nice integration with KDE <a href="http://developer.kde.org/documentation/tutorials/kfile-plugin/t1.html">kfile plugins</a>.
Since then, Kat developed has come to a stop (unfortunately), and attempts to reach the main author
(<a href="mailto:roberto.cappuccio@gmail.com">Roberto</a>) have been unsuccesfull. Last thing happening was a rewrite of the database backend.</p>

<p>Additionally, <a href="http://dot.kde.org/1109163846/">Scott Wheeler proposed Tenor</a> on <a href="http://www.fosdem.org/">FOSDEM</a> 2005:
<em>“KDE 4: Beyond Hierarchical Data, The Desktop as a Searchable Web of Context”</em>. A semantic desktop; potentially cool, but I have heard
<a href="http://www.kdedevelopers.org/blog/72?from=10">little from it lately</a>, except for some rumours that
<a href="http://mail.kde.org/pipermail/klink/2006-April/000133.html">Scott has some actual code at home</a>.</p>

<p>Now, <a href="http://www.vandenoever.info/software/strigi/">Strigi</a> (<a href="http://www.kde-look.org/content/show.php?content=40889">download</a>) has come along,
with a fast indexing engine, just the thing where the Kat developed seemed to have stopped. The design is different from that of Kat, but it
does not seem unlikely that Kat code can be ported. No support for PDF or OpenOffice.org documents yet, but that’s really the easy part, and
kfile is on its way.</p>

<p>Getting back to Tenor, one might wonder how Strigi could implement Tenor concepts. A simple approach is at least to allow users to tag files,
just like we have become used to with blogs (e.g. <a href="http://www.technorati.com/">Technorati.com</a>) and websites (e.g.
<a href="http://www.connotea.org/">Connotea</a>). This could be easily implemented using <a href="http://wiki.linuxquestions.org/wiki/Extended_attributes">extended attributes</a>
(xattr), <a href="https://chem-bla-ics.linkedchemistry.info/2006/05/26/molecular-indexing-on-kde-and-osx.html">already used by Beagle <i class="fa-solid fa-recycle fa-xs"></i></a>:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code># file: home/egonw/1CRN.jpg
user.Tenor.Keywords="crambin"
user.Tenor.Comment="Used in my ontologies presentation."
</code></pre></div></div>

<p>Obviously, this example shows not just these tags, but a user comment too. The idea, here, is that Strigi mines these attributes in
addition to the file itself, so that search on tags can be done too. BTW, my argument to use this, instead of putting these things
in the Strigi database itself, is persistence: data and metadata are kept together. KDE’s file properties dialog would be extended
with an extra tab that allows editing these fields.</p>

<p>Strigi itself can be embedded in KDE applications to search specific information (e.g. search molecular data within
<a href="http://cniehaus.livejournal.com/23010.html">Kalzium</a> using the <a href="http://www.iupac.org/inchi/">InChI</a>), and even in the FileOpen dialog.
We need patches for KDE4 that allows this, soon.</p>]]></content><author><name>Egon Willighagen</name></author><category term="kde" /><category term="strigi" /><category term="kalzium" /><category term="linux" /><category term="technorati" /><category term="connotea" /><summary type="html"><![CDATA[Desktop searching has become a hot topic (some earlier blogs ), now that years of data accumulated on ones hard disk: PDFs, OpenOffice.org documents, Latex manuscripts, old Java source code, digitized music, and a lot of chemical files. Well, on my hard disk that is. Unlike piles of paper, a computer could search this data, but due to the size an index is required. What’s KDE4 going to offer?]]></summary></entry></feed>