• Open Data: the Panton Principles

    The announcement of the Panton Principles is the big news today, though Peter already spoke about them in May last year (see coverage on FriendFeed and Twitter). The four principles list in their short versions:
  • Citing the Chemistry Development Kit

    Two weeks ago, a paper by Peter Ertl was published about Molecular structure input on the web (doi:10.1186/1758-2946-2-1). In this paper, he discusses the state of things and describes his contribution to this field, the JME Molecule Editor. The article also cites the CDK, but only the website and not one of the two papers (doi:10.1021/ci025584y, or doi:10.2174/138161206777585274). This is not an isolated case, but a common pattern. In principle, the proper work is cited, and nothing is wrong. Practically it means, that a citation to the CDK website does not show up in the citation network. This is not a problem caused by these papers, but merely by the nature current citation databases work: they only count citations between journal articles, and only sometimes extend to books or conference abstracts.
  • ChEMBL RDF #1:SPARQL end point

    In a series of SPARQL end points , I am happy to present a new Virtuoso 6.1-hosted SPARQL end point for the ChEMBL database (CC-BY-SA), at our groups new rdf.farmbio.uu.se server. The server is hosting 23.8M triples, with the data based on ChEMBL 02. There is a SPARQL end point, as well as a SNORQL interface:
  • RDF, Jena, Bioclipse, Eclipse, Zest: Mashups

    Quite a while a go, I blogged about Zest in Bioclipse showing a bit of ONS Solubility data . I could not follow up on that until now, as I had yet to do a lot of RDF work in Bioclipse, so the screenshot back then was kind of a mockup.