...research already in progress is opened up to allow labs anywhere in the world to contribute experiments. The deeply networked nature of modern laboratories, and the brief down-time that all labs have between projects, make this concept quite feasible.
I am currently sitting at the dining table of Peter Murray-Rust with Egon Willighagen opposite me talking to Jean-Claude Bradley. We pulling together sets of data from Jean-Claude’s UsefulChem project into CML to make it more semantically rich and do a bunch of cool stuff. Jean-Claude has a recently published preprint on Nature Precedings of a paper that has been submitted to JoVE. Egon was able to grab the InChiKeys from the relevant UsefulChem pages and passing those to CDK via a script that he wrote on the spot (which he has also just blogged) generated CML react for those molecules.
OpenRefine (formerly Google Refine) is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
"Open Data Excuse" Bingo
We might want to use it in a paper People may misinterpret the data Thieves will use it There's no API
I don't mind, but someone else might Lawyers want a custom License It's too complicated We will get too many enquiries
It's too big Terrorists will use it Poor Quality There's already a project to...
What if we want to sell it later It's not very interesting Data Protection We'll get spam
For open data teams; print out a copy and put it on your office wall. Cross out each excuse people give you. There are no prizes, but you can tweet "bingo! #openDataExcuses" if you think it might make you feel better
Generate your own bingo grids at http://data.dev8d.org/devbingo/
T. Velden. Proceedings of the 2013 conference on Computer supported cooperative work, page 445--458. New York, NY, USA, ACM, (2013)PDF available at http://www.cs.cornell.edu/~tvelden/pubs/2013-cscw_preprint.pdf.