opendata Archives - Henry Rzepa's Blog Henry Rzepa's Blog

Posts Tagged ‘opendata’

A two-publisher model for the scientific article: narrative+shared data.

Sunday, September 15th, 2013

I do go on rather a lot about enabling or hyper-activating[1] data. So do others[2]. Why is sharing data important?

References

O. Casher, G.K. Chandramohan, M.J. Hargreaves, C. Leach, P. Murray-Rust, H.S. Rzepa, R. Sayle, and B.J. Whitaker, "Hyperactive molecules and the World-Wide-Web information system", Journal of the Chemical Society, Perkin Transactions 2, pp. 7, 1995. https://doi.org/10.1039/p29950000007
R. Van Noorden, "Data-sharing: Everything on display", Nature, vol. 500, pp. 243-245, 2013. https://doi.org/10.1038/nj7461-243a

Tags:chemical tagger, data mining, datument, David Scheschkewitz, e-notebook, Google, opendata, Peter Murray-Rust, pre-processor, researcher, scientific tool, supervisor, United Kingdom
Posted in Chemical IT, Interesting chemistry | 9 Comments »

The Amsterdam Manifesto on Data Citation Principles

Wednesday, July 31st, 2013

The Amsterdam manifesto espouses the principles of citable open data. It is a short document, and it is worth re-stating its eight points here:

(more…)

Tags:Amsterdam, exposed algorithm, opendata, rotatable 3D model, using Jmol
Posted in Chemical IT | 3 Comments »

150,000,000 DFT calculations on 2,300,000 compounds!

Friday, July 5th, 2013

The title of this post summarises the contents of a new molecular database: www.molecularspace.org[1] and I picked up on it by following the post by Jan Jensen at www.compchemhighlights.org (a wonderful overlay journal that tracks recent interesting articles). The molecularspace project more formally is called “The Harvard Clean Energy Project: Large-scale computational screening and design of organic photovoltaics on the world community grid“. It reminds of a 2005 project by Peter Murray-Rust et al at the same sort of concept[2] (the World-Wide-Molecular-Matrix, or WWMM[3]), although the new scale is certainly impressive. Here I report my initial experiences looking through molecularspace.org

(more…)

References

J. Hachmann, R. Olivares-Amaya, S. Atahan-Evrenk, C. Amador-Bedolla, R.S. Sánchez-Carrera, A. Gold-Parker, L. Vogt, A.M. Brockway, and A. Aspuru-Guzik, "The Harvard Clean Energy Project: Large-Scale Computational Screening and Design of Organic Photovoltaics on the World Community Grid", The Journal of Physical Chemistry Letters, vol. 2, pp. 2241-2251, 2011. https://doi.org/10.1021/jz200866s
P. Murray-Rust, H.S. Rzepa, J.J.P. Stewart, and Y. Zhang, "A global resource for computational chemistry", Journal of Molecular Modeling, vol. 11, pp. 532-541, 2005. https://doi.org/10.1007/s00894-005-0278-1
P. Murray-Rust, S.E. Adams, J. Downing, J.A. Townsend, and Y. Zhang, "The semantic architecture of the World-Wide Molecular Matrix (WWMM)", Journal of Cheminformatics, vol. 3, 2011. https://doi.org/10.1186/1758-2946-3-42

Tags:energy gap, energy levels, Google, Harvard, Jan Jensen, molecularspace site, opendata, Peter Murray-Rust, software agent acting, www.compchemhighlights.org, www.molecularspace.org
Posted in Chemical IT | 7 Comments »

Research data and the “h-index”.

Monday, June 24th, 2013

The blog post by Rich Apodaca entitled “The Horrifying Future of Scientific Communication” is very thought provoking and well worth reading. He takes us through disruptive innovation, and how it might impact upon how scientists communicate their knowledge. One solution floated for us to ponder is that “supporting Information, combined with data mining tools, could eliminate most of the need for manuscripts in the first place“. I am going to juxtapose that suggestion on something else I recently discovered.

(more…)

Tags:data mining, data mining tools, Google, opendata, researcher
Posted in Chemical IT | 2 Comments »

What can chemistry learn from photos?

Sunday, June 2nd, 2013

A few years ago, we published an article which drew a formal analogy between chemistry and iTunes (sic)[1]. iTunes was the first really large commercial digital music library, and a feature under-the-skin was the use of meta-data to aid discoverability of any of the 10 million (26M in 2013) or so individual items in the store.^‡ The analogy to digital chemistry and discoverability of the 70 or so million known molecules is, we argued, a good one.

(more…)

References

O. Casher, and H.S. Rzepa, "SemanticEye: A Semantic Web Application to Rationalize and Enhance Chemical Electronic Publishing", Journal of Chemical Information and Modeling, vol. 46, pp. 2396-2411, 2006. https://doi.org/10.1021/ci060139e

Tags:Apple, BBC, digital photography, engineer, Google, Historical, HTML, metadata, opendata, RDF, search term, Steve Bachrach, United Kingdom
Posted in Chemical IT | No Comments »

The demographics of a blog readership.

Sunday, January 20th, 2013

With metrics in science publishing controversial to say the least, I pondered whether to write about the impact/influence a science-based blog might have (never mind whether it constitutes any measure of esteem). These are all terms that feature large when an (academic) organisation undertakes a survey of its researchers’ effectiveness.^‡ WordPress (the organisation that provides the software used for this blog) recently enhanced the stats it offers for its users, and one of these caught my eye.

(more…)

Tags:manager, opendata
Posted in General | 3 Comments »

Digital repositories. An update to the update.

Monday, August 13th, 2012

A third digital repository has been added to the two I described before. Chempound is a free open-source repository which (unlike DSpace and Figshare) was developed specifically for chemistry.

(more…)

Tags:opendata, Skolnik
Posted in Chemical IT | 2 Comments »

Digital repositories. An update.

Saturday, July 21st, 2012

I blogged about this two years ago and thought a brief update might be in order now. To support the discussions here, I often perform calculations, and most of these are then deposited into a DSpace digital repository, along with metadata. Anyone wishing to have the full details of any calculation can retrieve these from the repository. Now in 2012, such repositories are more important than ever.

(more…)

Tags:API, Chemspider, computational chemistry, Digital respository, Imperial College, InChI Key, Mark Hahnel, Matt Harvey, opendata, pubchem, QRCode, Skolnik, United Kingdom, wikipedia
Posted in Chemical IT | 1 Comment »

Science publishers (and authors) please take note.

Monday, October 24th, 2011

I have for perhaps the last 25 years been urging publishers to recognise how science publishing could and should change. My latest thoughts are published in an article entitled “The past, present and future of Scientific discourse” (DOI: 10.1186/1758-2946-3-46). Here I take two articles, one published 58 years ago and one published last year, and attempt to reinvent some aspects. You can see the result for yourself (since this journal is laudably open access, and you will not need a subscription). The article is part of a special issue, arising from a one day symposium held in January 2011 entitled “Visions of a Semantic Molecular Future” in celebration of Peter Murray-Rust’s contributions over that period (go read all 15 articles on that theme in fact!).

(more…)

Tags:Acrobat, Amazon, Android, chemical structure drawing packages, e-books, HTML, HTML5, iPad, iPads, Java, KF8, Kindle, mobile devices, opendata, Peter Murray-Rust, printing costs, SVG, Vector Graphics
Posted in Chemical IT, General | 3 Comments »

(re)Use of data from chemical journals.

Wednesday, December 22nd, 2010

If you visit this blog you will see a scientific discourse in action. One of the commentators there notes how they would like to access some data made available in a journal article via the (still quite rare) format of an interactive table, but they are not familiar with how to handle that kind of data (file). The topic in question deals with various kinds of (chemical) data, including crystallographic information, computational modelling, and spectroscopic parameters. It could potentially deal with much more. It is indeed difficult for any one chemist to be familiar with how data is handled in such diverse areas. So I thought I would put up a short tutorial/illustration in this post of how one might go about extracting and re-using data from this one particular source.

(more…)

Tags:chemical, chemical journals, chemist, opendata, RDF, semantic web, software tools, suitable processing programs, XML
Posted in Chemical IT, Interesting chemistry | 7 Comments »

Henry Rzepa's Blog

Posts Tagged ‘opendata’

A two-publisher model for the scientific article: narrative+shared data.

References

The Amsterdam Manifesto on Data Citation Principles

150,000,000 DFT calculations on 2,300,000 compounds!

References

Research data and the “h-index”.

What can chemistry learn from photos?

References

The demographics of a blog readership.

Digital repositories. An update to the update.

Digital repositories. An update.

Science publishers (and authors) please take note.

(re)Use of data from chemical journals.

Recent Posts

Recent Comments

Archive

Blogroll

Contributors

Previous posts

Categories

Meta