Abstract
Since 2009 the Royal Society of Chemistry (RSC) has been delivering access to chemistry data and cheminformatics tools via the ChemSpider database and has garnered a significant community following in terms of usage and contribution to the platform. ChemSpider has focused only on those chemical entities that can be represented as molecular connection tables or, to be more specific, the ability to generate an InChI from the input structure. As a structure centric hub ChemSpider is built around the molecular structure with other data and links being associated with this structure. As a result the platform has been limited in terms of the types of data that can be managed, and the flexibility of its searches, and it is constrained by the data model. New technologies and approaches, specifically taking into account a shift from relational to NoSQL databases, and the growing importance of the semantic web, has motivated RSC to rearchitect and create a more generic data repository utilizing these new technologies. This article will provide an overview of our activities in delivering data sharing platforms for the chemistry community including the development of the new data repository expanding into more extensive domains of chemistry data.






Similar content being viewed by others
References
Pence H, Williams AJ (2010) J Chem Educ 87(11):1123
Williams AJ (2011) Public compound databases—how ChemSpider changed the rules making molecules on the web free. In: Ekins S, Hupcey MAZ, Williams AJ (eds) Collaborative computational technologies for the life sciences. Wiley, Hoboken, p 363
Williams AJ (2010) ChemSpider: integrating structure-based resources distributed across the internet. In: Belford R, Moore JW, Pence HE (eds) Enhancing learning with online resources, social networking, and digital libraries, vol 1060. American Chemical Society, Washington, p 23
The IUPAC International Chemical Identifier (InChI). http://www.iupac.org/inchi/. Accessed 16 April 2014
PubMed. http://www.ncbi.nlm.nih.gov/pubmed/. Accessed 16 April 2014
Google Scholar. http://scholar.google.com/. Accessed 16 April 2014
Google Patents. http://www.google.com/patents. Accessed 16 April 2014
Published JCAMP-DX Protocols. http://www.jcamp-dx.org/protocols.html. Accessed 16 April 2014
Bradley JC, Lancashire RJ, Lang AS, Williams AJ (2009) J Cheminform 1(1):9
Williams AJ, Harland L, Groth P, Pettifer S, Chichester C, Willighagen EL, Evelo CT, Blomberg N, Ecker G, Goble C, Mons B (2012) Drug Discov Today 17(21–22):1188
Hunter AJ (2008) Drug Discov Today 13(9–10):371
PharmaSea. http://www.pharma-sea.eu/. Accessed 16 April 2014
Chemical Database Service. http://cds.rsc.org. Accessed 16 April 2014
ChemSpider Synthetic Pages. http://cssp.chemspider.com. Accessed 16 April 2014
BaseX: The XML Database. http://www.basex.org. Accessed 16 April 2014
MongoDB. http://www.mongodb.org/. Accessed 16 April 2014
Acknowledgments
ChemSpider is the result of the aggregate work of many contributors extending outside of our own team. Our RSC platforms are supported by a dedicated team of IT specialists. The authors acknowledge the support of the Open Source community, the commercial software vendors (specifically Accelrys, ACD/Labs, GGA Software, OpenEye Scientific Software, Dotmatics and many data providers, curators and users for their contributions to the development of the data content in terms of breadth and quality.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Williams, A., Tkachenko, V. The Royal Society of Chemistry and the delivery of chemistry data repositories for the community. J Comput Aided Mol Des 28, 1023–1030 (2014). https://doi.org/10.1007/s10822-014-9784-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10822-014-9784-5