Going in, we had three basic requirements: (1) the system must be callable from a Rails-based web app; (2) it must return responses to typical requests from the web app in real time (30 seconds was our target); and (3) it must query a dataset that was initially estimated at around 45 TB but later came down to around 100 GB.
DataFlow is creating a two-stage data management infrastructure that makes it easy for you and your research group to work with, annotate, publish, and permanently store your research data. You manage your data locally using your own instance of DataStage, while your institution can easily deploy DataBank to preserve and publish your most valuable datasets. Published datasets are assigned DOIs to make them citable and to gain you academic credit.
Several major research funders now require that researchers include data management plans (DMPs) in their requests for funding. Use the templates on this page to help create DMPs for your proposals. The templates contain suggested items to consider; not all questions will be appropriate or relevant to all projects. Templates marked with an asterisk were adapted from templates created by the University of Virginia Libraries Scientific Data Consulting Group.
The DCC has developed a suite of tools to help UK HEI researchers and research support staff better understand their particular data management and curation needs, assess current activity and infrastructure, and plan for improvement.