Researchers need to adapt their institutions and practices in response to torrents of new data and need to complement smart science with smart searching.
Online repository of large data sets for researchers in knowledge discovery and data mining. includes Discrete Sequence Data, Image Data, Multivariate Data, Relational Data, Spatio-Temporal Data, Text (corpora), Time Series, Web Data (web pages and log files).
Lifecycles for Information Integration in Distributed Scholarly Communication. The Pathways project will develop broadly applicable models and protocols to support a loosely-coupled, highly distributed, interoperable scholarly communication system. A graph-based information model will provide a layer of abstraction over heterogeneous resources (data, content, and services).
a quick list of companies working to aggregate, distribute, and market data via the Internet. While there are definitely differences in their goals/business models, each of these organizations has an interest in warehousing semi-public data sources for general consumption.
A Report on the Experiences of First Respondents to the Digging Into Data Challenge by Christa Williford and Charles Henry Research Design by Amy Friedlander
N. Gray, T. Carozzi, and G. Woan. (2012)cite arxiv:1207.3923 Comment: Project final report, 45 pages: see http://purl.org/nxg/projects/mrd-gw for project details, and http://purl.org/nxg/projects/mrd-gw/report for other document versions.