The burgeoning interest in R demonstrates that there’s demand for analytics to solve real, business-critical problems in a broad spectrum of companies and roles, and that some of the incumbent analytics offerings, in particular SAS and SPSS, don’t sufficiently meet the growing need for analytics in many major companies. Annotated link http://www.diigo.com/bookmark/http%3A%2F%2Fspotfire.tibco.com%2Fcommunity%2Fblogs%2Fenterpriseanalytics%2Farchive%2F2009%2F01%2F08%2Fanalytics-in-the-nyt.aspx
Datasets which are identical over a number of statistical properties, yet produce dissimilar graphs, are frequently used to illustrate the importance of graphical representations when exploring data. This paper presents a novel method for generating such datasets, along with several examples. Our technique varies from previous approaches in that new datasets are iteratively generated from a seed dataset through random perturbations of individual data points, and can be directed towards a desired outcome through a simulated annealing optimization strategy.