Datasets which are identical over a number of statistical properties, yet produce dissimilar graphs, are frequently used to illustrate the importance of graphical representations when exploring data. This paper presents a novel method for generating such datasets, along with several examples. Our technique varies from previous approaches in that new datasets are iteratively generated from a seed dataset through random perturbations of individual data points, and can be directed towards a desired outcome through a simulated annealing optimization strategy.
N. Padhy, D. Mishra, and R. Panigrahi. International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), 2 (3):
43-58(June 2012)
A. Oka, H. Masuhara, T. Imai, and T. Aotani. Companion to the First International Conference on the Art, Science and Engineering of Programming, page 26:1--26:7. New York, NY, USA, ACM, (2017)
I. Cadez, D. Heckerman, C. Meek, P. Smyth, and S. White. Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, page 280--284. New York, NY, USA, ACM, (2000)