This book will teach you how to do data science with R: You’ll learn how to get your data into R, get it into the most useful structure, transform it, visualise it and model it. In this book, you will find a practicum of skills for data science. Just as a chemist learns how to clean test tubes and stock a lab, you’ll learn how to clean data and draw plots—and many other things besides. These are the skills that allow data science to happen, and here you will find the best practices for doing each of these things with R. You’ll learn how to use the grammar of graphics, literate programming, and reproducible research to save time. You’ll also learn how to manage cognitive resources to facilitate discoveries when wrangling, visualising, and exploring data.
This cookbook contains more than 150 recipes to help scientists, engineers, programmers, and data analysts generate high-quality graphs quickly—without having to comb through all the details of R’s graphing systems. Each recipe tackles a specific problem with a solution you can apply to your own project and includes a discussion of how and why the recipe works.
Coordinate systems have two main jobs: Combine the two position aesthetics to produce a 2d position on the plot. The position aesthetics are called x and y, but they might be better called...
OLAP (Online Analytical Processing) is a very common way to analyze raw transaction data by aggregating along different combinations of dimensions. This is a...
Cox regression model is widely used in medical research to assess the effect of several risk factors on the survival time of patients. The {ggforest} function from {survminer} Paket easily creates a forest plot of its model estimates.
G. Consiglio, A. Burden, M. Maclure, L. McCarthy, and S. Cadarette. Pharmacoepidemiology and drug safety, 22 (11):
1146-53(November 2013)7472<m:linebreak></m:linebreak>CI: Copyright (c) 2013; JID: 9208369; OTO: NOTNLM; 2012/12/18 received; 2013/06/20 revised; 2013/07/29 accepted; aheadofprint;<m:linebreak></m:linebreak> <m:linebreak></m:linebreak>Dissenys híbrids; Case-crossover.
B. Gunjal, S. Urs, and H. Shi. In Proceedings of the ILA-TISS 2008 - International conference on Knowledge for All : Role of Libraries and Information Centres, page 117-127. ILA-TISS 2008, (2008)
F. Morandat, B. Hill, L. Osvald, and J. Vitek. Proceedings of the 26th European Conference on Object-Oriented Programming, page 104--131. Berlin, Heidelberg, Springer-Verlag, (2012)
P. Sastry, P. Krishnaiah, P. Rao, and D. Vathsal. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (1):
264--267(January 2015)
S. Urbanek. Workshop publication, 3rd International Workshop on Distributed Statistical Computing (DSC 2003), Vienna, Austria ISSN 1609-395X, (March 2003)
T. Kalibera, P. Maj, F. Morandat, and J. Vitek. Proceedings of the 10th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, page 89--102. New York, NY, USA, ACM, (March 2014)