### bookmarks

•

#### IOM’s Global Migration Data Analysis Centre (GMDAC)

Portal founded Dec 2017. German government initiative.
a month ago by @mikaelbook

•

#### 58775 - Estimating nonlinear combinations of model parameters

The NLEstimate macro allows you to estimate one or more linear or nonlinear combinations of parameters from any model for which you can save the model parameters and their variance-covariance matrix. Most modeling procedures which offer ESTIMATE, CONTRAST, or LSMEANS statements only provide for estimating or testing linear combinations of model parameters. However, common estimation problems often involve nonlinear combinations, particularly in generalized models with nonidentity link functions such as logistic and Poisson models.
2 months ago by @jkd
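
One common way to get a standard error for a nonlinear combination of parameters is the delta method. The Python sketch below illustrates that idea only; it is not the NLEstimate macro, and the function name `delta_method`, the parameter values, and the covariance matrix are all hypothetical.

```python
import math

def delta_method(f, beta, cov, h=1e-6):
    """Return f(beta) and its approximate standard error via the delta method."""
    k = len(beta)
    grad = []
    for i in range(k):
        # Central finite difference for the partial derivative of f w.r.t. beta[i].
        up = list(beta)
        up[i] += h
        dn = list(beta)
        dn[i] -= h
        grad.append((f(up) - f(dn)) / (2 * h))
    # Approximate variance: grad' * cov * grad.
    var = sum(grad[i] * cov[i][j] * grad[j] for i in range(k) for j in range(k))
    return f(beta), math.sqrt(var)

# Hypothetical saved estimates and variance-covariance matrix.
beta = [0.5, 1.2]
cov = [[0.04, 0.01], [0.01, 0.09]]

# Nonlinear combination of interest: the ratio of the two parameters.
estimate, se = delta_method(lambda b: b[0] / b[1], beta, cov)
print(estimate, se)
```

The only inputs are the parameter estimates and their variance-covariance matrix, which matches the requirement stated in the description above.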

•

#### 44124 - Counting the number of missing and non-missing values for each variable in a data set

This sample combines macro programming with PROC FREQ and DATA step logic to count the number of missing and non-missing values for every variable in a data set. The results are stored in a data set, and two methods for structuring that data set are shown.
2 months ago by @jkd
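
For readers outside SAS, the same bookkeeping can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the sample's code: records are assumed to be dictionaries, and a value counts as missing when it is `None` or an empty string. The name `count_missing` is hypothetical.

```python
def count_missing(rows):
    """Return {variable: (n_missing, n_nonmissing)} for a list of dict records."""
    counts = {}
    for row in rows:
        for name, value in row.items():
            missing, present = counts.get(name, (0, 0))
            # Assumption: None and "" stand in for missing values.
            if value is None or value == "":
                missing += 1
            else:
                present += 1
            counts[name] = (missing, present)
    return counts

rows = [
    {"age": 34, "sex": "F"},
    {"age": None, "sex": ""},
    {"age": 51, "sex": "M"},
]
result = count_missing(rows)
print(result)
```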

•

#### 25024 - Confidence interval and hypothesis test for variance

PURPOSE: The %VARTEST macro tests the null hypothesis that the variance (or standard deviation) of a set of independent and identically normally distributed values equals a specified non-zero constant, against the one-sided alternative that the variance (or standard deviation) exceeds the constant. The macro also provides point and confidence interval estimates for the variance and standard deviation. NOTE: The CIBASIC option in PROC UNIVARIATE provides one- and two-sided confidence intervals for the standard deviation and variance. PROC TTEST provides a confidence interval for the standard deviation using either of two methods.
2 months ago by @jkd
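
For reference, the textbook chi-square form of such a test is shown below. This is the standard result for i.i.d. normal data; whether the macro computes exactly these quantities is not confirmed here. The symbols $n$ (sample size), $s^2$ (sample variance), $\sigma_0^2$ (hypothesized constant), and $\alpha$ are notation introduced for this sketch.

$$
X^2 = \frac{(n-1)\,s^2}{\sigma_0^2} \sim \chi^2_{n-1} \quad \text{under } H_0:\ \sigma^2 = \sigma_0^2,
\qquad p = P\left(\chi^2_{n-1} \ge X^2\right)
$$

A two-sided $100(1-\alpha)\%$ confidence interval for $\sigma^2$, with $\chi^2_{q,\,n-1}$ denoting the $q$-quantile of the chi-square distribution with $n-1$ degrees of freedom, is

$$
\left[\ \frac{(n-1)\,s^2}{\chi^2_{1-\alpha/2,\,n-1}},\ \ \frac{(n-1)\,s^2}{\chi^2_{\alpha/2,\,n-1}}\ \right]
$$

Taking square roots of the endpoints gives the corresponding interval for the standard deviation.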

•

#### 26100 - QIC goodness of fit statistic for GEE models

NOTE: Beginning in SAS 9.2, the QIC statistic is produced by PROC GENMOD. Beginning in SAS 9.4 TS1M2, QIC is available in PROC GEE. PURPOSE: The %QIC macro computes the QIC and QICu statistics proposed by Pan (2001) for GEE (generalized estimating equations) models. These statistics allow comparisons of GEE models (model selection) and selection of a correlation structure.
2 months ago by @jkd

•

#### 54866 - Logistic model selection using area under curve (AUC) or R-square selection criteria

The SELECT macro performs model selection methods for categorical-response models that can be fit in PROC LOGISTIC. These include models using the logit, probit, cloglog, cumulative logit, or generalized logit links. The macro supports binary as well as ordinal and nominal multinomial models. Standard model selection is done by choosing candidate effects for entry to or removal from the model according to their significance levels. After completion, the set of models selected at each step of this process is sorted on the selected criterion - AUC, R-square, max-rescaled R-square, AIC, or BIC. The requested number of best models on the selected criterion is displayed.
2 months ago by @jkd

•

#### 33027 - Looking for Unique Codes in Your Data

Many of us are presented with SAS data sets where codes such as 9999 are intermingled with real data values. Sometimes these codes represent missing values; sometimes they represent other non-data values. If you run SAS procedures on numeric variables in such a data set, you will, obviously, produce nonsense. What we present here is a macro that will automatically check all the numeric variables in a SAS data set for a specific data value, and produce a report showing which variables contain this special value and how many times it appeared. The macro is called FIND_VALUE. You can download this macro and many other useful macros from the SAS Companion Web Site: support.sas.com/publishing. Search for my book, Cody's Data Cleaning Techniques, Second Edition, and then click on the link to download the programs and data files from the book.
2 months ago by @jkd
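
The check described above is easy to sketch outside SAS. The Python below is an illustration only, not the FIND_VALUE macro: it assumes records are dictionaries, treats `int` and `float` values as the numeric variables, and uses the hypothetical function name `find_value`.

```python
def find_value(rows, special=9999):
    """Count how often `special` appears in each numeric variable."""
    hits = {}
    for row in rows:
        for name, value in row.items():
            # Only numeric values are checked; strings such as "9999" are ignored.
            is_numeric = isinstance(value, (int, float)) and not isinstance(value, bool)
            if is_numeric and value == special:
                hits[name] = hits.get(name, 0) + 1
    return hits

rows = [
    {"id": "A1", "height": 172, "weight": 9999},
    {"id": "A2", "height": 9999, "weight": 9999},
    {"id": "9999", "height": 165, "weight": 70},
]
report = find_value(rows)
print(report)
```

Variables that never contain the special value are simply absent from the report.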

•

#### 24997 - Generate confidence ellipses for bivariate normal data

NOTE: Beginning in SAS 9, you can use the ODS GRAPHICS ON; statement and the PLOTS=SCATTER(ELLIPSE=MEAN) or PLOTS=SCATTER(ELLIPSE=PREDICTED) option in the PROC CORR statement to get confidence ellipse plots about the mean or individual values. PURPOSE: The %CONELIP macro generates confidence ellipses for bivariate normal data. It can either create ellipses for the data or ellipses about the mean.
2 months ago by @jkd

•

#### 25034 - Standardize variables

NOTE: This macro is obsolete beginning with SAS 8.0. Use the STDIZE procedure in SAS/STAT software beginning in that release. PURPOSE: The %STDIZE macro standardizes one or more numeric variables in a SAS data set by subtracting a location measure and dividing by a scale measure. A variety of location and scale measures are provided, including estimates that are resistant to outliers and clustering.
2 months ago by @jkd

•

#### 25008 - Generate data from a multivariate normal distribution

NOTE: The MVN macro is obsolete. Beginning in SAS 9.2, use the RANDNORMAL function in SAS/IML software or PROC SIMNORMAL in SAS/STAT software to generate multivariate normal data. PURPOSE: The %MVN macro generates multivariate normal data using the Cholesky root of the variance-covariance matrix. Bivariate normal data can be generated using the DATA step.
2 months ago by @jkd
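
The Cholesky-root approach mentioned above can be sketched in plain Python: factor the covariance matrix as L Lᵀ, then set x = mean + L z for a vector z of independent standard normals. This is an illustration of the technique, not the %MVN macro; the function names, the mean vector, and the covariance matrix are hypothetical.

```python
import math
import random

def cholesky(a):
    """Lower-triangular L with L * L^T == a, for a symmetric positive-definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(a[i][i] - s)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low

def mvn_sample(mean, cov, n, rng):
    """Draw n vectors x = mean + L z, where z has independent standard normal entries."""
    low = cholesky(cov)
    dim = len(mean)
    draws = []
    for _ in range(n):
        z = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        draws.append(
            [mean[i] + sum(low[i][k] * z[k] for k in range(i + 1)) for i in range(dim)]
        )
    return draws

# Hypothetical bivariate example with a fixed seed for reproducibility.
cov = [[4.0, 1.2], [1.2, 1.0]]
sample = mvn_sample([10.0, 5.0], cov, 1000, random.Random(0))
print(sample[0])
```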

•

#### 30662 - Mahalanobis distance: from each observation to the mean, from each observation to a specific observation, between all possible pairs

This sample shows one way of computing Mahalanobis distance in each of the following scenarios: from each observation to the mean; from each observation to a specific observation; and from each observation to all other observations (all possible pairs).
2 months ago by @jkd
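
As a rough illustration of the first scenario (distance from each observation to the mean), here is a Python sketch restricted to two variables so the covariance inverse can be written out explicitly. It is not the SAS sample's code; the function name and the data points are hypothetical, and the sample covariance uses the n - 1 divisor.

```python
import math

def mahalanobis_to_mean(points):
    """Mahalanobis distance from each 2-D point to the sample mean."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    # Sample covariance matrix entries (divisor n - 1).
    sxx = sum((p[0] - mx) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - my) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / (n - 1)
    det = sxx * syy - sxy ** 2
    distances = []
    for x, y in points:
        dx, dy = x - mx, y - my
        # Quadratic form d' S^{-1} d, using the explicit inverse of a 2x2 matrix.
        d2 = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
        distances.append(math.sqrt(d2))
    return distances

points = [(2.0, 3.0), (4.0, 7.0), (6.0, 5.0), (8.0, 9.0)]
dist = mahalanobis_to_mean(points)
print(dist)
```

The other two scenarios follow by replacing the mean with a chosen observation, or by looping over all pairs, while keeping the same covariance matrix.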

•

#### 55481 - Symmetric Confidence and Prediction Intervals for Generalized Linear Models

The GLMPI macro computes asymptotic 100(1-α)% confidence and prediction intervals that are symmetric about the predicted mean using the delta method.
2 months ago by @jkd

•

#### 24980 - Nonparametric estimation and comparison of survival curves from interval-censored data

PURPOSE: These macros compute nonparametric maximum likelihood estimates (NPMLEs) of survival curves from interval-censored data. Confidence intervals for survival curves and log-rank tests comparing survival curves from several groups are also provided. NOTE: Beginning with SAS/STAT 13.1 in SAS 9.4 TS1M1, the functionality of these macros has been updated and added to the ICLIFETEST procedure. For details, see the ICLIFETEST documentation.
2 months ago by @jkd

•

#### 25010 - Create a polychoric correlation or distance matrix

NOTE: Beginning in SAS 9.4, this macro is no longer needed. Use the OUTPLC= option in Base SAS PROC CORR to save a matrix of polychoric (or tetrachoric) correlations. PURPOSE: The %POLYCHOR macro creates a SAS data set containing a correlation matrix of polychoric correlations or a distance matrix based on polychoric correlations.
2 months ago by @jkd

•

#### 50096 - Coloring the clusters in a dendrogram

The %CLUSTERGROUPS macro enhances dendrograms produced in SAS by adding color to highlight the clusters. It creates a custom template that combines a dendrogram and a blockplot to highlight each of the specified number of clusters with a different color. You specify the number of clusters desired as input to the macro.
2 months ago by @jkd

•

#### 24982 - Jackknife and Bootstrap Analyses

The %JACK macro does jackknife analyses for simple random samples, computing approximate standard errors, bias-corrected estimates, and confidence intervals assuming a normal sampling distribution. The %BOOT macro does elementary nonparametric bootstrap analyses for simple random samples, computing the same quantities under the same assumption. Also, for regression models, the %BOOT macro can resample either observations or residuals. The %BOOTCI macro computes several varieties of confidence intervals that are suitable for sampling distributions that are not normal.
2 months ago by @jkd
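
The jackknife part of that description can be sketched in Python as follows. This illustrates the standard leave-one-out jackknife for a simple random sample, not the %JACK macro itself; the function name and the data values are hypothetical.

```python
import math
import statistics

def jackknife(data, stat):
    """Jackknife bias-corrected estimate and standard error of stat(data)."""
    n = len(data)
    full = stat(data)
    # Recompute the statistic with each observation left out in turn.
    leave_one_out = [stat(data[:i] + data[i + 1:]) for i in range(n)]
    loo_mean = sum(leave_one_out) / n
    bias = (n - 1) * (loo_mean - full)
    se = math.sqrt((n - 1) / n * sum((t - loo_mean) ** 2 for t in leave_one_out))
    return full - bias, se

data = [2.1, 3.4, 1.9, 5.6, 4.2, 3.3, 2.8]
estimate, se = jackknife(data, statistics.mean)
print(estimate, se)
```

For the sample mean, the jackknife bias is zero and the standard error reduces to the usual s / sqrt(n), which makes the mean a convenient sanity check before applying the function to other statistics.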

•

#### 60162 - R-square and partial R-square for generalized linear models based on the variance function

The RsquareV macro provides an R-square measure for models with a well-defined variance function such as generalized linear and generalized additive models. R² is a popular measure of fit used for ordinary regression models. The RsquareV macro provides the R_V^2 statistic proposed by Zhang (2016) for use with any model based on a distribution with a well-defined variance function. This includes the class of generalized linear models and generalized additive models based on distributions such as the binomial for logistic models, Poisson, gamma, and others. It also includes models based on quasi-likelihood functions for which only the mean and variance functions are defined. A partial R² is provided when comparing a full model to a nested, reduced model. Partial R can be obtained from this when the difference between the full and reduced model is a single parameter. A penalized R² is also available adjusting for the additional parameters in the full model.
2 months ago by @jkd

•

#### 43773 - Adverse event relative risk macro

2 months ago by @jkd

•

#### 24983 - Macro to test multivariate normality

The %MULTNORM macro provides tests and plots of multivariate normality. A test of univariate normality is also given for each of the variables. A chi-square quantile-quantile plot of the observations' squared Mahalanobis distances can be obtained allowing a visual assessment of multivariate normality. Univariate histograms with overlaid normal curves are also available.
2 months ago by @jkd

•

#### 24981 - Perform item analysis for multiple choice tests

The %ITEM macro computes descriptive statistics for analysis of data from a multiple-choice test. Each observation contains the answers from one subject to a set of questions ("items"). The data are compared to an answer key to determine which answers are correct. The score for each subject is computed as the number of correct answers. The output is very similar to that from the ITEM procedure in the SUGI Supplemental library, but several incorrect statistics have been fixed.
2 months ago by @jkd

•

#### Using simulation studies to evaluate statistical methods.

(2017). arXiv:1712.03198. Comment: 31 pages, 9 figures (2 in appendix), 8 tables (1 in appendix).
4 days ago by @jpvaldes

•

#### Validating Bayesian Inference Algorithms with Simulation-Based Calibration.

(2018). arXiv:1804.06788. Comment: 26 pages, 14 figures.
4 days ago by @jpvaldes

•

#### Polynomial Regression As an Alternative to Neural Nets.

(2018). arXiv:1806.06850. Comment: 23 pages, 1 figure, 13 tables.
4 days ago by @jpvaldes

•

#### Extending Stan for Deep Probabilistic Programming

(2018). arXiv:1810.00873.
4 days ago by @jpvaldes

•

#### A Tale of Three Probabilistic Families: Discriminative, Descriptive and Generative Models.

(2018). arXiv:1810.04261.
4 days ago by @jpvaldes

•

#### On the Hyperprior Choice for the Global Shrinkage Parameter in the Horseshoe Prior

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, volume 54 of Proceedings of Machine Learning Research, pages 905–913. PMLR (20–22 Apr 2017)
4 days ago by @jpvaldes

•

#### Pareto Smoothed Importance Sampling

(Oct 21, 2017)
4 days ago by @jpvaldes

•

#### Comparison of Bayesian predictive methods for model selection

Statistics and Computing 27 (3): 711–735 (Mar 23, 2017)
4 days ago by @jpvaldes

•

#### Piecewise linear regularized solution paths

Annals of Statistics 35 (3): 1012–1030 (2007)
4 days ago by @jpvaldes

•

#### Abandon Statistical Significance

(Apr 10, 2018)
4 days ago by @jpvaldes

•

#### Classical Statistics and Statistical Learning in Imaging Neuroscience.

Frontiers in Neuroscience (2017)
4 days ago by @jpvaldes

•

#### Beyond differences in means: robust graphical methods to compare two groups in neuroscience

European Journal of Neuroscience 46 (2): 1738–1748 (July 2017)
4 days ago by @jpvaldes

•

#### Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

European Journal of Epidemiology 31 (4): 337–350 (2016)
4 days ago by @jpvaldes

•

#### An investigation of the false discovery rate and the misinterpretation of p-values

Royal Society Open Science 1 (3): 140216 (Nov 1, 2014)
4 days ago by @jpvaldes

•

#### Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints.

BMC Medical Research Methodology 14 (1): 137+ (Dec 22, 2014)
4 days ago by @jpvaldes

•

#### Moving beyond P values: Everyday data analysis with estimation plots

bioRxiv (2018)
4 days ago by @jpvaldes

•

#### Why P Values Are Not a Useful Measure of Evidence in Statistical Significance Testing

Theory & Psychology 18 (1): 69–88 (Feb 1, 2008)
4 days ago by @jpvaldes

•

#### Bootstrap Methods for Standard Errors, Confidence Intervals, and Other Measures of Statistical Accuracy

Statist. Sci. 1 (1): 54–75 (1986). doi:10.1214/ss/1177013815.
4 days ago by @jpvaldes

•

#### Margin maximizing loss functions

Advances in Neural Information Processing Systems 16 (2004)
4 days ago by @jpvaldes

•

#### A survey of cross-validation procedures for model selection

Statistics Surveys 4 (0): 40–79 (Jul 27, 2009)
4 days ago by @jpvaldes