Index

A

acceptance-rejection technique 126128

addition (+) operator 329

Akaike information criterion 249

ALL function 326

alternative parameterizations 216, 218

annotation facility, SGPLOT procedure 301

ANOVA procedure 199, 208210

ANY function 326

APPEND statement 330331

approximate sampling distribution

See ASD (approximate sampling distribution)

AR model 252, 256258

AR(1) model

about 185186, 252

approximating sampling distributions for parameters 254256

generating covariance matrix with known structure 183

generating multivariate binary variates 155

simulating data in DATA step 252254

simulating data in SAS/IML software 258260

ARIMA procedure

about 252

BY statement 254

ESTIMATE statement 254

estimating AR and MA model parameters 258

IDENTIFY statement 253254

simulating AR(1) data 252253

ARMA models

about 252

approximating sampling distributions for AR(1) parameters 254256

multivariate 260261

simple AR model 252

simulating AR and MA data in DATA step 256258

simulating AR(1) data in DATA step 252254

simulating AR(1) data in SAS/IML software 258260

ARMACOV function 258

ARMALIK function 258

ARMASIM function 251, 258259

ARRAY statement 284

arrays, holding explanatory variables 201

ASD (approximate sampling distribution)

about 5255

number of samples and 96

sampling distribution of Pearson correlations 7071

sampling distribution of statistics for normal data 6061

sampling distribution of the mean 5759, 68

simple regression model example 202203

at (@) symbol 233

autocorrelated data 251

autoregressive and moving average models

See ARMA models

autoregressive model 252, 256258

B

BARCHART statement, TEMPLATE procedure 3839

BarPMF template 39

baseline hazard function 242243

BC (bias-corrected) confidence intervals 295

Bernouilli distribution

about 1415

logistic regression and 226227

parameters for 27

simulating data from inhomogeneous Poisson process 275

beta distribution 28, 301

bias-corrected (BC) confidence intervals 295

binary variates 154157

binomial distribution

about 1516

negative 27, 39

parameters for 27

BINOMIAL option, TABLES statement (FREQ) 81, 86

BISECTION module 118, 156, 334335

block-diagonal matrices 232233

BLOCK function 233, 326

%BOOT macro 295

%BOOTCI macro 295

bootstrap confidence intervals 283, 295

bootstrap distribution

about 283, 285

computing standard deviation of 294295

for skewness and kurtosis 285289

bootstrap methods

about 281

computing bootstrap confidence intervals 283, 295

computing bootstrap standard error 294295

parametric 281, 291

plotting estimates of standard errors 305306

resampling with DATA step 262266

resampling with SAS/IML software 282, 288291

resampling with SURVEYSELECT procedure 282, 286288

smooth 281, 292294

bootstrap standard error 283, 294295, 305306

BY-group technique

about 55

computing p-values 90

macro usage considerations 101

resampling example 285

suppressing output and graphics 97100

writing efficient simulations 9697, 99

BY statement

ARIMA procedure 254

BY-group technique and 55

MEANS procedure 64

simulating data with DATA step and procedures 56

TTEST procedure 80, 85

writing efficient simulations 97

C

CALIS procedure 180

case resampling 284

case sensitivity 6

Cauchy distribution 28

CDF function

about 116117, 326

checking correctness of simulated data 35

parameter considerations 110

QUANTILE function and 30, 32

working with statistical distributions 3032

CEIL function 17, 326

censored observations 124125, 244

central limit theorem (CLT) 5758

chi-square distribution 28, 6263

chi-square statistic 89

chi-square test 8890

CHISQ option, TABLES statement (FREQ) 90

Cholesky transformation 146150

CHOOSE function 326

CL option, MIXED procedure 235

CLASS statement

LOGISTIC procedure 218

MEANS procedure 64

classification variables

explanatory variables and 199

linear regression models with 208211

CLB option, MODEL statement (REG) 221

CLOSE statement 331

CLT (central limit theorem) 5758

coefficient of excess 299

COLNAME= option

FROM clause, APPEND statement 331

FROM clause, CREATE statement 331

PRINT statement 328

colon (:) operator

See mean (:) operator

COLVEC function 326, 331

comparison (<=) operator 6

complete spatial randomness 273

components (subpopulations) 119121

compound symmetry model 184

conditional distribution technique 145

conditional distributions 142144

conditional simulations

about 264

of one-dimensional data 270271

of two-dimensional data 272273

CONDMV function 270

CONDMVN function 270

CONDMVNMEANCOV function 144145

confidence intervals

about 74

bias-corrected 295

bootstrap 283, 295

computing coverage in SAS/IML language 7778

coverage for nonnormal data 7677

coverage for normal data 7476

MEANS procedure computing 315

CONSTANT function 178

contaminated normal distribution

multivariate 138140

univariate 121122

CONTENTS procedure 46

continuous distributions

about 2021

CDF function and 31

exponential distribution 2224, 28

moment-ratio diagram for 300301

normal distribution 2122, 28

parameters for 28

PDF function and 30

simulating in SAS/IML software 2627

skewness and kurtosis for 300301

uniform distribution 22, 28

continuous mixture distribution 122

continuous variables

explanatory variables and 199

linear regression models with 200203, 210211

contour plots 268269

control statements 6, 326

COORDINATES statement, SIM2D procedure 272

COPULA procedure

about 12, 169172

SIMULATE statement 170

copula technique

about 153, 164, 168169

fitting and simulating data 169173

usage example 164168

CORR function 290, 326

CORR procedure

computing sample moments 319

COV option 180

estimating covariance matrix from data 180

FISHER option 168, 173

fitting and simulating data from copula model 171, 173

generating data from copulas 168

NOMISS option 180, 190

OUTP= option 97, 180

POLYCHORIC option 190

simple linear regression models 203

simulating data from multinomial distributions 135140

correlated random errors 232236

correlation matrices

about 175176

converting between covariance and 176177

finance example 189190

finding nearest 191193

generating random 187189

problems faced with 189191

COUNTN function 326

COV function 180, 326

COV option, CORR procedure 180

covariance matrices

about 175176

building 179186

converting between correlation and 176177

estimating from data 180181

generating diagonally dominant 181182

generating from Wishart distribution 186187

generating with known structure 183186

testing for 177179

with AR(1) structure 185186

with compound symmetry 184

with diagonal structure 183184

with Toeplitz structure 185

Cox models 242243

CREATE statement

about 330331

FROM clause 331

row-major order for matrices 260

VAR clause 331

cumulative distributions

See CDF function

CUPROD function 326

CUSUM function 326

CUTVAL. format 60

D

data sets

creating from ODS tables 4546, 57

creating matrices from 330

macro usage considerations 101

reading data from 329330

writing data to 330331

data simulation

See simulating data

DATA step

See also specific techniques

observations and 6

resampling with 282286

SAS/IML language comparison 6

simulating AR data in 256258

simulating AR(1) data in 252254

simulating data using 5567

simulating MA data in 256258

densities, computing

See also PDF function

finite mixture distribution and 120

overlaying theoretical density on histograms 4041

overlaying theoretical PMF on frequency plots 38

design matrices

about 215216

creating for fixed and random effects 238239

for alternative parameterizations 218

reading 239240

with GLM parameterization 216218

design of simulation studies

about 9395

disadvantages of simulation 105

effect of number of samples 9596

moment-ratio diagram as tool for 315317

writing efficient simulations 96105

DIAG function 326

DIAGONAL= option, MATRIX statement (SGSCATTER) 291

diagonally dominant covariance matrices 181184

discrete distributions

about 14

Bernouilli distribution 1415, 27

binomial distribution 1516, 27

CDF function and 31

discrete uniform distribution 1718

geometric distribution 1617, 27

parameters for 27

PDF function and 30

Poisson distribution 1920, 27

simulating in SAS/IML software 2425

tabulated distributions 1819

discrete uniform distribution 1718

dispersion constant 226

DISTANCE function

about 326, 333334

simulating data from Gaussian random field 265

simulating data from regular process 277

DO function 326

The DO Loop blog 5, 325

DO loops

as simulation loops 55

effect of sample size on sampling distribution 64

matrix arithmetic versus 155

sampling distribution of Pearson correlations 70

simulating fixed effect by reversing order of 202203

tips for shortening simulation times 104

using multivariate data 55, 97

using univariate data 1314, 2122, 24

writing efficient simulations 97

DYNAMIC statement, SGRENDER procedure 39, 41

E

ECDF (empirical CDF) 119, 281

effect parameterization 218

effect size 8788

EIGEN routine 193, 326

eigenvalue decomposition 150151

eigenvalues (spectrum) 187189

EIGVAL function 179, 182, 326

elliptical distributions 169

empirical CDF (ECDF) 119, 281

Emrich-Piedmonte algorithm 154, 158

equality (=) operator 6

Erlang distribution 28

ESTIMATE statement, ARIMA procedure

NOPRINT option 254

OUTEST= option 254

WHERE clause 254

EUCLIDEANDISTANCE module 265, 277, 333334

evaluating power of t test 8486

evaluating statistical techniques

about 7374

assessing two-sample t test for equality of means 7884

confidence interval for a mean 7478

effect of sample size on power of t test 8788

evaluating power of t test 8486

using simulation to compute p-values 8890

excess kurtosis

about 299

computing 336337

for continuous distributions 300301

EXP function 326

EXPAND2GRID function 315

explanatory variables

about 198199

arrays holding 201

classification variables and 199

continuous variables and 199

exponential distribution

about 2224

confidence interval for a mean 7677

goodness-of-fit tests 117

inverse transformation algorithm and 117118

parameters for 28, 110

plotting PDF of 31

proportional hazards model and 242

shape parameters and 301

F

F distribution 28, 117119

F test 212, 215

factor pattern matrix 133, 150151

FACTOR procedure 133, 150

feasible region 300

final weighted least squares (FWLS) estimate 220221

FINISH statement 326

finite mixture distributions

about 119120

contaminated normal distribution 121122

simulating from 120121

FISHER option, CORR procedure 168, 173

Fisher's z transformation 290

FITFLEISHMAN module 312

fixed effects

creating design matrices for 238239

generating variables for 201

simulating by reversing order of DO loops 202203

simulating random effects components 236242

simulating with arrays 201

Fleishman's method 115, 298, 311314

FMM procedure 120, 297

FORM= option, SIMULATE statement (SIM2D) 267268

FORMAT= option, PRINT statement 328

FORMAT procedure 60

FREE statement 327

FREQ procedure

BY-group processing 90

chi-square tests and 89

confidence interval for a mean 75

design of simulation studies and 315

OUTPUT statement 97, 190

simulating multivariate ordinal variates 161

simulating univariate data 15, 1718

t tests and 8081, 8586

TABLES statement 56, 81, 86, 90

usage examples involving tables 4445

frequency plots 3739

frequency variables 287288

FROM clause

APPEND statement 331

CREATE statement 331

FROOT function

about 118119, 327, 334335

finding intermediate correlations 156

functions

See also specific functions

parameters and 110

SAS/IML language supported 326328

SAS/IML modules replicating 333336

FWLS (final weighted least squares) estimate 220221

FWLS option, ROBUSTREG procedure 220221

G

GAM procedure 247

GAMINV function 32

gamma distribution

checking correctness of simulated data 3537

chi-square distribution and 6263

fitting to data 305308

parameters for 28, 110

shape parameters and 301

GAUSS functions 188

Gaussian random field

about 263264

conditional simulation of one-dimensional data 270271

conditional simulation of two-dimensional data 272273

unconditional simulation of one-dimensional data 264267

unconditional simulation of two-dimensional data 267269

generalized Pareto distribution 113

GENMOD procedure 229

geometric distribution

about 1617

drawing random sample from 3739

parameters for 27

Givens rotations 187

GLIMMIX procedure

estimating covariance matrix 181

OUTDESIGN= option 238239

reading design matrices 239

simulating random effects components 238

GLM parameterization 216218

GLM procedure

design matrices with GLM parameterization 217

OUTSTAT= option 97

simple linear regression models and 199, 211

GLMMOD procedure

creating design matrices for fixed and random effects 238239

GLM parameterization and 216217

simulating random effects components 238

goodness-of-fit tests 3537, 117

Graph Template Language (GTL)

defining contour plots 268

overlaying theoretical density on histograms 4041

overlaying theoretical PMF on frequency plots 3739

Grid Manager, SAS 102

grid of values, creating 332333

GRID statement, SIM2D procedure 267

GTL (Graph Template Language)

defining contour plots 268

overlaying theoretical density on histograms 4041

overlaying theoretical PMF on frequency plots 3739

Gumbel distribution 111112, 301, 305

H

hard-core processes 276278

hazard function 242

hazard rate 123

high-leverage points 219, 221224

Higham's method 190193

HISTOGRAM statement, UNIVARIATE procedure

overlaying theoretical density on histograms 40

plotting bootstrap estimates of standard errors 306

sampling distribution for AR(1) parameters 255

sampling distribution of the variance 62

HistPDF template 41

homogeneous Poisson process

about 273

regular process and 276278

simulating data from 273275

HOMOGPOISSONPROCESS function 277

hypergeometric distribution 27

hypothesis testing, computing p-values for 32, 8890

I

I function 327

ID vectors, creating 331332

IDENTIFY statement, ARIMA procedure

NOPRINT option 254

VAR= option 253

IF-THEN/ELSE control statement 326

Iman-Conover method 161164, 176

IML (interactive matrix language) 5

See also SAS/IML language

IML procedure

Cholesky transformation and 148

DATA step function support 227

design matrices and 215216

estimating covariance matrix from data 180

license considerations 56

LOAD statement 159, 188, 327

matrix multiplication and 217

multivariate normal distributions and 133

sampling distribution of Pearson correlations 70

simulating ARMA samples 259

simulating Gaussian random fields 267

simulating univariate data 24

t tests and 84

in-memory technique 55, 97

index of maximum (<:>) operator 329

index of minimum (>:<) operator 329

inequality (^=) operator 6

inhomogeneous Poisson process 273, 275276

INSET statement

SGPLOT procedure 90

UNIVARIATE procedure 306

instrumental distribution 126128

interactive matrix language (IML) 5

See also SAS/IML language

intermediate correlation 155156, 167

INTO clause, READ statement 330

INV function 327

inverse CDF function

See QUANTILE function

inverse Gaussian distribution 28, 112

inverse transformation algorithm 117119

iterative DO statement 326

J

J function

about 327

sampling distribution of the mean 68

simulating univariate data 24

writing efficient simulations 97

jackknife methods 287

jitter technique 132

Johnson system of distributions 114116, 302, 308311

K

KDE (kernel density estimate) 120121

KDE procedure 132, 293

KEEP statement 284

kernel density estimate (KDE) 120121

Kronecker product matrix operator 233

kurtosis

bootstrap resampling 285289

checking correctness of simulated data 36

computing 336337

design of simulation studies and 315316

estimate bias in small samples 6567

Fleishman distribution and 115, 311

for gamma distribution 306308

Johnson system of distributions 310311

moment matching and 303

moments and 299301

plotting variations on moment-ratio diagram 303306

KURTOSIS module 288

KURTOSIS= option, OUTPUT statement (MEANS) 66

L

LABEL= option, PRINT statement 328

Laplace distribution 28

LCLM= option, OUTPUT statement (MEANS) 7475

least trimmed squares (LTS) estimate 220223

LEVERAGE option, MODEL statement (ROBUSTREG) 222

LEVERAGE= option, OUTPUT statement (ROBUSTREG) 222

LIFETEST procedure 123124, 245246

linear mixed models

about 230231

repeated measures model with random effect 231232

simulating correlated random errors 232236

with random effects 226, 230232

linear predictor

about 226

in generalized linear models 226

in proportional hazards model 243

linear regression models

about 199

based on real data 204208

generalized 226230

with classification and continuous variables 210211

with interaction and polynomial effects 215218

with single classification variable 208210

with single continuous variable 200203

LINEPARM statement, SGPLOT procedure 219220

link functions 226

listwise deletion 190

LOAD statement, IML procedure 159, 188, 327

LOC function 327, 329

location parameter 109111

LOESS procedure

about 247

MODEL statement 249

logistic distribution 28

LOGISTIC procedure

alternative parameterizations 216, 218

CLASS statement 218

logistic regression example 228

OUTDESIGN= option 218

OUTDESIGNONLY option 218

logistic regression model 226229

lognormal distribution

parameters for 28, 111

plotting bootstrap estimates of standard errors 305

shape parameters and 301

LTS (least trimmed squares) estimate 220223

M

MA model 256258

machine epsilon 178

machine precision 178

macro-loop technique 100101

macros

macro-loop technique 100101

packaging commands into 9899

usage considerations 101102

Matérn model II 277

MATLAB functions 188

matrices

See also correlation matrices

See also covariance matrices

block-diagonal 232233

checking if PSD 178179

checking if symmetric 178

constructing 240

creating data sets from 331

creating from data sets 330

design 215218

efficiency of 6

eigenvalues for 187189

Iman-Conover method 161164

reshaping 69

row-major order for 260

SAS/IML language and 6

subscript reduction operators for 328329

tips for shortening simulation times 103

matrix arithmetic 155, 217

MATRIX statement, SGSCATTER procedure 291

MAX function 327, 329

maximum likelihood estimate

checking correctness of simulated data 36

fitting gamma distribution to data 306

suppressing notes to SAS log 99100

maximum (<>) operator 329

MCD subroutine 140

MCMC procedure

about 9

Gibbs sampling and 145

parameter considerations 110

mean

assessing two-sample t test 7884

computing variances of 6162

confidence interval for 7478, 295

sampling distribution of 5759, 6869

MEAN function

about 68, 327

computing confidence interval for a mean 77

subscript reduction operator equivalent 329

writing efficient simulations 97

mean mapping method 158159

mean (:) operator 68, 77, 329

mean square error 54

MEAN statement, SIM2D procedure 267268

MEANS procedure

approximating sampling distribution 55

bootstrap resampling 287288

BY statement 64

CLASS statement 64

computing point estimates 282

computing sample kurtosis 66

computing sample moments 319

computing variances 61

design of simulation studies and 315

displaying descriptive statistics 255

OUTPUT statement 56, 66, 7475

P5 option 58, 285

P95 option 58, 285

sampling distribution of the mean 5759

unconditional simulation of one-dimensional data 266267

VARDEF= option 295

median, computing variances of 6162

MEDIAN function 68, 327

Mersenne-Twister algorithm 3233

METHOD= option

ROBUSTREG procedure 220221

SURVEYSELECT procedure 287

MIN function 327, 329

minimum (><) operator 329

mixed models

See linear mixed models

MIXED procedure

CL option 235

covariance structures supported 183

estimating covariance matrices 181

repeated measures model with random effect 231232

mixing probabilities 120

mixture distributions

about 119120

contaminated normal distribution 121122

simulating from 120121

MODEL procedure

about 9, 252

parametric bootstrap method 291

simulating data from copula model 169

MODEL statement

LOESS procedure 249

REG procedure 221

ROBUSTREG procedure 222

moment matching

about 298, 303

as modeling tool 302303

as tool for designing simulation studies 315317

moment-ratio diagram

about 298302

as tool for designing simulation studies 315317

comparing simulations and choosing models 314

extensions to multivariate data 318331

fitting gamma distribution to data 306308

Fleishman's method 311314

for continuous distributions 300301

Johnson system of distributions 308311

plotting variation of skewness and kurtosis on 303306

MOMENTS module 312

moments of a distribution 299301

Monte Carlo estimates

about 54

bias of kurtosis estimates in small samples 67

effect of sample size on sampling distribution 6364

MCMC procedure and 9

number of samples and 96

sampling distribution of the mean 58

simple regression model example 202203

Monte Carlo standard error 96

multinomial distribution

about 130132

generating random samples from 89

simulating data from 130132

tabulated distributions and 19, 130

multiplication (#) operator 215, 329

multivariate ARMA models 260261

multivariate contaminated normal distribution 138140

multivariate distributions

See also multinomial distribution

See also MVN (multivariate normal distributions)

advanced techniques for simulating data 153174

basic technique for simulating data 129152

Cholesky transformation and 146150

constructing with Fleishman distribution 115

extensions to 318331

generating data from 137

generating data from copulas 164173

generating multivariate binary variates 154157

generating multivariate ordinal variates 158161

methods for generating data from 144146

mixtures of 138141

reordering multivariate data 161164

resampling with SAS/IML software 289291

simulating data from 129, 153154

simulating data in time series 251

simulating data with given moments 298

spectral decomposition and 150151

using DO loop 55, 97

multivariate normal distributions (MVN)

about 133

conditional 142144

estimating covariance matrix from data 180

mixtures of 140141

simulating in SAS/IML software 133136

simulating in SAS/STAT software 136

%MULTNORM macro 135136, 140

MVN (multivariate normal distributions)

about 133

conditional 142144

estimating covariance matrix from data 180

mixtures of 140141

simulating in SAS/IML software 133136

simulating in SAS/STAT software 136

MYSQRVECH function 335

N

naive bootstrap

See bootstrap methods

NARROW option, SIM2D procedure 267

NCOL function 327

nearest correlation matrix 191193

negative binomial distribution 27, 39

NLIN procedure 291

NOMISS option, CORR procedure 180, 190

nonnormal distributions 8182

NONOTES system option 101, 116

nonparametric models 247249

nonsingular parameterizations 218

NOPRINT option

ESTIMATE statement, ARIMA procedure 254

IDENTIFY statement, ARIMA procedure 254

procedures and 97, 254

normal distribution

computing p-values 32

computing quantiles for 156

confidence interval for a mean 7477

contaminated 121122, 138140

parameters for 28

shape parameters and 301

simulating data from 1213, 2122

normal mixture distribution 28

NORTA method 168169

notes, suppressing to SAS log 99100

NOTES system option 101

NROW function 327

number of samples (repetitions) 9596

NUMREAL= option, SIMULATE statement (SIM2D) 267

O

observations

correlating 181

DATA step and 6

high-leverage points 219

ODS EXCLUDE ALL statement 97

ODS EXCLUDE statement 4546

ODS GRAPHICS statement 46

ODS OUTPUT statement

creating data sets from tables 4546, 57

suppressing output 9798

usage example 80

ODS SELECT statement 4546

ODS statements, controlling output with 4446, 9799

ODS TRACE statement 44

%ODSOFF macro 80, 98, 228, 236

%ODSON macro 99

OF operator 19

OLS (ordinary least squares) 200

one-dimensional data

conditional simulation of 270271

unconditional simulation of 264267

ordinal variates 158161

ORDMEAN function 159160

ORDVAR function 159160

OUT= option

OUTPUT statement, FREQ procedure 97

TABLES statement, FREQ procedure 56

OUTDESIGN= option

GLIMMIX procedure 238239

LOGISTIC procedure 218

OUTDESIGNONLY option, LOGISTIC procedure 218

OUTEST= option

ESTIMATE statement, ARIMA procedure 254

REG procedure 56, 97

OUTHITS option, SURVEYSELECT procedure 287288

outliers 219221

OUTP= option, CORR procedure 97, 180

output, controlling with ODS statements 4446, 9799

OUTPUT statement

FREQ procedure 97, 190

MEANS procedure 56, 66, 7475

REG procedure 205

ROBUSTREG procedure 222

OUTSTAT= option, GLM procedure 97

P

P= option, OUTPUT statement, REG procedure 205

P5 option, MEANS procedure 58, 285

P95 option, MEANS procedure 58, 285

p-values, computing for hypothesis testing 32, 8890

pairwise correlations 190

PAIRWISEDIST module 333334

PARAM= option, CLASS statement (LOGISTIC) 218

parameter estimates

reading 239240

using as parameters 207208

parameters

for Bernouilli distribution 27

for binomial distribution 27

for continuous distributions 28

for discrete distributions 27

for Emrich-Piedmonte algorithm 155

for exponential distribution 28, 110

for gamma distribution 28, 110

for geometric distribution 27

for logistic distribution 28

for lognormal distribution 28, 111

for normal distribution 28

for Poisson distribution 27

for standard normal distribution 28

for tabulated distributions 27

for uniform distribution 28, 111

for univariate distributions 109111

for Weibull distribution 28

location 109111

rate 22

scale 109111

shape 110, 299, 301

using parameter estimates as 207208

parametric bootstrap method 281, 291

Pareto distribution 28, 112113

PD (positive definite)

about 177

generating covariance or correlation matrix 179180

generating diagonally dominant covariance matrix 181182

problems with covariance matrices 190

testing covariance matrices 177

PDF function

about 326

checking correctness of simulated data 35

finite mixture distribution and 120

overlaying theoretical density on histograms 4041

overlaying theoretical PMF on frequency plots 38

parameter considerations 110

simulating data from continuous distributions 21, 23

working with statistical distributions 3031

Pearson correlations

bootstrap resampling 290

copula technique and 169170

correlation matrices and 176

sampling distribution of 6971

simple regression model example 202203

Pearson system of distributions 302

PHREG procedure 244

PLCORR option, OUTPUT statement (FREQ) 190

PLOTS= option, SIM2D procedure 268

PMF function

checking correctness of simulated data 35

generating multivariate ordinal variates 158161

overlaying on frequency plot 3739

working with statistical distributions 3031

POINT= option, SET statement 205, 282284

Poisson distribution 1920, 27, 229

Poisson process

about 273

homogeneous 273275

inhomogeneous 273, 275276

Poisson regression model 226, 229230

%POLYCHOR macro 190

POLYCHORIC option, CORR procedure 190

polynomial effects, linear models 215218

POLYROOT function 327

pooled variance t test

about 78

assessing in SAS/IML software 8384

effect of sample size on power of 8788

evaluating power of 8486

robustness to nonnormal populations 8182

robustness to unequal variances 7881

positive definite (PD)

about 177

generating diagonally dominant covariance matrix 181182

problems with covariance matrices 190

testing covariance matrices 177

positive semidefinite (PSD) 177179

power function distribution 113

power of regression tests 211215

power of t test

effect of sample size on 8788

evaluating 8486

exact power analysis 8485

simulated analysis 8586

POWER procedure

effect size and 8788

evaluating power of t test 8486

PRINT statement

about 328

COLNAME= option 328

FORMAT= option 328

LABEL= option 328

ROWNAME= option 328

sampling distribution example 69

PRINTTO procedure 99100

probability distributions

See continuous distributions

See discrete distributions

probability mass function

See PMF function

PROBBNRM function 155

PROBGAM function 32

PROBIT function 32

PROBNORM function 32

procedures

BY statement in 55

data simulation using 5567

NOPRINT option in 97, 254

suppressing notes to SAS log 99100

PROD function 327, 329

profiling simulations 102103

PROJS function 191

PROJU function 191

proportional hazards model 242245

PSD (positive semidefinite) 177179

pseudorandom numbers 33

%PUT statement 35

Q

Q-Q (quantile-quantile) plot 4144

QNTL subroutine 68, 289, 327

QQPLOT statement, UNIVARIATE procedure 4144

QUANTILE (inverse CDF) function

about 326

acceptance-rejection technique and 126127

computing confidence interval for a mean 77

computing quantile of normal distribution 156

creating Q-Q plots 4243

fitting and simulating data from copula model 170171

generating data from copulas 165

parameter considerations 110

sampling method 116122

univariate distribution support 112113

working with statistical distributions 30, 32

quantile-quantile (Q-Q) plot 4144

quantiles

See also QUANTILE function

about 32

checking correctness of simulated data 35

computing for normal distributions 156

R

RAND function

about 11, 3334

finite mixture distribution and 120

linear regression model and 227

logistic regression model and 227

overlaying theoretical PMF on frequency plots 38

parameter considerations 110111

Poisson regression model and 229

simulating data from inhomogeneous Poisson process 275

simulating univariate data 1314, 18, 2324, 27

univariate distribution support 111112, 114

working with statistical distributions 30

RANDDIRICHLET function 137

%RandExp macro 23, 123

RANDFLEISHMAN module 312

RANDGEN subroutine

about 33, 227, 327

computing confidence interval for a mean 77

distributions supported by 112

J function and 97

overlaying theoretical PMF on frequency plots 38

sampling distribution of the mean 68

simulating data from homogeneous Poisson process 274

simulating univariate data 12, 18, 2427

two-sample pooled variance t test 83

working with statistical distributions 30

RANDMULTINOMIAL function 89, 130, 327

RANDMVBINARY function 157

RANDMVORDINAL function 159160

RANDMVT function 137, 327

RANDNORMAL function

about 133, 327

Cholesky transformation 146

conditional simulations 143

simulating data from multinomial distributions 70, 133, 138, 145

unconditional simulation of one-dimensional data 265

random correlation matrices 187189

random effects

about 226

creating design matrices for 238239

generating variables for 201

linear mixed models with 226, 230232

repeated measures model with 231232

simulating components 236242

random error term 198199

random number generation

about 3335

ARMASIM function and 259

Mersenne-Twister algorithm 3233

RANDSEED subroutine and 259

setting seed value for 3335

random values for distributions

See RAND function

random variates 12

RANDSEED subroutine

about 33, 327

random number generation and 259

sampling distribution of the mean 68

simulating univariate data 24, 26

RANDVALEMAURELLI function 318319

RANDWISHART function 137, 186, 327

RANGAM function 32

RANGE= option, SIMULATE statement (SIM2D) 267

rank (Spearman) correlations 169, 176

RANK function 327

RANNOR function 32

RANPERK function 2526

RANPERM function 2526

rate parameter 22

Rayleigh distribution 114

READ statement

about 329

INTO clause 330

WHERE clause 55

reading data from data sets 329330

reference parameterization 218

REFLINE statement, SGPLOT procedure 90

REG procedure

MODEL statement 221

OUTEST= option 56, 97

OUTPUT statement 205

simple linear regression models and 199200, 204205

TEST statement 211213

regression models

about 197

components of 198199

linear 199211, 215218, 226230

linear mixed models 226, 230242

logistic regression model 226229

nonparametric models 247249

outliers and 219224

Poisson regression model 226, 229230

power of regression tests 211215

survival analysis models 123125, 242247

regular processes 276278

rejection method 126128

REPEAT function 69, 327, 331

repeated measures model with random effect 231232

repetitions (number of samples) 9596

REPS= option, SURVEYSELECT procedure 287

resampling

case 284

with DATA step 282286

with SURVEYSELECT procedure 282, 286288

reshaping matrices 69

response variables

about 197198

in logistic regression 226

outliers for 219221

simulating 240242

RETURN statement 327

ridge factor 190191

RMSE (root mean square error)

about 199

linear model based on real data 204

linear model with continuous variable 201

nonparametric models 248

RANDMVT function 327

ROBUSTREG procedure

FWLS option 220221

METHOD= option 220221

MODEL statement 222

OUTPUT statement 222

ROBUSTREG routine 140

ROOT function

about 327

checking if matrix is PD 182

checking if matrix is PSD 179

Cholesky transformation and 147

root mean square error (RMSE)

about 199

linear model based on real data 204

linear model with continuous variable 201

nonparametric models 248

row-major order for matrices 260

ROWNAME= option, PRINT statement 328

ROWVEC function 327

RSREG procedure 316

S

SAMPLE function

about 327, 336

simulating univariate data 18, 2526

sample moments

checking correctness of simulated data 3537

computing 336337

sample size

bias of kurtosis estimates and 6567

effect of on power of t test 8788

effect of on sampling distribution 6365

number of samples and 9596

standard error and 96

SAMPLEREPLACE module 288, 336

sampling distribution

approximating 5255

approximating for AR(1) parameters 254256

bias of kurtosis estimates 6567

effect of sample size on 6365

estimating probability with 5960

evaluating statistical techniques for 7391

Monte Carlo estimates 54

of a statistic 5153

of Pearson correlations 6971

of statistics for normal data 6063

of the mean 5759, 6869

simulating data using SAS/IML language 6771

simulating data with DATA step and procedures 5567

sampling variation 16

SAMPRATE= option, SURVEYSELECT procedure 287

SAS Grid Manager 102

SAS/IML language

about 56, 12, 325326

additional resources 325326

assessing t test in 8384

computing confidence interval for a mean 7778

constructing block-diagonal matrix 232233

creating grid of values 332333

creating ID vectors 331332

DATA step comparison 6

design of simulation studies and 315

Fleishman's method and 312313

functions supported 326328

generating symmetric matrices 181

Iman-Conover method 162

license considerations 56

matrices and 6

modules for sample moments 336337

modules replicating functions 333336

obtaining programs used in book 8

PRINT statement 328

reading data from data sets 329330

reading design matrices into 239

resampling support 282, 288291

simulating AR(1) data 258260

simulating data from regression models 206207

simulating data using 6771

simulating multivariate normal data 133136, 140

simulating responses 240241

subscript reduction operators 328329

writing data to data sets 330331

SAS log, suppressing notes to 99100

SAS Simulation Studio 9

SAS/STAT software 133, 136, 140

SASFILE statement 283

SCALE= option, SIMULATE statement (SIM2D) 267

scale parameter 109111

scatter matrix 186

SCATTER statement, SGPLOT procedure

YERRORLOWER= option 86

YERRORUPPER= option 86

SDF (survival distribution function) 245

SEED= option, SURVEYSELECT procedure 287288

seed value

for random number generation 3335

for sampling distribution examples 55

semicolon (;) 6

SET statement 205, 282284

SETDIF function 327

SGPLOT procedure

annotation facility 301

bias of kurtosis estimates in small samples 66

conditional distributions 144

conditional simulations 271

creating Q-Q plots 4243

generating data from copulas 168

INSET statement 90

jitter technique and 132

LINEPARM statement 219220

nonparametric models example 248249

 

plotting PDF 31

profiling simulations 102103

REFLINE statement 90

SCATTER statement 86

visualizing stationary time series 257258

SGRENDER procedure

conditional simulations 272

DYNAMIC statement 39, 41

overlaying theoretical density on histograms 4041

overlaying theoretical PMF on frequency plots 38

SGSCATTER procedure 135, 141, 291

SHAPE function

about 327

generating ID variables 331

generating matrices from Wishart distribution 187

reshaping matrices 69

shape parameters 110, 299, 301

shrinkage methods 190191

SIM2D procedure

about 12

COORDINATES statement 272

GRID statement 267

MEAN statement 267268

NARROW option 267

PLOTS= option 268

producing contour plots 269

SIMULATE statement 267268, 272

simulating data from Gaussian field 263264, 267, 272

SIMNORMAL procedure

about 12, 133

conditional simulations 142

simulating MVN distributions 136

simple bootstrap

See bootstrap methods

SIMULATE statement

COPULA procedure 170

SIM2D procedure 267268, 272

simulating data

See also under specific techniques

about 34

advanced techniques for multivariate data 153174

advanced techniques for univariate data 109128

building correlation and covariance matrices 175194

checking correctness of 3544

disadvantages of 105

for advanced regression models 225249

for basic regression models 197224

from basic multivariate distributions 129152

from common univariate distributions 1128

from spatial models 263279

from time series models 251261

moment matching and moment-ratio diagram 297322

preliminary and background information 2947

resampling and bootstrap methods 281295

shortening simulation times 103105

specialized tools for 89

strategies for 93106

to estimate sampling distributions 5171

to evaluate statistical techniques 7391

using DATA step and procedures 5567

using SAS/IML language 6771

simulation loop 55

Simulation Studio 9

singular parameterization 216

skewness

bootstrap resampling 285289

checking correctness of simulated data 36

computing 336337

design of simulation studies and 315316

Fleishman distribution and 115, 311

for gamma distribution 306308

Johnson system of distributions 310311

moment matching and 303

moments and 299301

plotting variations on moment-ratio diagram 303306

sampling distribution example 6567

SKEWNESS module 288

Sklar's theorem 169

smooth bootstrap method 281, 292294

SMOOTH= option, MODEL statement (LOESS) 249

SMOOTHBOOTSTRAP module 294

SOLVE function 327

SORT call 327

SORT procedure 56

spatial functions 273274

spatial models

about 263

simulating data from a regular process 276278

simulating data from Gaussian random field 263273

simulating data from homogeneous Poisson process 274275

simulating data from inhomogeneous Poisson process 275276

simulating data from spatial point process 273274

simulating data using other techniques 278279

spatial point processes 273274

Spearman (rank) correlations 169, 176

spectral decomposition 150151

spectrum (eigenvalues) 187189

SQRT function 326

SQRVECH function

about 327, 335

generating symmetric matrices 181

multivariate normal distributions and 140141

SSQ function 327, 329

standard errors

about 53

bootstrap 283, 294295, 305306

Monte Carlo 96

plotting bootstrap estimates of 305306

sample size and 96

standard normal distribution

computing p-values 32

computing quantiles for 156

contaminated 121122, 138140

parameters for 28

simulating data from 1213, 2122

standardized uniform distribution 22

START statement 327

STAT= option, BARCHART statement (TEMPLATE) 3839

STATESPACE procedure 261

statistic

sampling distribution of 5153

standard error of 53

statistical distributions

checking correctness of simulated data 3544

essential functions for working with 3033

random number streams 3335

STD function 68, 77, 328

STOP statement 205, 328

STORE statement 328

STREAMINIT function

about 3334

linear regression example 227

macro-loop technique and 101

simulating univariate data 1314

Student's t distribution 137

subpopulations (components) 119121

subscript reduction operators

about 328329

assessing t test 84

sampling distribution of the mean 68

writing efficient simulations 97

SUM function 328329

sum of squares (##) operator 329

SURVEYSELECT procedure

about 282

METHOD= option 287

OUTHITS option 287288

REPS= option 287

resampling with 282, 286288

SAMPRATE= option 287

SEED= option 287288

survival analysis models

about 125

proportional hazards model 242245

simulating data from multiple survivor functions 245247

simulating data in 123125

survival distribution function (SDF) 245

survivor function 245247

SYMCHECK function 178

symmetric matrices 181

SYMPUTX subroutine 90

%SYSEVALF macro 172

SYSRANDOM macro variable 3435

system time, setting seed value from 3435

T

t distribution 28, 301

T function 328

t test

assessing for equality of means 7884

effect of sample size on 8788

evaluating power of 8488

tables

creating data sets from 4546, 57

excluding 45

finding names of 4445

selecting 45

TABLES statement, FREQ procedure

BINOMIAL option 81, 86

CHISQ option 90

OUT= option 56

TABULATE call 328

tabulated distributions

about 1819

finite mixture distribution and 120

multinomial distribution and 19, 130

parameters for 27

sampling from finite sets and 26

TEMPLATE procedure

BARCHART statement 3839

overlaying theoretical density on histograms 40

templates for simulating data

defining contour plots 268

macro-loop technique and 100101

overlaying theoretical densities on histograms 4041

overlaying theoretical PMF on frequency plots 3738

univariate distributions 1314

with DATA step and procedures 5557

_TEMPORARY_ keyword 19

TEST statement, REG procedure 211213

testing for covariance matrices 177179

thinning algorithms 275, 278279

TIME function 102, 161

time series models

about 251

simulating data from ARMA models 252261

using arrays to hold explanatory variables 201

visualizing stationary time series with SGPLOT 257258

time-to-event data 123127

TOEPLITZ function 185, 328

Toeplitz matrix 185

TPSPLINE procedure 247

transformation technique 146

TRANSPOSE procedure 66

triangle distribution 28

TRISOLV function 149150, 328

truncated distribution 121, 126

TTEST procedure

BY statement 80, 85

simulated power analysis 85

two-sample pooled variance t test 80, 83, 85

two-dimensional data

conditional simulation of 272273

unconditional simulation of 267269

two-sample t test

about 78

assessing in SAS/IML software 8384

effect of sample size on power of 8788

evaluating power of 8486

robustness to nonnormal populations 8182

robustness to unequal variances 7881

Type I extreme value distribution 111112

U

UCLM= option, OUTPUT statement (MEANS) 7475

unconditional simulations

about 264

of one-dimensional data 264267

of two-dimensional data 267269

UNCONDSIMGRF function 269

unequal variances, robustness of t test to 7881

uniform correlation structure 184

uniform distribution

continuous 22

discrete 1718

linear regression model example 210211

parameters for 28, 111

UNION function 328

UNIQUE function 328

univariate distributions

acceptance-rejection technique 126128

adding location and scale parameters 109111

finite mixture distributions 119122

inverse CDF sampling 116119

resampling with SAS/IML software 288289

SAS software support for 2728

simulating data 1112

simulating data from continuous distributions 2024, 28

simulating data from discrete distributions 1420, 27

simulating data from standard normal distribution 1213

simulating data in DATA step 1114

simulating data in SAS/IML software 2427

simulating data in time series 251

simulating data with given moments 297

simulating from less common 111116

simulating survival data 123125

UNIVARIATE procedure

approximating sampling distribution 55

bootstrap resampling 285

checking correctness of simulated data 3537

distributions supported by 111112, 114116

fitting gamma distribution to data 306

HISTOGRAM statement 40, 62, 255, 306

INSET statement 306

inverse transformation algorithm 117

Johnson system of distributions and 309310

parametric bootstrap method 291

QQPLOT statement 4144

sampling distribution of Pearson correlations 70

sampling distribution of the mean 5759

sampling distribution of the variance 62

VARDEF= option 295

USE statement 329

V

Vale-Maurelli algorithm 318321

VAR clause, CREATE statement 331

VAR function 68, 328

VAR= option, IDENTIFY statement (ARIMA) 253

VARDEF= option

MEANS procedure 295

UNIVARIATE procedure 295

variance components model 183184

variance function 226

variance reduction techniques 96

variances

computing for mean 6162

computing for median 6162

of random error term 198199

robustness of t test to unequal 7881

sampling distribution of 6263

VARIOGRAM procedure 264, 268, 272

VARMA model 260261

VARMASIM subroutine 251

VECDIAG function 328

VECH function 181

vectors

creating data sets from 330331

creating grid of values 332333

creating ID vectors 331332

efficiency of 6

reading data into 329

tips for shortening simulation times 103

VMTARGETCORR function 318319

W

Wald distribution 28, 112

Weibull distribution

about 2324

parameters for 28

proportional hazards model and 242

Rayleigh distribution and 114

WHERE clause

ESTIMATE statement, ARIMA procedure 254

READ statement 55

Wishart distribution 176, 186187

writing data to data sets 330331

writing efficient simulations

avoiding macro loops 100101

basic structure of efficient simulations 9697

disadvantages of simulations 105

macro usage considerations 101102

profiling SAS/IML simulation 102103

shorting simulation times 103105

suppressing notes to SAS log 99100

suppressing ODS output and graphics 9799

X

XSECT function 328

Y

YERRORLOWER= option, SCATTER statement (SGPLOT) 86

YERRORUPPER= option, SCATTER statement (SGPLOT) 86

Symbols

+ (addition) operator 329

@ (at) symbol 233

<= (comparison) operator 6

= (equality) operator 6

<:> (index of maximum) operator 329

>:< (index of minimum) operator 329

^= (inequality) operator 6

<> (maximum) operator 329

: (mean) operator 68, 77, 329

>< (minimum) operator 329

# (multiplication) operator 215, 329

; (semicolon) 6

## (sum of squares) operator 329

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.222.182.169