The FMM Procedure

PROC FMM Statement

  • PROC FMM <options>;

The PROC FMM statement invokes the FMM procedure. Table 2 summarizes the options available in the PROC FMM statement. These and other options in the PROC FMM statement are then described fully in alphabetical order.

Table 2: PROC FMM Statement Options

Option Description
Basic Options
DATA= Specifies the input data set
EXCLUSION= Specifies how the procedure responds to support violations in the data
NAMELEN= Specifies the length of effect names
ORDER= Determines the sort order of CLASS variables
SEED= Specifies the random number seed for analyses that require random number draws
Displayed Output
COMPONENTINFO Displays information about the mixture components
CORR Displays the asymptotic correlation matrix of the maximum likelihood parameter estimates or the empirical correlation matrix of the Bayesian posterior estimates
COV Displays the asymptotic covariance matrix of the maximum likelihood parameter estimates or the empirical covariance matrix of the Bayesian posterior estimates
COVI Displays the inverse of the covariance matrix of the parameter estimates
FITDETAILS Displays fit information for all examined models
ITDETAILS Adds estimates and gradients to the "Iteration History" table
NOCLPRINT Suppresses the "Class Level Information" table completely or partially
NOITPRINT Suppresses the "Iteration History Information" table
NOPRINT Suppresses tabular and graphical output
PARMSTYLE= Specifies how parameters are displayed in ODS tables
PLOTS Produces ODS statistical graphics
Computational Options
CRITERION= Specifies the criterion used in model selection
NOCENTER Prevents centering and scaling of the regressor variables
PARTIAL= Specifies a variable that defines a partial classification
Options Related to Optimization
ABSCONV= Tunes an absolute function convergence criterion
ABSFCONV= Tunes an absolute function difference convergence criterion
ABSGCONV= Tunes the absolute gradient convergence criterion
FCONV= Specifies a relative function convergence criterion that is based on a relative change of the function value
FCONV2= Specifies a relative function convergence criterion that is based on a predicted reduction of the objective function
GCONV= Tunes the relative gradient convergence criterion
MAXITER= Specifies the maximum number of iterations in any optimization
MAXFUNC= Specifies the maximum number of function evaluations in any optimization
MAXTIME= Specifies the upper limit of CPU time in seconds for any optimization
MINITER= Specifies the minimum number of iterations in any optimization
TECHNIQUE= Selects the optimization technique
Singularity Tolerances
INVALIDLOGL= Tunes the value assigned to an invalid component log likelihood
SINGCHOL= Tunes singularity for Cholesky decompositions
SINGRES= Tunes singularity for the residual variance
SINGULAR= Tunes general singularity criterion
ZEROPROB= Tunes component weight threshold for number of effective components


You can specify the following options in the PROC FMM statement.

ABSCONV=r
ABSTOL=r

specifies an absolute function convergence criterion. For minimization, the termination criterion is f left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis less-than-or-equal-to r, where bold-italic psi is the vector of parameters in the optimization and f left-parenthesis dot right-parenthesis is the objective function. The default value of r is the negative square root of the largest double-precision value, which serves only as a protection against overflows.

ABSFCONV=r
ABSFTOL=r

specifies an absolute function difference convergence criterion. For all techniques except NMSIMP, the termination criterion is a small change of the function value in successive iterations:

StartAbsoluteValue f left-parenthesis bold-italic psi Superscript left-parenthesis k minus 1 right-parenthesis Baseline right-parenthesis minus f left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis EndAbsoluteValue less-than-or-equal-to r

Here, bold-italic psi denotes the vector of parameters that participate in the optimization, and f left-parenthesis dot right-parenthesis is the objective function. The same formula is used for the NMSIMP technique, but bold-italic psi left-parenthesis k right-parenthesis is defined as the vertex with the lowest function value, and bold-italic psi Superscript left-parenthesis k minus 1 right-parenthesis is defined as the vertex with the highest function value in the simplex. The default value is r=0.

ABSGCONV=r
ABSGTOL=r

specifies an absolute gradient convergence criterion. The termination criterion is a small maximum absolute gradient element:

max Underscript j Endscripts StartAbsoluteValue g Subscript j Baseline left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis EndAbsoluteValue less-than-or-equal-to r

Here, bold-italic psi denotes the vector of parameters that participate in the optimization, and g Subscript j Baseline left-parenthesis dot right-parenthesis is the gradient of the objective function with respect to the jth parameter. This criterion is not used by the NMSIMP technique. The default value is r=1E–5.

COMPONENTINFO
COMPINFO
CINFO

produces a table with additional details about the fitted model components.

COV

produces the covariance matrix of the parameter estimates. For maximum likelihood estimation, this matrix is based on the inverse (projected) Hessian matrix. For Bayesian estimation, it is the empirical covariance matrix of the posterior estimates. The covariance matrix is shown for all parameters, even if they did not participate in the optimization or sampling.

COVI

produces the inverse of the covariance matrix of the parameter estimates. For maximum likelihood estimation, the covariance matrix is based on the inverse (projected) Hessian matrix. For Bayesian estimation, it is the empirical covariance matrix of the posterior estimates. This matrix is then inverted by sweeping, and rows and columns that correspond to linear dependencies or singularities are zeroed.

CORR

produces the correlation matrix of the parameter estimates. For maximum likelihood estimation this matrix is based on the inverse (projected) Hessian matrix. For Bayesian estimation, it is based on the empirical covariance matrix of the posterior estimates.

CRITERION=keyword
CRIT=keyword

specifies the criterion by which the FMM procedure ranks models when multiple models are evaluated during maximum likelihood estimation. You can choose from the following criteria to rank models by specifying the appropriate keyword:

AIC

uses Akaike’s information criterion.

AICC

uses the bias-corrected AIC criterion.

BIC

uses the Bayesian information criterion.

GRADIENT

uses the largest element of the gradient (in absolute value).

LOGL | LL

uses the mixture log likelihood.

PEARSON

uses the Pearson statistic.

The default for maximum likelihood estimation is CRITERION=BIC.

In Bayesian model selection, you can choose from the following criteria to rank models:

DIC

uses the deviance information criterion.

LOGL | LL

uses the mixture log likelihood

The default for Bayesian estimation is CRITERION=DIC.

DATA=SAS-data-set

names the SAS data set to be used by PROC FMM. The default is the most recently created data set.

EXCLUSION=NONE | ANY | ALL
EXCLUDE=NONE | ANY | ALL

specifies how the FMM procedure handles support violations of observations. For example, in a mixture of two Poisson variables, negative response values are not possible. However, in a mixture of a Poisson and a normal variable, negative values are possible, and their likelihood contribution to the Poisson component is zero. An observation that violates the support of one component distribution of the model might be a valid response with respect to one or more other component distributions. This requires some nuanced handling of support violations in mixture models.

The default exclusion technique, EXCLUSION=ALL, removes an observation from the analysis only if it violates the support of all component distributions. The other extreme, EXCLUSION=NONE, permits an observation into the analysis regardless of support violations. EXCLUSION=ANY removes observations from the analysis if the response violates the support of any component distributions. In the single-component case, EXCLUSION=ALL and EXCLUSION=ANY are identical.

FCONV=r
FTOL=r

specifies a relative function convergence criterion that is based on the relative change of the function value. For all techniques except NMSIMP, PROC FMM terminates when there is a small relative change of the function value in successive iterations:

StartFraction StartAbsoluteValue f left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis minus f left-parenthesis bold-italic psi Superscript left-parenthesis k minus 1 right-parenthesis Baseline right-parenthesis EndAbsoluteValue Over StartAbsoluteValue f left-parenthesis bold-italic psi Superscript left-parenthesis k minus 1 right-parenthesis Baseline right-parenthesis EndAbsoluteValue EndFraction less-than-or-equal-to r

Here, bold-italic psi denotes the vector of parameters that participate in the optimization, and f left-parenthesis dot right-parenthesis is the objective function. The same formula is used for the NMSIMP technique, but bold-italic psi Superscript left-parenthesis k right-parenthesis is defined as the vertex with the lowest function value, and bold-italic psi Superscript left-parenthesis k minus 1 right-parenthesis is defined as the vertex with the highest function value in the simplex.

The default is r equals 10 Superscript minus normal upper F normal upper D normal upper I normal upper G normal upper I normal upper T normal upper S, where FDIGITS is by default minus log Subscript 10 Baseline left-brace epsilon right-brace, and epsilon is the machine precision.

FCONV2=r
FTOL2=r

specifies a relative function convergence criterion that is based on the predicted reduction of the objective function. For all techniques except NMSIMP, the termination criterion is a small predicted reduction

d f Superscript left-parenthesis k right-parenthesis Baseline almost-equals f left-parenthesis bold-italic theta Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis minus f left-parenthesis bold-italic theta Superscript left-parenthesis k right-parenthesis Baseline plus bold s Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis

of the objective function. The predicted reduction

StartLayout 1st Row 1st Column d f Superscript left-parenthesis k right-parenthesis 2nd Column equals minus bold g Superscript left-parenthesis k right-parenthesis prime Baseline bold s Superscript left-parenthesis k right-parenthesis Baseline minus one-half bold s Superscript left-parenthesis k right-parenthesis prime Baseline bold upper H Superscript left-parenthesis k right-parenthesis Baseline bold s Superscript left-parenthesis k right-parenthesis Baseline 2nd Row 1st Column Blank 2nd Column equals minus one-half bold s Superscript left-parenthesis k right-parenthesis prime Baseline bold g Superscript left-parenthesis k right-parenthesis Baseline 3rd Row 1st Column Blank 2nd Column less-than-or-equal-to r EndLayout

is computed by approximating the objective function f by the first two terms of the Taylor series and substituting the Newton step:

bold s Superscript left-parenthesis k right-parenthesis Baseline equals minus left-bracket bold upper H Superscript left-parenthesis k right-parenthesis Baseline right-bracket Superscript negative 1 Baseline bold g Superscript left-parenthesis k right-parenthesis

For the NMSIMP technique, the termination criterion is a small standard deviation of the function values of the n plus 1 simplex vertices bold-italic theta Subscript l Superscript left-parenthesis k right-parenthesis, l equals 0 comma ellipsis comma n,

StartRoot StartFraction 1 Over n plus 1 EndFraction sigma-summation Underscript l Endscripts left-bracket f left-parenthesis bold-italic theta Subscript l Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis minus ModifyingAbove f With bar left-parenthesis bold-italic theta Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis right-bracket squared EndRoot less-than-or-equal-to r

where ModifyingAbove f With bar left-parenthesis bold-italic theta Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis equals StartFraction 1 Over n plus 1 EndFraction sigma-summation Underscript l Endscripts f left-parenthesis bold-italic theta Subscript l Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis. If there are n Subscript a c t boundary constraints active at bold-italic theta Superscript left-parenthesis k right-parenthesis, the mean and standard deviation are computed only for the n plus 1 minus n Subscript a c t unconstrained vertices.

The default value is r = 1E–6 for the NMSIMP technique and r = 0 otherwise.

FITDETAILS

requests that the "Optimization Information," "Iteration History," "Convergence Status," and "Fit Statistics" tables be produced for all optimizations when models with different number of components are evaluated. For example, the following statements fit a binomial regression model with up to three components and produces fit and optimization information for all three:

proc fmm fitdetails;
   model y/n  = x / kmax=3;
run;

Without the FITDETAILS option, only the "Fit Statistics" table for the selected model is displayed.

In Bayesian estimation, the FITDETAILS option displays the following tables for each model that the procedure fits: "Bayes Information," "Iteration History," "Prior Information," "Fit Statistics," "Posterior Summaries," "Posterior Intervals," and any requested diagnostics tables. The "Iteration History" table appears only if the BAYES statement includes the INITIAL=MLE option.

Without the FITDETAILS option, these tables are listed only for the selected model.

GCONV=r
GTOL=r

specifies a relative gradient convergence criterion. For all techniques except CONGRA and NMSIMP, the termination criterion is a small normalized predicted function reduction:

StartFraction bold g left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis prime left-bracket bold upper H Superscript left-parenthesis k right-parenthesis Baseline right-bracket Superscript negative 1 Baseline bold g left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis Over StartAbsoluteValue f left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis EndAbsoluteValue EndFraction less-than-or-equal-to r

Here, bold-italic psi denotes the vector of parameters that participate in the optimization, f left-parenthesis dot right-parenthesis is the objective function, and bold g left-parenthesis dot right-parenthesis is the gradient. For the CONGRA technique (where a reliable Hessian estimate bold upper H is not available), the following criterion is used:

StartFraction parallel-to bold g left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis parallel-to Subscript 2 Superscript 2 Baseline parallel-to bold s left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis parallel-to Over parallel-to bold g left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis minus bold g left-parenthesis bold-italic psi Superscript left-parenthesis k minus 1 right-parenthesis Baseline right-parenthesis parallel-to Subscript 2 Baseline StartAbsoluteValue f left-parenthesis bold-italic psi Superscript left-parenthesis k right-parenthesis Baseline right-parenthesis EndAbsoluteValue EndFraction less-than-or-equal-to r

This criterion is not used by the NMSIMP technique. The default value is r = 1E–8.

HESSIAN

displays the Hessian matrix of the model. This option is not available for Bayesian estimation.

INVALIDLOGL=r

specifies the value assumed by the FMM procedure if a log likelihood cannot be computed (for example, because the value of the response variable falls outside of the response distribution’s support). The default value is –1E20.

ITDETAILS

adds parameter estimates and gradients to the "Iteration History" table. If the FMM procedure centers or scales the model variables (or both), the parameter estimates and gradients reported during the iteration refer to that scale. You can suppress centering and scaling with the NOCENTER option.

MAXFUNC=n
MAXFU=n

specifies the maximum number of function calls in the optimization process. The default values are as follows, depending on the optimization technique:

  • TRUREG, NRRIDG, and NEWRAP: 125

  • QUANEW and DBLDOG: 500

  • CONGRA: 1000

  • NMSIMP: 3000

The optimization can terminate only after completing a full iteration. Therefore, the number of function calls that are actually performed can exceed the number that is specified by the MAXFUNC= option. You can choose the optimization technique with the TECHNIQUE= option.

MAXITER=n
MAXIT=n

specifies the maximum number of iterations in the optimization process. The default values are as follows, depending on the optimization technique:

  • TRUREG, NRRIDG, and NEWRAP: 50

  • QUANEW and DBLDOG: 200

  • CONGRA: 400

  • NMSIMP: 1000

These default values also apply when n is specified as a missing value. You can choose the optimization technique with the TECHNIQUE= option.

MAXTIME=r

specifies an upper limit of r seconds of CPU time for the optimization process. The time is checked only at the end of each iteration. Therefore, the actual run time might be longer than the specified time. By default, CPU time is not limited.

MINITER=n
MINIT=n

specifies the minimum number of iterations. The default value is 0. If you request more iterations than are actually needed for convergence to a stationary point, the optimization algorithms can behave strangely. For example, the effect of rounding errors can prevent the algorithm from continuing for the required number of iterations.

NAMELEN=number

specifies the length to which long effect names are shortened. The default and minimum value is 20.

NOCENTER

requests that regressor variables not be centered or scaled. By default the FMM procedure centers and scales columns of the bold upper X matrix if the models contain intercepts. If NOINT options in MODEL statements are in effect, the columns of bold upper X are scaled but not centered. Centering and scaling can help with the stability of estimation and sampling algorithms. The FMM procedure does not produce a table of the centered and scaled coefficients and provides no user control over the type of centering and scaling that is applied. The NOCENTER option turns any centering and scaling off and processes the raw values of the continuous variables.

NOCLPRINT<=number>

suppresses the display of the "Class Level Information" table if you do not specify number. If you specify number, the values of the classification variables are displayed for only those variables whose number of levels is less than number. Specifying a number helps to reduce the size of the "Class Level Information" table if some classification variables have a large number of levels.

NOITPRINT

suppresses the display of the "Iteration History Information" table.

NOPRINT

suppresses the normal display of tabular and graphical results. The NOPRINT option is useful when you want to create only one or more output data sets with the procedure. This option temporarily disables the Output Delivery System (ODS); see Chapter 23, Using the Output Delivery System, for more information.

ORDER=order-type

specifies the sort order for the levels of CLASS variables. This ordering determines which parameters in the model correspond to each level in the data.

You can specify the following values for order-type:

DATA

sorts the levels by order of appearance in the input data set.

FORMATTED

sorts the levels by external formatted value, except for numeric variables with no explicit format, which are sorted by their unformatted (internal) value.

FREQ

sorts the levels by descending frequency count; levels with the most observations come first in the order.

INTERNAL

sorts the levels by unformatted value.

FREQDATA

sorts the levels by order of descending frequency count, and within counts by order of appearance in the input data set when counts are tied.

FREQFORMATTED

sorts the levels by order of descending frequency count, and within counts by formatted value (as above) when counts are tied.

FREQINTERNAL

sorts the levels by order of descending frequency count, and within counts by unformatted value when counts are tied.

When the default ORDER=FORMATTED is in effect for numeric variables for which you have supplied no explicit format, the levels are ordered by their internal values. To order numeric class levels with no explicit format by their BEST12. formatted values, you can specify this format explicitly for the CLASS variables.

When FORMATTED and INTERNAL values are involved, the sort order is machine-dependent.

When the response variable appears in a CLASS statement, the ORDER= option in the PROC FMM statement applies to its sort order. For example, in the following statements the sort order of the wheeze variable is determined by the order of appearance in the input data set because the response variable appears in the CLASS statement:

proc fmm order=data;
   class city wheeze;
   model wheeze = city age / dist=binary s;
run;

However, in the following statements the sort order of the wheeze variable is determined by the formatted value (the default response-option in the MODEL statement):

proc fmm order=data;
   class city;
   model wheeze = city age / dist=binary s;
run;

The ORDER= option in the PROC FMM statement has no effect on the sort order of the wheeze variable because it does not appear in the CLASS statement.

When you specify a response-option in the MODEL statement, it overrides the ORDER= option in the PROC FMM statement.

For more information about sort order, see the chapter on the SORT procedure in the Base SAS Procedures Guide and the discussion of BY-group processing in the "Grouping Data" section of SAS Programmers Guide: Essentials.

PARMSTYLE=EFFECT | LABEL

specifies the display style for parameters and effects. The FMM procedure can display parameters in two styles:

EFFECT

displays parameters and effects in the EFFECT style (which is used by the MIXED and GLIMMIX procedure, for example), which identifies a parameter with an "Effect" column and adds separate columns for the CLASS variables in the model.

LABEL

displays parameters and effects in the LABEL style, which creates one column, named Parameter, that combines the relevant information about a parameter into a single column. If your model contains multiple CLASS variables, this style might use space more economically.

The EFFECT style is the default for models that contain effects; otherwise the LABEL style is used (for example, in homogeneous mixtures). You can change the display style by using the PARMSTYLE= option. Regardless of the display style, ODS output data sets that contain information about parameter estimates contain columns for both styles.

PARTIAL=variable
MEMBERSHIP=variable

specifies a variable in the input data set that identifies component membership. You can specify missing values for observations whose component membership is undetermined; this is known as a partial classification (McLachlan and Peel 2000, p. 75). For observations with known membership, the likelihood contribution is no longer a mixture. If observation i is known to be a member of component m, then its log likelihood contribution is

log left-brace pi Subscript m Baseline left-parenthesis bold z comma bold-italic alpha Subscript m Baseline right-parenthesis p Subscript m Baseline left-parenthesis y semicolon bold x prime Subscript m Baseline bold-italic beta Subscript m Baseline comma phi Subscript m Baseline right-parenthesis right-brace

Otherwise, if membership is undetermined, it is

log left-brace sigma-summation Underscript j equals 1 Overscript k Endscripts pi Subscript j Baseline left-parenthesis bold z comma bold-italic alpha Subscript j Baseline right-parenthesis p Subscript j Baseline left-parenthesis y semicolon bold x prime Subscript j Baseline bold-italic beta Subscript j Baseline comma phi Subscript j Baseline right-parenthesis right-brace

The variable that you specify in this option can be numeric or character. In case of a character variable, the variable must also appear in the CLASS statement. If it also appears in the CLASS statement, the membership assignment is made on the basis of the levelized values of the variable, as shown in the "Class Level Information" table. If the variable is numeric, any nonmissing values of the variable must take values in the range 1 comma 2 comma ellipsis comma k, where k is the number of mixture components. Invalid variable values are ignored.

In a model in which label switching is a problem, you can sometimes avoid switching by assigning just a few observations to categories. For example, in a three-component model, switching might be prevented if you assign the observation that has the smallest response value to the first component and the observation that has the largest response value to the last component.

PLOTS <(global-plot-options)> <=plot-request <(options)>>
PLOTS <(global-plot-options)> <=(plot-request <(options)> <…plot-request <(options)>>)>

controls the graphical output that is produced through ODS Graphics.

ODS Graphics must be enabled before plots can be requested. For example:

ods graphics on;
proc fmm data=yeast seed=12345;
   model count/n = / k=2;
   freq f;
   performance cpucount=2;
   bayes;
run;
ods graphics off;

For more information about enabling and disabling ODS Graphics, see the section Enabling and Disabling ODS Graphics in Chapter 24, Statistical Graphics Using ODS.

Global Plot Options

The global-plot-options apply to all relevant plots generated by the FMM procedure. The global-plot-options supported by the FMM procedure are as follows:

UNPACKPANEL
UNPACK

displays each plot separately. (By default, some plots can appear together in a single panel.)

ONLY

produces only the specified plots. This option is useful if you do not want the procedure to generate all default plots, but only the ones specified.

Specific Plot Options

The following listing describes the specific plots and their options.

ALL

requests that all plots appropriate for the analysis be produced.

NONE

requests that no ODS graphics be produced.

DENSITY <(density-options)>

requests a plot of the data histogram and mixture density function. This is a default plot in models that have no effects in the MODEL statements and is available only in these models. Furthermore, all distributions that are involved in the mixture must be continuous. You can specify the following density-options to modify the plot:

CUMULATIVE
CDF

displays the histogram and densities in cumulative form.

NBINS=n
BINS=n

specifies the number of bins in the histogram; n is greater than or equal to 0. By default, the FMM procedure computes a suitable bin width and number of bins, based on the range of the response and the number of usable observations. The option has no effect for binary data.

NOCOMPONENTS
NOCOMP

suppresses the component densities from the plot. If the component densities are displayed, they are scaled so that their sum equals the mixture density at any point on the graph. In single-component models, this option has no effect.

NODENSITY
NODENS

suppresses the computation of the mixture density and the component densities. If you specify the NOHISTOGRAM and the NODENSITY option, no plot is produced.

NOLABEL

suppresses the component identification with labels. By default, the FMM procedure labels component densities in the legend of the plot. If you do not specify a model label with the LABEL= option in the MODEL statement, an identifying label is constructed from the parameter estimates that are associated with the component. In this case the parameter values are not necessarily the mean and variance of the distribution; the values used to identify the densities on the plot are chosen to simplify linking between graphical and tabular results.

NOHISTOGRAM
NOHIST

suppresses the computation of the histogram of the raw values. If you specify the NOHISTOGRAM and the NODENSITY option, no plot is produced.

NPOINTS=n
N=n

specifies the number of values used to compute the density functions; n is greater than or equal to 0. The default is N=200.

WIDTH=value
BINWIDTH=value

specifies the bin width for the histogram. The value is specified in units of the response variable and must be positive. The option has no effect for binary data.

TRACE <(tadpanel-options)>

requests a trace panel with posterior diagnostics for a Bayesian analysis. If a BAYES statement is present, the trace panel plots are generated by default, one for each sampled parameter. You can specify the following tadpanel-options to modify the plot:

BOX
BOXPLOT

replaces the autocorrelation plot with a box plot of the posterior sample.

SMOOTH=NONE | MEAN | SPLINE

adds a reference estimate to the trace plot. By default, SMOOTH=NONE. SMOOTH=MEAN uses the arithmetic mean of the trace as the reference. SMOOTH=SPLINE adds a penalized B-spline.

REFERENCE=reference-style

adds reference lines to the density plot, trace plot, and box plot. You can specify the following reference-styles:

NONE

suppresses the reference lines.

EQT

requests equal-tail intervals.

HPD

requests intervals of highest posterior density. The level for the credible or HPD intervals is chosen based on the "Posterior Interval Statistics" table.

PERCENTILES

(or PERC) for percentiles. The largest and smallest percentiles in the "Posterior Summary Statistics" table are used.

The default is REFERENCE=EQT.

UNPACK

unpacks the panel graphic and displays its elements as separate plots.

CRITERIONPANEL <(critpanel-options)>

requests a plot for comparing the model fit criteria for different numbers of components. This plot is available only if you also specify the KMAX option in at least one MODEL statement. The plot includes different criteria, depending on whether you are using maximum likelihood or Bayesian estimation. You can specify the following critpanel-option to modify the plot:

UNPACK

unpacks the panel plot and displays its elements as separate plots, one for each fit criterion.

SEED=n

determines the random number seed for analyses that depend on a random number stream. If you do not specify a seed or if you specify a value less than or equal to zero, the seed is generated from reading the time of day from the computer clock. The largest possible value for the seed is 2 Superscript 31 Baseline minus 1. The seed value is reported in the "Model Information" table.

You can use the SYSRANDOM and SYSRANEND macro variables after a PROC FMM run to query the initial and final seed values. However, using the final seed value as the starting seed for a subsequent analysis does not continue the random number stream where the previous analysis left off. The SYSRANEND macro variable provides a mechanism to pass on seed values to ensure that the sequence of random numbers is the same every time you run an entire program.

Analyses that use the same (nonzero) seed are not completely reproducible if they are executed with a different number of threads because the random number streams in separate threads are independent. You can control the number of threads used by the FMM procedure with system options or through the PERFORMANCE statement in the FMM procedure.

SINGCHOL=number

tunes the singularity criterion in Cholesky decompositions. The default is 1E4 times the machine epsilon; this product is approximately 1E–12 on most computers.

SINGRES=number

sets the tolerance for which the residual variance or scale parameter is considered to be zero. The default is 1E4 times the machine epsilon; this product is approximately 1E–12 on most computers.

SINGULAR=number

tunes the general singularity criterion applied by the FMM procedure in sweeps and inversions. The default is 1E4 times the machine epsilon; this product is approximately 1E–12 on most computers.

TECHNIQUE=keyword
TECH=keyword

specifies the optimization technique to obtain maximum likelihood estimates. You can choose from the following techniques by specifying the appropriate keyword:

CONGRA

performs a conjugate-gradient optimization.

DBLDOG

performs a version of double-dogleg optimization.

NEWRAP

performs a Newton-Raphson optimization combining a line-search algorithm with ridging.

NMSIMP

performs a Nelder-Mead simplex optimization.

NONE

performs no optimization.

NRRIDG

performs a Newton-Raphson optimization with ridging.

QUANEW

performs a dual quasi-Newton optimization.

TRUREG

performs a trust-region optimization.

The default is TECH=QUANEW.

For more details about these optimization methods, see the section Choosing an Optimization Algorithm in Chapter 20, Shared Concepts and Topics.

ZEROPROB=number

tunes the threshold (a value between 0 and 1) below which the FMM procedure considers a component mixing probability to be zero. This affects the calculation of the number of effective components. The default is the square root of the machine epsilon; this is approximately 1E–8 on most computers.

Last updated: December 09, 2022