The SURVEYSELECT Procedure

Sequential Random Sampling

If you specify the METHOD=SEQ option and do not include a SIZE statement, PROC SURVEYSELECT uses the equal probability version of Chromy’s method for sequential random sampling. This method selects units sequentially with equal probability and without replacement. For more information, see Chromy (1979) and Williams and Chromy (1980). For information about Chromy’s PPS selection method, see the section PPS Sequential Sampling.

Sequential random sampling controls the distribution of the sample by spreading it throughout the sampling frame or stratum, thus providing implicit stratification according to the order of units in the frame or stratum. You can use the CONTROL statement to sort the input data set by the CONTROL variables before sample selection. If you also use a STRATA statement, PROC SURVEYSELECT sorts by the CONTROL variables within strata. By default (or if you specify the SORT=SERP option), the procedure uses hierarchic serpentine ordering for sorting. If you specify the SORT=NEST option, the procedure uses nested sorting. See the section Sorting by CONTROL Variables for descriptions of serpentine and nested sorting. If you do not specify a CONTROL statement, PROC SURVEYSELECT applies sequential selection to the observations in the order in which they appear in the input data set.

Following Chromy’s method of sequential selection, PROC SURVEYSELECT randomly chooses a starting unit from the entire stratum (or frame, if the design is not stratified). With this unit as the first one, the procedure treats the stratum units as a closed loop. This is done so that all pairwise (joint) selection probabilities are positive and an unbiased variance estimator can be obtained. The procedure numbers units sequentially from the random start to the end of the stratum and then continues from the beginning of the stratum until all units are numbered.

Beginning with the randomly chosen starting unit, PROC SURVEYSELECT accumulates the expected number of selections (hits), where the expected number of selections normal upper E left-parenthesis upper S Subscript h i Baseline right-parenthesis is n Subscript h Baseline slash upper N Subscript h for all units i in stratum h. The procedure computes

upper I Subscript h i Baseline equals normal upper I normal n normal t left-parenthesis sigma-summation Underscript j equals 1 Overscript i Endscripts normal upper E left-parenthesis upper S Subscript h j Baseline right-parenthesis right-parenthesis equals normal upper I normal n normal t left-parenthesis i n Subscript h Baseline slash upper N Subscript h Baseline right-parenthesis
upper F Subscript h i Baseline equals normal upper F normal r normal a normal c left-parenthesis sigma-summation Underscript j equals 1 Overscript i Endscripts normal upper E left-parenthesis upper S Subscript h j Baseline right-parenthesis right-parenthesis equals normal upper F normal r normal a normal c left-parenthesis i n Subscript h Baseline slash upper N Subscript h Baseline right-parenthesis

where normal upper I normal n normal t left-parenthesis dot right-parenthesis denotes the integer part of the number, and normal upper F normal r normal a normal c left-parenthesis dot right-parenthesis denotes the fractional part.

Considering each unit sequentially, Chromy’s method determines whether unit i is selected by comparing the total number of selections for the first (i – 1) units,

upper T Subscript h left-parenthesis i minus 1 right-parenthesis Baseline equals sigma-summation Underscript j equals 1 Overscript i minus 1 Endscripts upper S Subscript h j

with the value of upper I Subscript h left-parenthesis i minus 1 right-parenthesis.

If upper T Subscript h left-parenthesis i minus 1 right-parenthesis Baseline equals upper I Subscript h left-parenthesis i minus 1 right-parenthesis, Chromy’s method determines whether or not unit i is selected as follows. If upper F Subscript h i Baseline equals 0 or upper F Subscript h left-parenthesis i minus 1 right-parenthesis Baseline greater-than upper F Subscript h i, then unit i is selected with certainty. Otherwise, unit i is selected with probability

left-parenthesis upper F Subscript h i Baseline minus upper F Subscript h left-parenthesis i minus 1 right-parenthesis Baseline right-parenthesis slash left-parenthesis 1 minus upper F Subscript h left-parenthesis i minus 1 right-parenthesis Baseline right-parenthesis

If upper T Subscript h left-parenthesis i minus 1 right-parenthesis Baseline equals left-parenthesis upper I Subscript h left-parenthesis i minus 1 right-parenthesis Baseline plus 1 right-parenthesis, Chromy’s method determines whether or not unit i is selected as follows. If upper F Subscript h i Baseline equals 0 or upper F Subscript h i Baseline greater-than upper F Subscript h left-parenthesis i minus 1 right-parenthesis, then the unit is not selected. Otherwise, unit i is selected with probability

upper F Subscript h i Baseline slash upper F Subscript h left-parenthesis i minus 1 right-parenthesis
Last updated: December 09, 2022