Historic, Archive Document
Do not assume content reflects current
scientific knowledge, policies, or practices.
gSB^A
/?,// States
nent of
ml ture
Forest
Service
Forest Pest
Management
Methods
Application
Group
Fort Collins,
Colorado 80526
SAMPLING DESIGNS AND ALLOCATIONS
YIELDING MINIMUM COST ESTIMATORS
FOR MOUNTAIN PINE BEETLE LOSS
ASSESSMENT SURVEYS
Report No. 83-3
3400
November 1982
SAMPLING DESIGNS AND ALLOCATIONS YIELDING MINIMUM COST
ESTIMATORS FOR MOUNTAIN PINE BEETLE LOSS ASSESSMENT SURVEYS
1/ 2/
Nancy X. Sharpnack— and John Wong-'
U.S.D.A. Forest Service
ABSTRACT
Tables of optimum sample size for each of three stages are presented for
estimating mountain pine beetle loss in ponderosa and lodgepole pine forests
in the western United States . These are listed for varying levels of
precision and are based on data collected during surveys conducted between
1977 and 1980.
INTRODUCTION
Information from previous mountain pine beetle loss assessment surveys has
provided the opportunity to improve future sampling designs and allocations.
The material for this study was obtained from several surveys of ponderosa and
lodgepole pine mortality that were conducted by the Forest Service in the
western Regions between 1977 and 1980. These surveys provided a sufficient
data base from which variance components were estimated, costs were assessed
and various strategies could be evaluated.
The procedures used in the previous surveys and their results have been
documented in various reports (Hostetler and Young 1979, Bennett and Bousfield
1978, Bennett et al . 1980, and Lister and Young 1981). The steps involved:
1. Aerial sketchmapping.
2. Stratification based on intensity of mortality per acre as
determined from the sketchmapping.
3. Random sample of aerial photos within each stratum (Stage 1).
4. Subsample of aerial photos chosen with probabilities proportional
to photo interpreted dead tree counts (Stage 2).
5. Sample of ground plots selected with probabilities proportional
to dead tree counts (Stage 3).
1 Mathematical Statistician, Pacific Southwest Forest and Range
Experiment Station, Berkeley, California.
2 Formerly Mathematical Statistician, Forest Pest Management, Methods
Application Group, Davis, California. Presently Mathematical
Statistician, Information Systems Management Staff, Pacific Southwest
Region, San Francisco, California.
2
The objectives of this study were to evaluate whether or not stratification
is effective, and to find those allocations of sampling units between the
three stages which costs the least for a fixed percent standard error. The 20%
standard error of the estimate is a requirement specified in the Forest Insect
and Disease Information System Implementation Plan (FIDIS) (Ciesla and
Yasinski 1980). We also considered the optimum allocation for 10%, 15% and
25% standard errors whenever these were attainable. We were constrained to
considering 40 and 90 acre photo plots and 2.5 acre ground plots since all of
the usable data available from the previous surveys fell into these
categories. We considered the effects of taking one, two or three ground
plots per photo plot in the third sampling stage where traditionally two plots
have been taken.
METHODS
We used data from four previous mountain pine beetle surveys (Table 1).
Cost factors applicable to the analysis were obtained from data furnished by
Dayl e Bennett (personal communication) in Region 3 and confirmed by Richard
Myhre of the Rocky Mountain Forest and Range Experiment Station, Ft. Collins,
Colorado. These costs are in units of the number of person hours to do photo
interpretation and obtain ground measurements (Table 2). The cost estimates
include plot set-up time and travel time. As expected, the time required to
make a ground measurement is significantly higher than to interpret a photo,
so that it is evident that a major cost element in sampling is associated with
the time spent on the ground.
For each of the previous surveys, the variance components associated with
each of the three stages were computed. From these it was possible to
estimate the variance which would have resulted for any allocation of sampling
units of interest. The estimate of total variance was computed using the
following formula:
v(y) = (N-n/N) (l/n)v(y1) + ( 1/m) v(y2 ) + (l/ml)v(y3)
in which
y = the estimate of total mortality
v(y) = the estimate of the variance of the estimate of total
mortal i ty
v(y^ ) = the estimate of stage i variance
N = the total number of possible photo plots in the stratum
n = the number of photo plots to be sampled at Stage 1
m = the number of photo plots to be subsampled at Stage 2
1 = the number of ground plots per photo plot at Stage 3
3
TABLE 1. Summary of Mountain Pine Beetle data sets used in generating
estimates of variances for this study.
SURVEY
TREE
SPECIES
STRATUM
: number of
photos
(Stage 1)
:photo
plot
si ze
( acres)
:number
photos
subsampl ed
(Stage 2)
1978
LP
L
< 4.9 trees/acre
94
40
24
Beaverhead/
M
T.0-9.9
24
16
Gal 1 ati n
H
>_ 10.0
26
16
1979
LP
L
<10.0 trees/acre
174
40
18
Montana
M
10.0-41.0
87
19
H
>41.0
81
20
1978
PP
M
109
90
50
Black Hills
H
77
50
1979
PP
Conti guous
175
90
20
Col orado
Front Range
TABLE 2. Cost per plot for photo interpretation and ground measurements (in
person hours)
Ground Plot
Photo Plot Size Size
Host Species Stratum 40 Acres 62.5 Acres 90 Acres 2.5 Acres
LP
L
0.39
0.53
0.73
10.60
M
0.48
0.66
0.85
12.40
H
1.04
1.11
1.40
13.10
PP
L
0.33
0.39
0.49
10.60
M
0.40
0.51
0.63
12.40
H
0.65
0.81
1.01
13.10
4
The percent standard error of the estimate is computed as 100 v(y)/y.
Formulas for computing variance components are given in the appendix.
A computer program used iterative methods to solve for a fixed percent
standard error while varying the Stage 2 allocation for numerous values of n
and for 1 = 1, 2, and 3. For each solution, the cost was computed with the
formul a:
C = n CpI + ml CG
in which
C = total cost for this allocation
Cpj = cost of interpreting a photo plot
CG = cost of measuring a ground plot
the minimum cost was then selected from the possible combinations of n, m, 1.
In order to combine strata, weights were used. The weight for each stratum
was the ratio of the expected number of plots that would have fallen in the
stratum had no stratification been imposed and the actual number of plots that
were sampled in the stratum in the previous survey. The variance component
for stage i, v was computed using the formula:
in which
v. = the variance component for stage i
k = the number of strata
w. . = the weight for stage i, stratum j
■ vJ
v.. = the variance component for stage i, stratum j
* J
Likewise, since the costs were dependent upon whether the plots were from
the light, medium, or heavy stratum, weighted costs were used in the combined
strata analysis.
RESULTS
The results are summarized (Tables 3 through 6) to provide a readily usable
tool with which to better plan future surveys. For each host species, those
sampling allocations which yielded the minimum cost for the various levels of
precision are provided. In addition, the costs for allocations other than the
optimum are presented. This is for planning purposes. Often, cost is not the
5
TABLE 3. Optimal Sampling Allocations for the 1978 Beaverhead-Gallatin
Survey in Lodgepole Pine.
STRATUM
% Std.
Error
One Ground
Per Stage 2
Plot
Plot
Two Ground Plots
Per Stage 2 Plot
Three Ground
Per Stage 2
PI ots
Plot
No. of
PI
Plots
No. of
PI Sub
Plots
Cost
- ($)
No. of
PI
Plots
No. of
PI Sub
PI ots
Cost
- ($)
No. of
PI
Plots
No. of
PI Sub
Plots
Cost
- ($)
L
10%
300
21
339
450
14
471
400
13
569
M
200
19
331
250
12
416
250
10
491
H
200
18
574
250
16
679
250
13
770
TOTAL
700
58
1244
950
42
1566
900
36
1830
NO STRATIFICATION
300
28
517
350
18
626
350
15
733
L
15%
150
9
154
150
7
207
200
6
258
M
80
8
137
60
6
177
60
5
215
H
100
13
274
150
7
339
100
7
379
TOTAL
330
30
565
360
20
723
360
18
852
NO STRATIFICATION
140
12
241
140
8
282
150
7
331
L
20%
80
5
84
70
4
112
70
3
123
M
50
4
73
40
3
93
40
3
131
H
60
7
154
60
4
179
70
4
203
TOTAL
190
16
311
170
11
384
180
10
457
NO STRATIFICATION
80
7
130
90
4
158
90
4
184
L
25%
50
3
51
50
2
62
50
2
83
M
20
3
47
30
2
64
20
2
84
H
40
4
100
40
3
115
40
2
130
TOTAL
110
10
198
120
7
241
110
6
297
NO STRATIFICATION
50
5
84
50
3
102
50
2
120
6
TABLE 4. Optimal Sampling Allocations for the 1979 Montana Survey in
Lodgepole Pine.
STRATUM
% Std .
Error
One Ground
Per Stage 2
PI ot
Plot
Two
Per
Ground Plots
Stage 2 Plot
Three Ground
Per Stage 2 1
PI ots
Plot
No. of
PI
PI ots
No. of
PI Sub
Plots
Cost
- ($)
No. of
PI
Plots
No. of
PI Sub
Plots
Cost
- ($)
No. of
PI
Plots
No. of
PI Sub
PI ots
Cost
- ($)
L
10%
500
no
1360
500
70
1678
500
56
1975
M
600
177
2480
700
104
2913
700
81
3339
H
400
40
944
400
34
1318
500
29
1676
TOTAL
1500
327
4784
1600
208
5909
1700
166
6990
NO STRATIFICATION
900
114
1904
1100
77
2519
1200
66
3113
L
15%
250
48
606
300
30
752
250
25
892
M
250
81
1121
300
47
1315
300
37
1508
H
150
20
421
200
15
594
200
14
754
TOTAL
650
149
2148
800
92
2661
750
76
3154
NO STRATIFICATION
400
51
852
500
34
1127
500
30
1392
L
20%
110
28
339
80
18
413
100
14
484
M
50
45
634
150
27
744
200
20
855
H
60
11
238
100
9
335
120
8
426
TOTAL
220
84
1211
330
54
1492
420
42
1765
NO STRATIFICATION
250
28
480
250
20
636
300
17
785
L
25%
70
18
218
70
11
260
90
9
321
M
100
29
407
120
17
479
150
13
555
H
60
7
152
70
5
215
80
5
275
TOTAL
230
54
777
260
33
954
320
27
1151
NO STRATIFICATION
150
18
307
200
12
408
200
11
503
7
TABLE 5.
Optimal Sampling
Ponderosa Pine*
Allocations for
“ the
1978 Black
Hi 1 1 s
Survey in
STRATUM
% Std.
One
Ground
Plot
Two
Ground Plots
Three
Ground
Plots
Error
Per Stage 2
Plot
Per Stage 2 Plot
Per Stage 2 Plot
No. of
No. of
Cost
No. of No. of
Cost
No. of
No. of
Cost
PI
PI Sub
- ($)
PI
PI Sub-
($)
PI
PI Sub-
($)
PI ots
PI ots
PI ots
PI ots
PI ots
PI ots
M
10%
1400
62
1646
1400
37
1797
1500
27
1944
H
400
52
1084
400
32
1246
500
23
1407
TOTAL
1800
114
2730
1800
69
3043
2000
50
3351
M
15%
700
35
870
800
18
953
800
14
1027
H
200
25
530
200
15
607
200
12
687
TOTAL
900
60
1400
1000
33
1560
1050
26
1714
M
20%
400
22
528
500
11
577
500
8
621
H
110
15
309
120
9
355
130
7
397
TOTAL
510
37
837
620
20
932
630
15
1018
M
25%
300
13
347
300
8
378
300
6
411
H
70
10
201
80
6
230
80
5
260
TOTAL
370
23
548
380
14
608
380
11
671
^Strata were not combined since no data were available for the light condition.
8
TABLE 6. Optimal Sampling Allocations for the 1979 Colorado Survey in
Ponderosa Pine.
STRATUM % Std.
Error
One Ground
Per Stage 2
Plot
Plot
Two Ground Plots
Per Stage 2 Plot
Three Ground Plots
Per Stage 2 Plot
No. of
PI
PI ots
No. of
PI Sub
PI ots
Cost
- ($)
No. of
PI
PI ots
No. of
PI Sub-
Pi ots
Cost
($)
No. of
PI
PI ots
No. of
PI Sub-
Pi ots
Cost
($)
10%
1400
352
5240
1600
228
6654
1800
186
8044
15%
600
165
2421
800
104
3079
900
85
3725
20%
400
91
1378
400
61
1763
500
49
2135
25%
200
62
894
200
35
894
300
38
1130
only consideration in allocation of resources. Scheduling, amount of training
required, travel restrictions, etc., often play a part. Therefore, it is
important to be able to know how much will be sacrificed in having a design
which is suboptimum in some respect.
Some of the results were consistent over all of the previous survey data
sets. In no case was stratification beneficial. This is probably because in
variable probability sampling the measurement upon which the variance depends
is the ration of the next stage measurement to the previous stage measurement.
This would not be necessarily more homogeneous within strata defined by
intensities of mortality. In every case, sampling with only one ground plot
per photoplot in the final stage was best.
Results for ponderosa pine are somewhat limited in that for the Black
Hills, data from only two strata were available and for the Colorado Front
Range survey, only one stratum was usable. The results are summarized in
Tables 5 and 6. Since data were incomplete, we were not able to do the
analysis for no stratification.
9
DISCUSSION
The variances in any two surveys will not be the same. Much depends upon
the geographic location of the survey, the quality of the photos, the skill of
the interpreters, and the inherent variability in the population being
surveyed. Consequently, the results presented here should be used
conservatively as a guideline, not as an absolute rule.
It is felt that the results derived from these analyses are highly
dependent on the cost information used. If more precise answers are to be
obtained, more effort should be directed in the future to obtain and maintain
cost data. For each survey, a good estimate of costs could be obtained if the
total person hours spent doing photo interpretation and the total number of
photos interpreted were tallied, as well as the total person hours spent doing
ground work plus the number of plots measured on the gorund were recorded.
It is not necessary to take more than one plot in the final stage of
sampling in order to estimate the standard error of the estimate of total for
any single survey; however, it is impossible to evaluate the variance
component in the final stage for the optimization of future surveys if only
one plot is sampled. For this reason, it is often desirable to consider only
those designs with at least two ground plots per photo plot in the final stage
even though this may not be the most cost effective. Also, in choosing a
viable alternative for a particular survey, some provision should be made to
allow for missing or unusable data. This is another reason why it may be
better to use more than one ground plot per cell in the final sampling stage.
REFERENCES
Bennett, D.D., and W.E. Bousfield. 1979. A pilot survey to measure annual
mortality caused by the mountain pine beetle in lodgepole pine on the
Beaverhead and Gallatin National Forests. USDA For. Serv., Northern
Region, Rep. No. 79-20. 13 pp.
Bennett, D.D., and W.E. Bousfield, M.D. McGregor, and K.E. Gibson. 1980.
Evaluation of multistage sampling techniques to measure lodgepole pine
mortality caused by mountain pine beetle in Montana, 1979. USDA For.
Serv., Northern Region, Rep. No. 80-13. 11 pp.
Ciesla, W.M., and F.M. Yasinski. 1980. Forest insect and disease information
systems (FIDIS) implementation plan. USDA For. Serv., FIDM/MAG Rep. No.
80-3. 10 pp.
Hosteller, B.B., and R.W. Young. 1979. Estimation procedures for determining
annual tree mortality caused by the mountain pine beetle. USDA For. Serv.,
Rocky Mtn. Region, Tech. Rep. R2-20. 25 pp.
Klein, W.H., D.D. Bennett, R.W. Young. 1979. A pilot survey to measure
annual mortality of lodgepole pine caused by the mountain pine beetle.
USDA For. Serv., FIDM/MAG Rep. No. 78-4. 15 pp.
Lister, C.K., and R.W. Young. 1981. 1979 Colorado forest range mountain pine
beetle survey. USDA For. Serv., Rocky Mtn. Region, Tech. Rep. R2-22. 19 pp.
10
APPENDIX
The variance components of the unbiased estimator for the number of dead
trees are defined in this section. Let V(Y^) be the variance component
associated with the first stage sampling, then V(Y^) is as follows:
00
_ N1 (h* ~ n)
N 2- » “ Z.
" ^ - n v
J
A
A - 1
Let V ( Y^ ) be the variance component associated with the second stage
sampling, the V ( Y^ ) is as follows:
MV
a.
I
Let V (Y^) be the variance component associated with the third stage
sampling, the V ( Y3 ) is as follows:
V
k
k - L'
lJ - \
The definitions for the variables applicable to the above formulas as follows
N = total possible number of PI plots
N' = total number of PI plots taken
x. = total PI for plot i.
x = mean of Pi values for all photos
Vjk = ground measurement for photo j and ground plot k
Pjk = probability of ground plot k within photo plot j
l
L'
U
Pi*-
11
number of ground plots samled per photo plot
total possible number of ground plots/photo plot
probability for selecting photo plot j
number of photo plots sub-sampled during second stage in
original survey
number of photo plots to sub-sample during second stage for next
time (parameter to be optimized)
number of ground plots/photo plot to sample next time (parameter
to be optimized)
number of photos in first stage next time (parameter to be
optimized)