Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Ecological Indicators 7 (2007) 751–767 This article is also available online at: www.elsevier.com/locate/ecolind Improving biological indicators to better assess the condition of streams M.T. Southerland a,*, G.M. Rogers a, M.J. Kline b, R.P. Morgan b, D.M. Boward c, P.F. Kazyak c, R.J. Klauda c, S.A. Stranko c a Versar, Inc., ESM Operations, 9200 Rumsey Road, Columbia, MD 21045, USA University of Maryland, Center for Environmental Science, Appalachian Laboratory, Frostburg, MD 21532, USA c Maryland Department of Natural Resources, 580 Taylor Avenue, Annapolis, MD 21401, USA b Received 2 November 2005; received in revised form 16 August 2006; accepted 18 August 2006 Abstract Biological indicators of stream condition are in use by water resource managers worldwide. The State of Maryland and many other organizations that use Indices of Biotic Integrity (IBIs) must determine when and how to refine their IBIs so that better stream condition information is provided. With completion of the second statewide round in 2004, the Maryland Biological Stream Survey (MBSS) had collected data from 2500 stream sites, more than doubling the number of sites that were available for the original IBI development. This larger dataset provided an opportunity for the MBSS to address the following shortcomings in the original IBIs: (1) substantial disturbance apparent in some reference sites, (2) fish IBIs could not be applied to very small streams, (3) natural variability within IBIs (based on regions) resulted in some stream types (e.g., coldwater and blackwater streams) receiving lower IBI scores and (4) one IBI was not able to discriminate degradation as desired (i.e., Coastal Plain fish IBI). Therefore, development of new fish and benthic macroinvertebrate IBIs was undertaken to achieve the goals of: (1) increased confidence that the reference conditions are minimally disturbed, (2) including more natural variation (such as stream size) across the geographic regions and stream types of Maryland and (3) increased sensitivity of IBIs by using more classes (strata) and different metric combinations. New fish IBIs were developed for four geographical and stream type strata: Coastal Plain, Eastern Piedmont, warmwater Highlands and coldwater Highlands streams; new benthic macroinvertebrate IBIs were developed for three geographical strata: Coastal Plain, Eastern Piedmont and Highlands streams. The addition of one new fish IBI and one new benthic macroinvertebrate IBI partitioned natural variability into more homogeneous strata. At the same time, smaller streams (i.e., those draining catchments <300 ac), which constituted a greater proportion of streams (25%) sampled in Round Two (2000–2004) than Round One (1995–1997), because of the finer map scale, were included in the reference conditions used to develop the new IBIs. The resulting new IBIs have high classification efficiencies of 83–96% and are well balanced between Type I and Type II errors. By scoring coldwater streams, smaller streams and to some extent blackwater streams higher, the new IBIs improve on the original IBIs. Overall, the new IBIs provide better assessments of * Corresponding author. Tel.: +1 410 740 6074; fax: +1 410 964 5156. E-mail address: southerlandmar@versar.com (M.T. Southerland). 1470-160X/$ – see front matter # 2006 Elsevier Ltd. All rights reserved. doi:10.1016/j.ecolind.2006.08.005 752 M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 stream condition to support sound management decisions, without requiring substantial changes by cooperating stream assessment programs. # 2006 Elsevier Ltd. All rights reserved. Keywords: Indices of Biotic Integrity (IBI); Fish; Benthic macroinvertebrate; Coldwater streams; Reference condition 1. Introduction Biological indicators of stream condition are in use by water resource managers worldwide. The State of Maryland and many other organizations that use these indicators must determine when and how to refine them so that better stream condition information is provided. This paper describes analyses undertaken to improve the biological indicators used by the Maryland Biological Stream Survey (MBSS), a statewide monitoring program conducted by the Maryland Department of Natural Resources (DNR). The benefits of these changes are described as well as the costs to the MBSS and cooperating programs. The MBSS is a probability-based sampling program that can describe streams at varying spatial scales (Klauda et al., 1998). An objective of the MBSS is to assess the status and trends in biological integrity for all 9400 non-tidal stream miles (based on the U.S. Geological Survey 1:100,000 stream network) in Maryland. Therefore, the MBSS provides estimates of the biological condition of streams statewide using indicators based on references of biological integrity. Karr and Dudley (1981) used reference condition as the basis for their definition of biological integrity, i.e., ‘‘the ability of an aquatic ecosystem to support and maintain a balanced, integrated, adaptive community of organisms having a species composition, diversity and functional organization comparable to that of natural habitats in the region.’’ Multi-metric Indices of Biotic Integrity (IBIs), originally developed by Karr et al. (1986), are the most common indicators of stream condition in use today. Most IBIs develop their expectations for the structure and function of biological assemblages from reference sites. Originally, however, the variability in these reference sites was not explicitly modeled nor was the ability of the indicator to distinguish deviation from reference condition directly tested. Currently, it is standard practice to test the performance of IBIs by determining the percentage of reference sites and known degraded sites that are correctly classified. This was first done in Maryland by Weisberg et al. (1997) for the Chesapeake Bay Estuary. More recently, researchers have demonstrated the utility of empirically modeling reference condition from reference sites as exemplified in Bailey et al. (2004) ‘‘reference condition approach.’’ Thus, IBI development today involves the careful testing of the performance of individual metrics and their combinations as indicators that work best for the geographic regions and stream types of interest. The MBSS developed the first fish (Roth et al., 1998) and benthic macroinvertebrate (Stribling et al., 1998) IBIs for Maryland in 1998. Subsequently, Roth et al. (2000) refined the Maryland fish IBI and Southerland et al. (2004) developed a stream salamander IBI for Maryland. These original Maryland IBIs have performed well, helping Maryland DNR and other agencies better characterize and manage State waters, and have produced dozens of assessments and research findings (e.g., Vølstad et al., 2003b). At the same time, these IBIs had the following shortcomings: (1) substantial disturbance apparent in some reference sites, (2) IBIs could not be applied to smallest streams, (3) natural variability within IBIs (based on regions) resulted in some stream types (e.g., coldwater and blackwater streams) receiving lower IBI scores and (4) one IBI was not able to discriminate degradation as desired (i.e., Coastal Plain fish IBI). Specifically, either a more general IBI was applied to two classes of streams (e.g., both Highlands and Piedmont streams for the benthic macroinvertebrate IBI) or no IBI was calculated (e.g., streams draining catchments of less than 300 acres for the fish IBI). To better assess Maryland streams, IBIs that accurately characterize stream condition in more stream classes were needed. With completion of the second statewide round in 2004, the MBSS had collected data from approximately 2500 stream sites, M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 more than doubling the number of sites that were available for the original IBI development. Therefore, development of new fish and benthic macroinvertebrate IBIs was undertaken with the following goals:  increase confidence that the reference conditions used are minimally disturbed by refining the criteria for selecting reference sites;  better capture the full range of natural variation in reference condition within Maryland by including more reference sites from unique geographic regions and stream types (e.g., small streams);  increase the sensitivity of IBIs for distinguishing human disturbance by segregating variation into more classes of reference condition. At the same time, development of the new IBIs had to take into account the following practical constraints:  fewer reference sites are available to characterize reference condition when a larger number of geographic or stream type classes are used;  IBIs developed for larger geographic or stream type classes may be less sensitive for distinguishing between reference condition and degraded condition (because they encompass greater natural variability, see Kilian, 2004). Complete details on this IBI development are provided in the Maryland DNR report available at www.dnr.state.md.us/streams (Southerland et al., 2005a). Future refinements to the IBIs will also be documented in this report. 2. Development of new IBIs With these objectives and constraints in mind, we undertook development of new fish and benthic macroinvertebrate IBIs for Maryland following the same steps used to develop the original MBSS IBIs:      develop the database; identify reference and degraded sites; determine the appropriate strata; test the candidate metrics; test and validate the indices. 753 2.1. MBSS database It is essential that the data used to develop IBIs (e.g., reference sites) are comparable to the data collected at the sites of concern (test sites). A virtue of the MBSS is that the same biological, chemical, physical habitat and land use data are collected for all sites used in stream assessment and indicator development. The MBSS is also ideal for the development of IBIs because the sampling protocols are rigorously applied through annual training sessions and a quality assurance program (Roth et al., 2005a). MBSS sites are selected using a probability-based design applied to all first- through fourth-order streams in Maryland based on a map scale of 1:100,000 (Roth et al., 2005b). Benthic macroinvertebrates are sampled in the spring and identified to genus or lowest practical taxon in the laboratory from 100organism subsamples. Fish are sampled in the summer using double-pass electrofishing of 75-m stream segments. Water chemistry and physical habitat data are collected from these same segments. Land use information is extracted from Maryland Office of Planning data for the catchments draining to each segment. For more details on MBSS methodologies see Roth et al. (2005b). As was done for the original IBIs and described in Roth et al. (2000), we developed an integrated dataset that included all site and landscape environmental variables linked to the biological data and their derived attributes, such as tolerance values and functional groups. The original IBIs were developed with data collected from 1994 to 1997 from a maximum of 1098 sites, divided into 732 calibration sites and 366 (33%) validation sites. The dataset for the new IBIs included all samples from 1994 to 2004, totaling 2508 sites with 353 (14%) reserved for validation. We believed this large number of sites provided us with enough reference sites to create reference conditions for additional classes of Maryland stream types. Small numbers of (or no) reference sites in a stream type (e.g., coldwater streams) prevent development of effective IBIs. 2.2. Better reference conditions Using reference sites that are minimally disturbed is perhaps the most important component of IBI development. If reference sites are only relatively less 754 M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 disturbed than other sites (such sites are often referred to as least disturbed), assigning quality levels to IBI scores becomes problematic. Therefore, we reviewed the reference criteria used in the original MBSS IBIs (a site must meet all criteria to be designated as a reference site) to identify changes that would result in greater confidence that the new reference sites were minimally disturbed. We decided to retain the following reference criteria that we believe reflect levels at which individual stressors will probably not result in adverse effects to stream biota (Roth et al., 2000):  pH  6 or blackwater stream (pH < 6 and DOC  8 mg/l);  ANC  50 meq/l;  DO  4 ppm;  nitrate  300 meq/l (4.2 mg/l);  remoteness rating: optimal or suboptimal;  aesthetics rating: optimal or suboptimal;  in stream habitat rating: optimal or suboptimal;  no channelization;  no point source discharges. Because the remoteness variable was replaced with ‘‘distance to nearest road’’ in Round Two and the channel alteration variable was replaced with ‘‘channelization,’’ comparable replacement criteria were developed and applied to Round Two sites. Specifically, the surrogate ‘‘remoteness’’ variable was obtained by converting the distance to nearest road value p toffiffiffiaffiffiffiffi0–20 ffiffiffiffiffiffiffiffiffiffiffiscore ffiffiffiffiffiffiffiffiffiffiffiusing ffiffiffiffiffi the equation: ¼ 0:615 þ 0:733 meters from road (Paul et al., 2003). A regression of this new remoteness variable on the original variable yielded a reference criterion threshold of 70. For Round Two sites, the reference criterion of no channelization was indicated by a ‘‘no’’ value for the channelization variable. At the same time, we believed that the land use criteria were not strict enough to eliminate sites with adverse effects. Based on analysis of urban effects on stream condition (Vølstad et al., 2003b), the presence of original reference sites with relatively high levels of urban land (i.e., 5–20%) indicated that not all reference sites were minimally disturbed. Therefore, we changed the minimum allowable forested land use from 25% to 35% of the catchment area and the maximum allowable urban land use from 20% to 5% of the catchment area. In addition, studies have indicated that wider vegetated riparian buffers often ameliorate land use effects (see Naiman and Décamps, 1997; Lee et al., 2004), so the minimum allowable riparian buffer width was changed from 15 to 30 m. These changes in land use and riparian width thresholds resulted in a smaller proportion of stream sites meeting the reference site criteria. Using the original reference site criteria, 152 of the 1098 Round One sites (14%) were designated as reference sites. Using the new criteria, 196 of the total 2508 sites (8%) were designated as reference. Fig. 1 shows that the cumulative distribution of stream miles with equal or greater benthic macroinvertebrate IBI scores for Round Two sites meeting the new reference criteria is to the right (i.e., higher quality) than sites meeting the original reference criteria. This result is consistent with greater confidence that the sites are minimally disturbed and, since the total number of reference sites is greater than that used to develop the original IBIs, the characterization of reference condition should be more robust. We retained the criteria for degraded sites from the original IBI development procedures as follows (a site failing any one of the criteria is designated as a degraded site):  pH  5 and ANC 0 meq/l (except for blackwater streams, DOC  8 mg/l) (n = 23 sites);  DO  2 ppm (n = 20); Fig. 1. Cumulative distribution of stream miles with benthic macroinvertebrate IBI scores for: (1) MBSS sites sampled in 2000–2004 (solid line), (2) subset of 2000–2004 sites meeting original reference criteria (dashed line) and (3) subset of 2000–2004 sites meeting new reference criteria (dotted line). M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767  nitrate > 500 meq/l (7 mg/l) and DO < 3 ppm (n = 0);  in stream habitat rating poor and urban land use >50% of catchment area (n = 15);  in stream habitat rating poor and bank stability rating poor (n = 34);  in stream habitat rating poor and channel alteration rating poor (n = 69);  urban land use >50% of catchment area and riparian buffer width = 0 m (n = 48). A total of 170 of the 2508 sites (7%) were designated as degraded using these criteria. 2.3. Full range of natural variability While the original benthic macroinvertebrate IBI was applied to all sampled streams, the MBSS recognized that the reference sites did not adequately capture the natural variation of fish assemblages in small streams. This was in part due to the lower natural abundance of fish and fewer fish species in small streams, but also due to the small number of reference sites in these streams. Therefore, streams draining catchments of less than 300 ac (i.e., where the number of fish and fish species sampled were frequently less than 100 and 5, respectively) were not rated using the original fish IBI (Roth et al., 2000). This resulted in 98 (11%) of streams sampled from 1995 to 1997 being not rated for fish because of their small size (an additional 5% of sites were not rated because they were dry and therefore not sampleable in the summer). From 2000 to 2004, the MBSS sampled streams from a new 1:100,000-scale map that included a greater proportion of small streams than was sampled from 1995 to 1997. Specifically, while 11% of streams sampled in 1995–1997 drained <300 ac, 25% of streams sampled in 2000–2004 drained <300 ac. Only 5% of streams draining <300 ac had <100 fish sampled, so data limitation was not a justification for excluding all <300 ac streams. Therefore, we attempted to include these smaller streams in the development of the new fish IBI. We included all stream sizes in the analyses, creating a more representative but more variable reference condition; subsequently, we investigated partitioning this variability into separate small stream or coldwater stream type classes (see next section). 755 2.4. More classes of reference condition As described above, the 196 reference sites meeting the more restrictive reference criteria are representative of the best 8% of streams in Maryland. These reference sites were randomly selected within 84 primary sampling units (i.e., individual and combinations of Maryland 8-digit watersheds) and are distributed across all regions and stream types. This distribution allowed us to identify stream classes using empirical data, which is generally preferable to a priori classification (Hawkins et al., 2000). The goal of the classification step in indicator development is to partition the variability in reference condition into homogeneous regions or stream types that are best addressed with separate IBIs. The original fish IBIs were developed for three geographical strata: Coastal Plain, Eastern Piedmont (as defined by fish distributions ending at Great Falls) and Highlands (Fig. 2). The original benthic macroinvertebrate IBIs were developed for the Coastal Plain and non-Coastal Plain (Highlands and Eastern Piedmont combined). For the new IBIs, we performed cluster analyses as described in Roth et al. (2000) to identify groups of sites with similar biological assemblages (as represented by log-transformed percentages of species abundance). To ensure sufficient sample size, sites meeting the original reference criteria (i.e., 261 of the 2500 sites) were used in the analysis. Separate cluster analyses were done for fish and benthic macroinvertebrates. Another approach to identifying useful strata is to apply cluster analysis or other multivariate techniques to the metrics likely to be used in the IBIs (Angermeier et al., 2000). The cluster analyses in this study indicated that fish assemblages divided into four fairly distinct groups: Coastal Plain, Eastern Piedmont, small streams in the Highlands (draining <3000-ac catchments) and large streams in the Highlands (Fig. 2). Smaller clusters were also significantly different, but only these larger clusters could be associated with consistent abiotic variables (e.g., geographic boundaries or stream size). The differentiation among benthic macroinvertebrate assemblages was similar but less strong; comparisons of the benthic macroinvertebrate taxa list among the groups also indicated consistent differences. Distinct blackwater assemblages were not discernable in cluster analyses for either fish or benthic 756 M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 Fig. 2. Map of ecological regions of Maryland and distribution of non-degraded sites with distinctly different fish assemblages. Each stratum represents sites that were found to be similar using cluster analysis. Thick lines denote the separation (from left to right) of the Highlands, Eastern Piedmont and Coastal Plain regions. Medium lines denote Maryland 8-digit watershed boundaries. Thin lines denote Maryland stream network at 1:100,000 scale. macroinvertebrates, nor were there enough blackwater reference sites to create a separate stratum. Nonetheless, other evidence, including analysis of sentinel blackwater sites (T. Prochaska, personal communication, 2005), indicates that there are differences that may justify not rating blackwater sites with the new fish and benthic macroinvertebrate IBIs. Using the original fish IBIs, the MBSS determined that many smaller streams meeting the reference criteria, and especially coldwater streams, were scoring lower than larger reference streams by approximately one-third. These erroneously low scores could lead to designating small streams as impaired when they are not. Therefore, we used both the segregated Highlands stream strata (small and large streams separately) and the combined Highlands stream stratum in subsequent indicator development steps. We also developed a coldwater streams stratum based on current and likely sustainable distributions of brook trout (M. Kline, personal communication, 2005) for use in indicator development. The coldwater stratum included all streams west of Evitts Creek in western Maryland. Isolated brook trout streams in the Catoctin Mountain area and parts of the Patapsco, Gunpowder and Susquehanna watersheds were not included in the coldwater streams stratum. The selection of each of these geographic or stream type strata has an ecological basis and potential for improving the performance of IBIs by reducing the variation in reference condition within each stratum. The number of reference sites occurring in each stratum is a practical limitation to IBI development. Bailey et al. (2004) recommends using 5–10 reference sites per class (stratum) as a minimum; experience of the MBSS indicates that 40 reference sites in each stratum is effective for developing IBIs. Even though more restrictive reference criteria were used to develop the new IBIs, the large dataset of 2500 sites still provided enough reference sites (approximately 40) for fish IBI development in each different stream type (Table 1). For the new benthic macroinvertebrate IBI, the coldwater stratum was not used because, unlike fish, benthic macroinvertebrate assemblages Table 1 Reference and degraded sites occurring in each geographic or stream type stratuma Coastal Plain Eastern Piedmont Warmwater Highlands Coldwater Highlands a Reference Degraded 52 43 53 48 82 40 35 13 Includes both calibration and validation data. M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 are not typically depauperate in minimally disturbed coldwater streams. 2.5. Testing candidate metrics In developing the original fish and benthic macroinvertebrate IBIs, the MBSS compiled and tested more than 100 candidate metrics (Roth et al., 2000; Stribling et al., 1998). For the new IBIs, we retained all metrics that showed promise in the original analysis (i.e., all that had significantly different values for reference and degraded sites) and added selected new candidate metrics. The list of candidate metrics for the new fish IBI included 44 original metrics and the following new metrics:  Pirhalla (2004) habitat tolerance metrics;  log-transformed metrics that included sculpins (10 metrics including versions adjusted for catchment area);  observed/expected (O/E) for fish (Stranko et al., 2005). The list of candidate metrics for the new benthic IBI included 51 original metrics and the following new versions of original metrics calculated based on new benthic macroinvertebrate tolerance values (derived separately for urban and agricultural land use effects) calculated from the MBSS dataset (Bressler et al., 2004):     number and percentage of intolerants; percentage of tolerants; Hilsenhoff Biotic Index (HBI); Beck’s index. The log-transformed metrics were included because analysis indicated that the original fish IBIs might have been overly influenced by sculpin abundance. The other new metrics were not available when the original IBIs were developed. Each candidate fish metric and its predicted response to stress are described in Appendix B. Detailed results of testing each metric can be found in Southerland et al. (2005a). As was done for the original fish IBIs, metrics of fish abundance and species richness were tested within each stratum, both as raw values and values adjusted 757 for stream size (Roth et al., 2000). We tested all candidate metrics by comparing mean values and distributions between reference and degraded sites in each stratum, in combined strata and statewide. We also looked at including and excluding sites with no fish, sites draining <300 ac and sites with <60 benthic macroinvertebrates to evaluate these effects on the metrics. These different comparisons ensured that the usefulness of each metric for all possible IBIs was considered. The ability of fish metrics to discriminate between reference and degraded sites (i.e., the number of such sites correctly classified) was similar when sites draining <300 ac were included. Ann Roseberry Lincoln (personal communication, 2005) found no evidence of a bias in benthic IBIs resulting from small stream size. Only three reference sites had <60 benthic macroinvertebrates, so all sites were included in the metric testing. The first step in metric testing was to test for significant differences in: (1) the mean values between reference and degraded sites using the Mann–Whitney U-test and (2) the distributions of values using the Kolmogorov–Smirnov test. The next step was to score the metrics based on the distribution of values observed at reference sites within each stratum. In developing the original IBIs we scored each metric as 5, 3 or 1, depending on whether its value at a site approximates, deviates slightly from or deviates greatly from conditions at reference sites (Karr et al., 1986). In other IBI applications (e.g., Barbour et al., 1996; Fore et al., 1996; Lyons et al., 1996; Blocksom, 2003), a number of different methods have been used to establish scoring thresholds, based on varying subdivisions of observed values. For the new IBIs, we retained our discrete scoring approach so that direct comparisons with the original IBIs could be made. In our analysis, threshold values for each selected metric were established as approximately the 10th and 50th (median) percentile values for reference sites (see Fig. 3), and were established separately for each stratum. For each metric expected to decrease with degradation, values below the 10th percentile were scored as 1. Values between the 10th and 50th percentiles were scored as 3, as they fell short of median expected values for reference sites. Values above the 50th percentile were scored as 5. Scoring 758 M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 Fig. 3. Schematic illustration of the process used to derive and interpret scores for the MBSS Indices of Biotic Integrity (IBIs). Scores are based on the distribution of reference sites, as depicted in the top figure. The bottom figure shows hypothetical reference sites in the context of other hypothetical sites, including those with known degradation. was reversed for metrics expected to increase with degradation (e.g., values below the 50th percentile were scored as 5, and values above the 90th percentile were scored as 1). In this method, both the upper and lower thresholds are independently derived from the distribution of reference site values. The 10th percentile threshold for designating scores of 1 represents our intent to identify values that are outside the natural expectation for reference sites. To test the discriminatory power of each candidate metric, we evaluated the degree of overlap between metric values at reference and degraded sites by examining the number of sites scoring above and below the lower threshold. A classification efficiency was calculated as the percent of reference sites with values scoring 3 plus degraded sites scoring <3, out of the total number of sites evaluated. Reference sites misclassified as degraded (score <3) and degraded sites misclassified as reference (score 3) make up the remainder of the sites. A high classification efficiency indicates a small amount of overlap between values for reference and degraded sites. In addition to overall classification efficiencies, classification efficiencies were also reported separately for reference and degraded sites. The term discrimination efficiency is often applied to the percentage of degraded sites alone that are correctly classified (Gerritsen et al., 2000). Most candidate metrics were significantly different between reference and degraded sites, and many had high classification efficiencies (i.e., exceeding 70%). Certain metrics in some strata exceeded 90%. Classification efficiencies were used as the primary means of selecting metrics for potential inclusion in the IBIs. Among similar metrics (e.g., number of species versus number of native species to describe species richness), the best performing metric (balanced across strata for core metrics) was used. The classification efficiencies of the fish abundance and richness metrics were very similar for both raw scores and scores adjusted for catchment area. We selected only adjusted metrics for inclusion in IBI testing because they make ecological sense and are consistent with the original MBSS IBIs. The lognormal metrics of sculpin abundance rarely had good classification efficiencies and were not selected; it is possible that sculpin absence at apparent reference sites is actually linked to current (or historical) degradation rather than unaccounted for natural differences. The observed/expected (O/E) metric for fish species could not be calculated for MBSS sites outside the values used to develop the model (i.e., later sample years), so the metric was not selected for IBI testing. In the future, refinement of the O/E models with more data may support its use as an independent indicator of stream condition. Some of the Pirhalla metrics performed adequately but were not better than traditional metrics, so they were not selected. The number of salamander species metric was also tested and had a high classification efficiency in the Coastal Plain and small stream Highlands, but was not selected because salamander sampling is not currently conducted at all MBSS sites. Some metrics with narrow thresholds, i.e., number of benthic species adjusted for M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 catchment area, percent non-tolerant suckers (all suckers except white sucker), percent Tanytarsini, number of Ephemeroptera and number of Scrapers, are essentially presence/absence metrics (in some cases no scores of 3 were assigned). Three such metrics were included in the original benthic IBIs. We evaluated the effect of eliminating these ‘‘presence/ absence’’ metrics (or using the number of benthic species without adjustment) but determined that the original formulations performed better (i.e., had higher classification efficiencies). 2.6. IBI combinations and testing As with the original IBIs, we iteratively tested many combinations of metrics to develop the new fish and benthic macroinvertebrate IBIs. For each combination, an index was calculated as the mean of the metrics included, scaled from 1 to 5. Classification efficiencies of different metric combinations (indices) were calculated as above, separately for reference and degraded sites, and overall. Individual IBI combinations were done separately for each of the provisional strata. At first, the combinations of metrics for IBI testing were selected in a stepwise manner, starting with the best performing metric (i.e., highest classification efficiency). Additional metrics were added as long as they increased the overall classification efficiency of the index. In no stratum did the classification efficiency improve after a second metric was added. This is a result of the very high classification efficiencies achieved by individual metrics in each stratum. To ensure that the final IBIs were a more complete representation of the fish and benthic assemblages (as recommended by Karr et al., 1986 and done for the original MBSS IBIs), we selected a core of four metrics that performed well and represented different assemblage characteristics for each of the strata (in the coldwater Highlands stratum, only two of these core metrics were used). The core metrics for the new fish IBI were abundance per square meter; number of benthic species (adjusted for catchment area); percentage of tolerant fish; percentage of generalists, omnivores and invertivores. Only the abundance per square meter and number of benthic species (adjusted for catchment area) core metrics were included in the coldwater Highlands fish IBI. The core metrics for the 759 new benthic IBI were number of taxa; number of Ephemeroptera, Plecoptera and Trichoptera (EPT) taxa; number of Ephemeroptera taxa; percentage of benthic macroinvertebrates intolerant to urban stress (after Bressler et al., 2004). The core fish metrics represent four of the five assemblage components identified by Karr et al. (1986): species richness and composition, indicator species, trophic composition and fish abundance and condition. The reproductive function component was not represented in the core metrics as no reproductive metrics had high classification efficiencies. Subsequently, we attempted to improve the performance of the IBIs by adding other metrics to the core suite in the same stepwise fashion. Additional metrics were added until they no longer improved the classification efficiency of the index. The provisional small stream Highlands stratum had the lowest classification efficiency and the large stream Highlands stratum had so few degraded sites that its performance was suspect. At the same time, the coldwater stratum performed well and effectively captured most of the streams draining catchments <5000 acres, so the small stream Highlands stratum was abandoned and the remaining Highlands streams combined as a separate warmwater Highlands stratum (i.e., the remaining Highlands streams outside the geographic boundaries of the coldwater streams stratum). For the four final strata, two additional metrics were added to each new fish IBI, improving the calibration classification efficiency to at least 83%. For the final three new benthic macroinvertebrate IBIs, two to four additional metrics were added to the core suite, improving the calibration classification efficiency to at least 85%. 2.7. IBI validation As described above, we reserved 353 of all sites sampled from 1994 to 2004 for validation of the new MBSS IBIs. This number of sites is comparable to the number of validation sites used to develop the original MBSS IBIs. For the fish IBIs in the Highlands and Coastal Plain, the overall classification efficiencies of the validation sites were even higher than the calibration classification efficiencies (88%). The validation classification efficiency for the fish IBI in the Eastern Piedmont was 760 M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 lower at 71%, but this validation is less reliable because only seven reference and degraded sites from this region were in the validation dataset by chance. The validation classification efficiencies for the benthic IBI were higher than for calibration in the Coastal Plain at 96%, somewhat lower in the Eastern Piedmont at 86% (but again there were only seven validation sites in this stratum) and comparable in the Highlands at 88%. These high classification efficiencies using only validation sites indicate that the performance of the IBIs was not derived from overfitting to the calibration dataset. Therefore, the IBIs are likely to be robust when applied to new data. 3. Comparison of original and new IBIs Using the indicator development process described above, we created new MBSS fish and benthic macroinvertebrate IBIs as shown in Tables 2 and 3. The new fish IBIs differ from the original IBIs in that they divide the original Highlands stratum into two strata, one for coldwater Highlands streams and one for the remaining warmwater Highlands streams. In addition, smaller streams (i.e., those draining <300 ac catchments) that were not included in the original fish IBI development have been included in the new IBIs; therefore the new IBIs can be applied to these smaller streams (25% of stream miles in 2000–2004). The new benthic IBIs differ from the original IBIs in that they divide the original non-Coastal Plain stratum into new Highlands and Eastern Piedmont strata. As with the original benthic and new fish IBIs, smaller (<300 ac) streams are included in the new benthic IBIs. The number and composition of metrics differ between the new and original IBIs. The following metrics from the original fish IBIs are included in the new fish IBIs for the same strata: number of benthic species (adjusted for catchment area); percent tolerants; percent generalists, omnivores and invertivores. The abundance per square meter metric that appeared in the original Coastal Plain and Eastern Piedmont fish IBIs is now in all four new fish IBIs. The new Coastal Plain fish IBI has only six metrics compared to the eight metrics in the original IBI, and the only new metric is the percent non-tolerant suckers (i.e., all suckers except white sucker). The new Eastern Table 2 New fish IBIs for Maryland by stratum and with metric scoring thresholds Thresholds 1 Coastal Plain Abundance per square meter Number of benthic species adjusted Percent tolerants Percent generalists, omnivores, invertivores Percent non-tolerant suckers (all suckers except white sucker) Percent abundance of dominant species Eastern Piedmont Abundance per square meter Number of benthic species adjusted Percent tolerants Percent generalists, omnivores, invertivores Biomass per square meter Percent lithophilic spawners Warmwater Highlands Abundance per square meter Number of benthic species adjusted Percent tolerants Percent generalists, omnivores, invertivores Percent insectivores Percent abundance of dominant species Coldwater Highlands Abundance per square meter Percent tolerants Percent brook trout Percent sculpins 3 5 <0.45 0 >97 100 0.72 0.22 68 92 0 2 >69 40 <0.25 <0.09 >68 100 1.25 0.26 45 80 <4.0 <32 8.6 61 <0.31 <0.11 >80 >96 0.65 0.25 39 61 <1 >89 33 38 2.24 0.81 0 0 0.88 0.22 0.14 0.44 Piedmont fish IBI has six metrics compared to the nine metrics in the original IBI and includes no new metrics. The new warmwater Highlands fish IBI has six metrics compared to the seven metrics in the original Highlands IBI, while the new coldwater Highlands fish IBI has only four metrics, including two new metrics appropriate to its stream type: percent brook trout and percent sculpins. The following metrics from the original benthic IBIs are included in the new benthic IBIs for the same strata: number of taxa and number of EPT taxa. The new Coastal Plain IBI also includes the percent Ephemeroptera and number of scraper taxa metrics M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 Table 3 New benthic IBIs for Maryland by stratum and with metric scoring thresholds Thresholds 1 3 5 Coastal Plain Number of scraper taxa Number of EPT taxa Number of Ephemeroptera taxa Percent intolerant to urban Percent Ephemeroptera Number of Scraper taxa Percent climbers <14 <2 <1 <10 <0.8 <1 <0.9 22 5 2 28 11 2 8 Eastern Piedmont Number of taxa Number of EPT taxa Number of Ephemeroptera taxa Percent intolerant to urban Percent Chironomidae Percent clingers <15 <5 <2 <12 >63 <31 25 11 4 51 24 74 Combined Highlands Number of taxa Number of EPT taxa Number of Ephemeroptera taxa Percent intolerant to urban Percent Tanytarsini Percent scrapers Percent swimmers Percent Diptera <15 <8 <3 <38 <0.1 <3 <3 >50 24 14 5 80 4 13 18 26 from the original IBI, plus three new metrics: number of Ephemeroptera taxa, percent intolerant to urban stressors and percent climbers. The new benthic IBI for Eastern Piedmont includes number of Ephemeroptera taxa, and the new benthic IBI for Highlands includes number of Ephemeroptera taxa and percent Tanytarsini, both of which were included in the original non-Coastal Plain IBI. The Eastern Piedmont benthic IBI has six metrics and the Highlands IBI has eight metrics compared to the nine metrics in the original non-Coastal Plain IBI. The new Eastern Piedmont benthic IBI includes three new metrics and the Highlands IBI four new metrics. In addition to including different combinations of metrics, the new IBIs have different scoring thresholds. Because a new set of reference sites was used to develop the new fish and benthic IBIs, the metric values at the 10th and 50th percentiles of reference were different. For example, the degradation threshold (above which a score of 3 is given) for the abundance 761 per square meter metric changed from 0.42 (old) to 0.45 (new) in the Coastal Plain fish IBIs and from 0.56 to 0.25 in the Eastern Piedmont fish IBIs. For the Coastal Plain benthic macroinvertebrate IBIs, the degradation threshold for number of taxa changed from 11 to 14 and the threshold for the number of EPT taxa changed from 3 to 2. Larger changes occurred for some metrics and are attributable to changes in the reference condition resulting from stricter criteria, more small streams and chance. 3.1. Comparison of how original and new IBIs score reference condition As described above, the new MBSS IBIs were developed using a more restrictive set of reference sites (8% of all sites versus 14% of all sites for the original IBIs). Because stricter thresholds for land use and riparian disturbance were applied, we are more confident that the new IBI reference sites are minimally disturbed. At the same time, the reference sites for the new fish IBIs included smaller streams draining <300 ac that were not included in the original fish IBI. In addition, the sampling design for 2000–2004 on the 1:100,000-scale stream network resulted in more small streams being sampled. The distribution of the new reference sites included 38% that were <1000 ac, compared to 19% of the original reference sites. Including more small streams in the new reference condition means that more natural variability is included in the new IBIs, but that they are representative of smaller streams as well (i.e., they can now be rated). The mean score for all new reference sites was 3.7 using the original fish IBIs and 4.0 using the new fish IBIs. Mean reference site scores were 3.6 for the original benthic IBIs and 3.9 using the new benthic IBIs. For reference sites <1000 ac, the mean benthic IBI was 3.7 compared to 3.6 for larger streams and the fish IBI was 3.3 compared to 3.9. This indicates that smaller streams still scored somewhat lower on the fish IBI using the new fish IBI. 3.2. Comparison of how all streams score with original and new IBIs We conducted a direct comparison of the original and new MBSS IBIs (both fish and benthic macroinvertebrate) by applying them to the 2000–2004 762 M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 MBSS dataset of 1367 sites. The statewide mean for the new fish IBIs was virtually unchanged with an original IBI of 2.91 and a new IBI of 2.93. The statewide mean of the new benthic IBIs was only 3% higher, increasing from 2.96 to 3.07. On a regional basis, the greatest difference in mean scores between original and new IBIs was an increase of 0.64 (16%) for the Coastal Plain benthic IBI and 0.32 (8%) for the Highlands fish IBIs. In the other regions, the mean benthic IBIs decreased 9% in the Highlands and 2% in the Eastern Piedmont, using the new IBIs. The mean fish IBIs decreased 5% in the Coastal Plain while staying the same in the Eastern Piedmont. On a county basis, 17 (one-third) of the 48 possible original and new mean IBI pairs (24 counties times both fish and benthic IBIs) changed by 0.5 units or more. The greatest increase was 1.14 for the benthic IBI in Caroline County and the greatest decrease was 0.58 for benthic IBI in Frederick County. Most of these changes were for the benthic IBIs in the nonCoastal Plain, which was separated into Highlands and Eastern Piedmont strata in the new benthic IBIs. The distributions of stream miles among the four MBSS condition classes (good, fair, poor, very poor) were also somewhat different between the original and new IBIs (Fig. 4). For both the new fish and benthic IBIs, the proportion of stream miles statewide changed by less than 10% in each condition class. The new Highlands fish IBI resulted in 16% more good streams and fewer fair and very poor streams. The increase in proportion of good streams is likely attributable to the appropriately higher scores for coldwater streams which have their own stratum in the new fish IBIs. Overall, the distribution of stream miles in each condition class was more even with the new IBIs than with the original IBIs. The influence of the new coldwater Highlands fish IBI is also evident when comparing it to the original Highlands fish IBI. The mean score for coldwater streams was 3.56 with the new fish IBI and 2.75 with the original fish IBI, an increase of 0.81 (20%). Although a separate blackwater stratum was not developed, the new Coastal Plain fish IBI rates blackwater streams 0.34 (8%) higher and the new Coastal Plain benthic IBI rates them 0.74 (18%) higher. This is likely due to the greater number of blackwater reference sites sampled in 2000–2004 and used to develop the new IBIs; only 24 blackwater Fig. 4. Percentage of stream miles in each condition class statewide for 2000–2004 sites scored with original (98) and new (05) MBSS IBIs. Fish and benthic IBIs are shown separately. reference sites were used to develop the original IBIs, while 64 were used for the new IBIs. Even though the blackwater stream type may still be scored lower than other types, the new IBIs better represent the expectation for natural blackwater streams. The different IBI scores that result from using the new IBIs rather than the old IBIs also affect the designations of watersheds as impaired according to Maryland’s biological criteria (MDE, 2005). These biological criteria are applied to Maryland 8-digit watersheds (or combined watershed Primary Sampling Units, PSUs) with 10 or more MBSS sample sites. Mean IBIs and one-sided 90% confidence interval values are calculated to give one of three ratings:  Does not meet criteria ( fails): If the mean and upper bound of the one-sided 90% confidence interval (upper) of either index (FIBI or BIBI) is less than 3.0, the 8-digit watershed (or PSU) is listed as failing to meet the proposed criteria.  Meets criteria ( passes): If the mean and lower bound of the one-sided 90% confidence interval M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 (lower) of both indices (FIBI and BIBI) are greater than or equal to 3.0, the 8-digit watershed (or PSU) is listed as meeting the proposed criteria.  Inconclusive: All other cases are inconclusive. Applying the original MBSS IBIs to 2000–2004 data, 40 watersheds fail, 37 are inconclusive and 7 pass biological criteria; using the new MBSS IBIs, 31 watersheds fail, 41 are inconclusive and 12 pass. Overall, 22% fewer watersheds fail biological criteria with the new IBIs. The most frequent changes in the designation of individual watersheds are the 17 watersheds that failed with the original IBIs, but that are inconclusive with the new IBIs. In addition, among the 37 watersheds that were inconclusive with the original IBIs, 24 (65%) remain inconclusive with the new IBIs, while 5 pass and 8 fail. 4. Discussion As stated at the outset, refinements to biological indicators can improve the information on stream condition provided to water resource managers. Such changes also entail development costs and infrastructure costs to the host and cooperating programs. Therefore, indicator changes should only be undertaken with specific goals in mind. In Maryland, the development of new fish and benthic macroinvertebrate IBIs achieved the goals of: (1) increasing confidence that the reference conditions are minimally disturbed (less disturbed reference sites), (2) including more natural variation across the geographic regions and stream types of Maryland (including small streams among reference sites so that these streams could be rated) and (3) increasing sensitivity of IBIs with more classes (one additional stratum each for the fish IBI and benthic macroinvertebrate IBI) and different metric combinations (new IBIs with higher classification efficiencies). This was largely possible because of the large number of stream sites in the 1994–2004 MBSS dataset. At the same time, adoption of the new IBIs does entail costs to the MBSS and cooperating programs in the form of: (1) making assessments of stream condition using the original IBIs obsolete, (2) changing impairment decisions, (3) requiring staff and resources to implement the new IBIs and (4) 763 potentially affecting consistency with stream monitoring programs that choose to retain the original IBIs. Nonetheless, these new MBSS IBIs are generally consistent with the old IBIs; they remain transparent and understandable, and they provide clear thresholds of impairment for both the biointegrity and interim (fishable and swimmable) water quality goals. Using the new IBIs it is still possible to calculate joint estimates between sampling rounds and detect trends in stream condition. 4.1. Minimally disturbed reference condition The reference conditions used to develop the new MBSS IBIs represented only 8% of all stream sites in Maryland. They did not include original reference sites that were most likely to be affected by land use changes. For these reasons, we are more confident that the new IBIs are based on minimally disturbed reference conditions for Maryland streams. This is important so that degraded streams are not identified as meeting reference conditions. 4.2. IBIs that better predict degradation Given minimally disturbed reference conditions, the ability of IBIs to distinguish deviation from those reference conditions is based on how predictably IBI scores change with disturbance. This ability to predict deviation comes from: (1) choosing metrics that vary predictably and precisely with disturbance and (2) combining these metrics into an index that consistently changes with disturbance across the natural variation gradients encountered. We reduced the natural variation that each new IBI had to address by increasing the number of geographic or stream type classes, i.e., the number of new MBSS IBIs. We now have four rather than three fish IBIs and three rather than two benthic IBIs. In the case of the new fish IBIs, we increased the natural variation of reference condition by adding smaller streams <300 ac, but this did not adversely affect the performance of the new IBIs given the four strata. Within each stratum (i.e., new IBI), the combination of metrics changed from the old IBIs and in every case the ability of the IBI to distinguish reference from degraded sites (i.e., the classification efficiency) increased (Table 4). By convention, classification 764 M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 Table 4 Comparison of classification efficiencies (CEs) between original and new MBSS IBIs Region Fish IBI Coastal Plain Eastern Piedmont Highlands Warmwater Highlands Coldwater Highlands Benthic IBI Coastal Plain Non-Coastal Plain Eastern Piedmont Highlands Original IBI calibration (validation) CE New IBI calibration (validation) CE Reference CE (%) Degraded CE (%) 74 (72) 90 (94) 86 (75) 85 (88) 96 (71) 89 95 80 97 83 (88) 85 85 78 87 (96) 89 83 93 (86) 91 (88) 94 93 92 88 87 (72) 88 (82) CEs are the percentage of reference and degraded sites that are correctly classified by each IBI. efficiencies above 80% are good and above 90% are excellent. In addition to these good-to-excellent overall IBI classification efficiencies, each new IBI was effective at correctly classifying both reference and degraded sites (Table 4). Misclassification of reference sites (saying they are degraded when they are not) is essentially a false positive or Type I error. Among the new fish IBIs, the classification efficiencies for reference sites ranged from 80% to 95%; among the new benthic IBIs, these classification efficiencies ranged from 89% to 94%. Misclassification of degraded sites (saying they are not degraded when they are) is essentially a false negative or Type II error. Among the new fish IBIs, the classification efficiencies for degraded sites ranged from 78% to 97%; among the new benthic IBIs, these classification efficiencies ranged from 83% to 92%. Low misclassification rates for both reference and degraded sites indicate that the new MBSS IBIs are a good balance between both types of error, i.e., not many degraded streams will be missed, nor will we be ‘‘crying wolf’’ about streams that are actually not degraded. This is important because indicators that make frequent errors or are biased in one direction are of little use for management decisions. 4.3. Applying the new MBSS IBIs The MBSS IBIs are central to water resource management in Maryland and have special implications for the designation of watersheds as impaired under Section 303d of the Clean Water Act (CWA). Therefore, it is critical that stream condition ratings be founded in ecological knowledge and solid science. The MBSS recognizes that there are no truly pristine streams left in Maryland; most have a history of human disturbance and all are affected by atmospheric deposition. Nonetheless, there are high quality streams in Maryland that can be accurately called minimally disturbed and equated with Biological Condition Gradient (BCG) level 2, ‘‘minimal changes in structure and function’’ (EPA, 2005). Adoption of the new IBIs provides us with more confidence that the reference conditions we are using to create IBIs and rate stream condition reflect BCG level 2, rather than BCG level 3, ‘‘evident changes in structure and minimal changes in function.’’ In addition to indicating when stream condition deviates from reference condition (i.e., is degraded), IBIs provide a means of determining the degree to which streams deviate or the ‘‘severity of failing’’ to meet the criterion (Bailey et al., 2004). The original MBSS IBIs used four ‘‘bands’’ of IBI scores to designate stream condition: 1.0–1.9 very poor, 2.0–2.9 poor, 3.0–3.9 fair and 4.0–5.0 good. This convention was retained for the new IBIs. Given the new reference conditions, these bands can be more confidently assigned to the biointegrity goal of CWA (good) and the interim goal of CWA (fair). The two additional bands (i.e., the poor and very poor classes of stream condition) are consistent with variability in stream condition relative to reference condition. Creation of more stream condition bands is not justified by the precision of the IBIs. The limits on M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 IBI precision are to be expected, as IBIs balance sensitivity to degradation and incorporation of natural variability. While IBIs are founded in the concept of biological integrity, they are only a rough approximation of the ecological structure and function of stream resources. We argue that protection of biological diversity in its most expansive definition (CEQ, 1993; Noss and Cooperrider, 1994) cannot be achieved solely through the use of IBIs. Augmented or separate monitoring and assessment focused on rare species and habitats is needed to fully protect stream ecosystems (see Kazyak et al., 2005). 4.4. Continuity across the original and new IBIs We determined that the improvements in the performance of the new IBIs, especially the more accurate coldwater Highlands and Coastal Plain fish IBIs, and the ability to rate the large number of small streams with the fish IBI warranted adoption of the new IBIs. At the same time, the final construction of the new fish and benthic IBIs for the MBSS is very similar to the original MBSS IBIs. The basis in reference condition, the discrete 1–3–5 scoring and the four bands of stream condition were retained (an analysis showing that the use of continuous 0–100 scoring based on the range of values for all sites does not improve on discrete scoring is described in Southerland et al., 2005a). More elaborate modeling of reference condition (e.g., independent of geographic or stream type classification) was not incorporated. While new IBIs need to be calculated (using new metric combinations and thresholds), the IBI application process is unchanged. This is important so that cooperating programs (e.g., many Maryland counties) do not have to undertake infrastructure changes to continue their programs. As needed, the new MBSS IBIs can be calculated for stream sites sampled in the past to maintain continuity of the long-term MBSS dataset. It is also possible to convert IBI results between different sampling periods by using regressions between the original and new IBIs. In general the regression R2 are about 0.75; lower R2 occurs for the non-Coastal Plain benthic IBI where two new strata have been created and for the original Highlands fish IBI when compared to the new coldwater fish IBI, as expected. 765 Five of the metrics in the new MBSS benthic IBIs are shared by the benthic indices (Stream Condition Indices) of Virginia and West Virginia. The metric combinations in these indices performed adequately in Maryland but with lower classification efficiencies. Similarly, the new MBSS IBIs also share metrics with the Montgomery County IBIs. Comparability studies (Vølstad et al., 2003a; Southerland et al., 2005b) indicate that the indices for all these programs can be readily integrated. Acknowledgements We would like to thank the entire MBSS team, including field crews, laboratory staff and analysts, that sustain the program and provide the data needed to develop and apply the IBIs. Funding for this effort came primarily from the Maryland DNR, but was supplemented by U.S. EPA. Many other cooperators contributed valuable discussion. Useful comments on the manuscript were provided by Paul Angermeier and Greg Pond. Appendix A. Supplementary data Supplementary data associated with this article can be found, in the online version, at doi:10.1016/ j.ecolind.2006.08.005. References Angermeier, P.L., Smogor, R.A., Stauffer, J.R., 2000. Regional framework and candidate metrics for assessing biotic integrity in Mid-Atlantic Highland streams. Trans. Am. Fisheries Soc. 129, 962–981. Bailey, R.C., Norris, R.H., Reynoldson, T.B., 2004. Bioassessment of Freshwater Ecosystems: Using the Reference Condition Approach. Springer Science/Business Media, Inc., New York, NY. Barbour, M.T., Gerritsen, J., Griffith, G.E., Frydenborg, R., McCarron, E., White, J.S., 1996. A framework for biological criteria for Florida streams using benthic macroinvertebrates. J. N. Am. Benthol. Soc. 15, 185–211. Blocksom, K., 2003. A performance comparison of metric scoring methods for a multimetric index for Mid-Atlantic Highlands streams. Environ. Manage. 31 (5), 670–682. Bressler, D.W., Paul, M.J., Stribling, J.B., 2004. Development of Tolerance Values for Benthic Macroinvertebrates in Maryland. 766 M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 Tetra Tech, Inc., for Versar, Inc., and Maryland Department of Natural Resources. Council on Environmental Quality (CEQ), 1993. Incorporating Biodiversity Considerations Into Environmental Impact Analysis Under the National Environmental Policy Act. Council on Environmental Quality, Executive Office of the President, Washington, DC. U.S. Environmental Protection Agency (EPA), 2005. Use of Biological Information to Tier Designated Aquatic Life Uses in State and Tribal Water Quality Standards. Review Draft. U.S. Environmental Protection Agency, Washington, DC. Fore, L.S., Karr, J.R., Wisseman, R.W., 1996. Assessing invertebrate responses to human activities: evaluating alternative approaches. J. N. Am. Benthol. Soc. 15, 212–231. Gerritsen, J., Burton, J., Barbour, M.T., 2000. A Stream Condition Index for West Virginia Wadeable Streams. Report by Tetra Tech, Inc., to U.S. EPA Region 3, Environmental Services Division and U.S. EPA, Office of Science and Technology and Office of Water, Washington, DC. Hawkins, C.P., Norris, R.H., Gerritsen, J., Hughes, R.M., Jackson, S.K., Johnson, R.K., Stevenson, R.J., 2000. Evaluation of the use of landscape classifications for the prediction of freshwater biota: synthesis and recommendations. J. N. Am. Benthol. Soc. 19, 541–556. Karr, J.R., Dudley, D.R., 1981. Ecological perspective on water quality goals. Environ. Manage. 5, 55–68. Karr, J.R., Fausch, K.D., Angermeier, P.L., Yant, P.R., Schlosser, I.J., 1986. Assessment of Biological Integrity in Running Waters: A Method and its Rationale, vol. 5. Illinois Natural History Survey Special Publication, Champaign, IL. Kazyak, P.F., Kilian, J.V., Stranko, S.A., Hurd, M.K., Boward, D.M., Millard, C.J., Schenk, A., 2005. Maryland Biological Stream Survey 2000–2004. Stream and Riverine Biodiversity, vol. 9. Report to Monitoring and Non-Tidal Assessment Division, Maryland Department of Natural Resources. CBWPMANTA-EA-05-6. Kilian, J.V., 2004. Fish Assemblages in Non-tidal Streams of Maryland: The Influences of Assemblage Spatial Variability on the Index of Biotic Integrity. Master of Science Thesis. University of Maryland, Frostburg. Klauda, R., Kazyak, P., Stranko, S., Southerland, M., Roth, N., Chaillou, J., 1998. The Maryland biological stream survey: a state agency program to assess the impact of anthropogenic stresses on stream habitat quality and biota. Environ. Monit. Assess. 51, 299–316. Lee, P., Smyth, C., Boutin, S., 2004. Quantitative review of riparian buffer guidelines from Canada and the United States. J. Environ. Manage. 70, 165–180. Lyons, J., Wang, L., Simonson, T.D., 1996. Development and validation of an Index of Biotic Integrity for coldwater streams in Wisconsin. N. Am. J. Fisheries Manage. 16, 241–256. Maryland Department of Environment (MDE), 2005. Water Quality Standards. http://www.mde.state.md.us/ResearchCenter/Data/ WaterQualityStandards. Naiman, R.J., Décamps, H., 1997. The ecology of interfaces: riparian zones. Annu. Rev. Ecol. Syst. 28, 621–658. Noss, R.F., Cooperrider, A.Y., 1994. Saving Nature’s Legacy: Protecting and Restoring Biodiversity. Defenders of Wildlife. Island Press, Washington, DC. Paul, M.J., Stribling, J.B., Klauda, R., Kazyak, P., Southerland, M., Roth, N., 2003. A Physical Habitat Index for Freshwater Wadeable Streams in Maryland. Monitoring and Non-Tidal Assessment Division, Maryland Department of Natural Resources. CBWP-MANTA-EA-03-4. Pirhalla, D.E., 2004. Evaluating fish-habitat relationships for refining regional Indexes of Biotic Integrity: development of a tolerance index of habitat degradation for Maryland stream fishes. Trans. Am. Fisheries Soc. 133, 144–159. Roth, N., Erb, L., Rogers, G., Vølstad, J., 2005a. Maryland Biological Stream Survey 2000–2004. Quality Assurance Report, vol. XV. Report to Monitoring and Non-Tidal Assessment Division, Maryland Department of Natural Resources. Roth, N.E., Vølstad, J.H., Erb, L., Weber, E., Kazyak, P., Stranko, S., Boward, D., 2005b. Maryland Biological Stream Survey 2000–2004. Laboratory, Field, and Analytical Methods, vol. V. Report to Monitoring and Non-Tidal Assessment Division, Maryland Department of Natural Resources. CBWPMANTA-EA-05-3. Roth, N.E., Southerland, M.T., Chaillou, J.C., Kazyak, P.F., Stranko, S.A., 2000. Refinement and Validation of a Fish Index of Biotic Integrity for Maryland Streams. Report to Monitoring and NonTidal Assessment Division, Maryland Department of Natural Resources. CBWP-MANTA-EA-00-2. Roth, N., Southerland, M., Chaillou, J., Klauda, R., Kazyak, P., Stranko, S., Weisberg, S., Hall Jr., L., Morgan II, R., 1998. Maryland Biological Stream Survey: development of an Index of Biotic Integrity for fishes in Maryland streams. Environ. Monit. Assess. 51, 89–106. Southerland, M., Rogers, G., Kline, M., Morgan, R., Boward, D., Kazyak, P., Klauda, R., Stranko, S., 2005a. New Biological Indicators to Better Assess Maryland Streams. Prepared for Monitoring and Non-Tidal Assessment Division, Maryland Department of Natural Resources. Southerland, M., Vølstad, J., Erb, L., Weber, E., Rogers, G., 2005b. Proof of Concept for Integrating Bioassessment Results from Three State Probabilistic Monitoring Programs. Report to U.S. Environmental Protection Agency, Mid-Atlantic Integration Team, Fort Meade, MD. Southerland, M.T., Jung, R.E., Baxter, D.P., Chellman, I.C., Mercurio, G., Vølstad, J.H., 2004. Streamside salamanders as indicators of stream quality in Maryland. Appl. Herpetol. 2, 23–46. Stranko, S.A., Hurd, M.K., Klauda, R.J., 2005. Applying a large, statewide database to the assessment, stressor diagnosis, and restoration of stream fish communities. Environ. Monit. Assess. 23, 99–121. Stribling, J.B., Jessup, B.K., White, J.S., Boward, D., Hurd, M., 1998. Development of a Benthic Index of Biotic Integrity for Maryland Streams. Report to Monitoring and Non-Tidal Assessment Division, Maryland Department of Natural Resources. CBWP-MANTA-EA-98-3. Vølstad, J.H., Neerchal, N.K., Roth, N.E., Southerland, M.T., 2003a. Combining biological indicators of watershed condition from M.T. Southerland et al. / Ecological Indicators 7 (2007) 751–767 multiple sampling programs—a case study from Maryland, USA. Ecol. Indicators 3, 13–25. Vølstad, J.H., Roth, N.E., Mercurio, G., Southerland, M.T., Strebel, D.E., 2003b. Using environmental stressor information to predict the ecological status of Maryland non-tidal streams as 767 measured by biological indicators. Environ. Monit. Assess. 84, 219–242. Weisberg, S.B., Ranasinghe, J.A., Dauer, D.M., Schaffner, L.C., Diaz, R.J., Frithsen, J.B., 1997. An estuarine benthic Index of Biotic Integrity (B-IBI) for Chesapeake Bay. Estuaries 20, 149–158.