Regional Monitoring Networks Tools

GIS-based site screening
Continuous sensor data
Biological data
Other tools

Background

During the RMN site selection process, a screening procedure is performed to identify locations along a standardized disturbance gradient for monitoring. In many regions, participants target many minimally disturbed or ‘reference’ systems to consider as a baseline. Locations of concern or interest are compared with reference sites, which means good quality data are needed for the reference site(s) consistently over time. If the aquatic communities in the reference locations change, the RMN will need to characterize how, and if possible why, the community shifted. Multiple reference sites within an RMN will help programs detect and assess shifting baselines.

Also, the natural physical attributes of each sampling site (such as drainage area, slope, and sinuosity) are described for classification purposes. Attempts are made to use GIS data that are available nationwide and that are derived using common data sources and methods.

Tools for GIS-based site screening

Currently, the stream RMNs use several ArcMap tools developed by EPA. These tools are described in the USEPA (2017) report “Procedures for Delineating and Characterizing Watersheds for Stream and River Monitoring Programs.” The ArcMap tools generate the following types of data (based on exact watershed delineations):

Proportions of each National Land Cover Database (NLCD) land cover category at three spatial scales: whole watershed, within 1- and within 5-km radii of the sampling stations
Baseflow index at the sampling station
Channel slope and sinuosity at each station
The slope of the watershed (average, minimum, maximum, range, standard deviation) relative to the sampling location
The number of dams, mines, National Pollutant Discharge Elimination System (NPDES) and Comprehensive Environmental Response, Compensation, and Liability Act (CERCLA) sites within each watershed

The product is a watershed delineation shapefile that has fields with all the watershed properties listed above. The report and ArcMap tools can be found on Procedures For Delineating And Characterizing Watersheds For Stream And River Monitoring Programs (Final Report) page.

In addition, EPA uses stream catchment (StreamCat) data (Hill et al. 2016), the Index of Watershed (IWI), and the Index of Catchment Integrity (ICI) (Thornbrugh et al. 2018). The StreamCat data set contains hundreds of metrics, both anthropogenic (e.g., urbanization, agriculture, and mines) and natural (e.g., soils, geology, climate, and land cover). The IWI and ICI are overall indices of watershed integrity based on multiple measures. They are all based on the free National Hydrography Dataset Plus version 2.1 (NHDPlusV2) geospatial data set (McKay et al. 2012)

For the lake RMNs, natural and anthropogenic data from the Lake-Catchment (LakeCat) dataset (Hill et al. 2018) are being used. LakeCat is a nationally consistent data set that currently has 136 natural and anthropogenic geospatial metrics for lakes within the conterminous United States. The data are also based on the free NHDPlusV2 geospatial dataset (McKay et al. 2012)

Other metrics may be added as new geospatial layers that are relevant to lake conditions become available.

Site screening methods are under development for wetlands RMNs.

Tools for continuous sensor data

Most stream RMN sites collect year-round temperature and hydrological data at 30-minute intervals or less with continuous sensors. At lake RMN sites, where feasible, partners are deploying moored arrays of continuous sensors that record temperature, dissolved oxygen and water level data year-round at 60-minute intervals or less.

These high frequency measurements capture temporal patterns and episodic events that might otherwise be missed with limited discrete measurements (Figure 1). Continuous data, however, also can pose a challenge. Specifically, the amount of data collected can be overwhelming, especially for RMN partners who might lack data management systems capable of accommodating the volume of continuous data. To help RMN partners manage the data, EPA and partners developed tools and training materials to make quality assurance and quality control (QA/QC), summarizing, visualizing, and managing continuous data easier, including the ContDataQC RShiny App or ContDataQC R package. The RShiny App is accessible through a browser, while the R package requires R software to be installed. Detailed documentation with installation instructions, example codes, and test data can be found on the ContDataQC GitHub page.

The ContDataQC RShiny App (or R package) has three main functions: generate QC reports (called ‘QC raw data’ in the R Shiny app and ‘QCRaw’ in the R package), merge QC’d data files from different time periods (called ‘Aggregate QC’d data’ in Shiny and ‘Aggregate’ in the package) and generate time series plots and basic summary statistics (called ‘Run summary statistics’ in Shiny and ‘SummaryStats’ in the package).
Main functions (which need to be run sequentially) include:

‘QC raw data’, which generates QC reports that include results from four ‘flag’ tests, missing data reports, and time series plots.
‘Aggregate QC’d data’, which combines QC’d data files from different time periods into a single file. Columns with matching names are merged. Columns with non-matching names are carried through as standalone columns.
‘Run summary statistics’ generates two sets of comma-delimited (CSV) spreadsheets (one with daily means and the other with a wider range of summary statistics - e.g., minimum, median, maximum, quartiles) and a PDF with time series plots for each parameter.

Figure 1. Continuous sensors provide robust data sets that capture natural temporal patterns and episodic events that may otherwise be missed by limited numbers of discrete measurements. — Figure 1. Continuous sensors provide robust data sets that capture natural temporal patterns and episodic events that likely would otherwise be missed by limited numbers of discrete measurements.

Figure 2 shows the general workflow that users follow when using the ContDataQC R tools. Instructional materials are available on the Shiny app website and GitHub.

Figure 2. Steps for obtaining and working with continuous sensor data. The main functions should be run sequentially. Items in bold blue text show functions that can be performed with ContDataQC.

Tools for image processing

A number of RMN sites use time-lapse cameras to capture images of stream flow or water levels in lakes. Site information and images are uploaded to the Flow Photo Explorer web application hosted by the U.S. Geological Survey (USGS). EPA Regions 1, 2, 3, and 5, USEPA Office of Research and Development, and the U.S. Geological Survey (USGS) are collaborating to further develop the Flow Photo Explorer . The collaboration is developing algorithms to estimate water level and discharge based on the images provided by state, tribal, and EPA regional partners. This innovative method makes it easier to calculate stream flow in small streams and remote locations that are not otherwise gauged. Lake and wetland images also will be analyzed to produce a continuous record of water levels and other parameters.

Tools for biological data

At stream RMN sites, at a minimum, macroinvertebrates are collected. Collection of fish and periphyton are encouraged but optional. Macroinvertebrates are targeted because they are the most commonly-used animal assemblages in stream bioassessments. They make good indicators for several reasons:

they respond to a wide range of stressors;
many (not all) taxa are easily and consistently identified;
taxa have limited mobility, short life cycles, and are evolutionarily diverse; and
traits (e.g., thermal preferance) can be linked to environmental conditions.

EPA is currently supporting the development of R tools to help RMN partners prepare and analyze the biological data collected at RMN sites. One tool for biological data is the BioMonTools R package, which currently performs the functions in Table 1 that are marked with ‘X'.

For lakes, biological data are also of high importance but are considered a longer-term pursuit. For many programs, biological protocols for lakes are still in a formative stage, and more work is needed to achieve consensus on which are most suitable for this regional effort. Lake RMN partners are planning to use the RMNs as a forum for advancing methods. Potential assemblages include sediment diatoms, fish, macroinvertebrates, macrophytes, zooplankton, phytoplankton and soft-bodied algae from the water column.

Table 1. BioMonTools functions. ‘X’ means the function is available for that assemblage.
BioMonTools function	Description	Bugs	Fish	Algae
Subsample (Rarify)	Subsamples abundance data to a fixed count per sample. For example, if 300-organisms are the target for your RMN, and any of your samples have more than 300 organisms, you can use this function to randomly subsample your data so that the total number of individuals is 300. This is done to make species or other taxonomic richness metrics comparable across RMN samples. The function code is from USEPA Corvallis John Van Sickle’s R code for RIVPACS (v1.0, 2005-06-10).	X	X	X
Exclude taxa	Identifies/marks non-distinct taxa so that those taxa can be excluded from richness metric calculations. This is done to avoid double counting taxa that may be redundant. For example, if organisms identified as Dytiscidae (family-level, diving beetles) and Oreodytes (genus of diving beetles) are both present in a sample, Dytiscidae is excluded from the richness metrics for that sample. This exclusion rule is applied on a sample by sample basis.	X
Calculate metrics	Calculates many different metrics based on the attributes in the input file. Attributes can include thermal indicator, functional feeding groups (FFGs), habit, tolerance value and life cycle metrics. Users can calculate the full suite of metrics or adapt the code to output only the desired metrics.	X
Taxa check	Checks the taxa list from the input file against the RMN "Master Taxa List" file, and notifies the user if any of the taxa do not match.

Other Tools

Other tools applicable to RMNs include: