Data Methodology & Quality Assurance
Complete documentation of sampling protocols, quality assurance procedures, parameter definitions, and data provenance. This page enables researchers to verify, cite, and reproduce analyses using UDC WRRI water quality data.
Data Sources & Provenance
Every reading in the database is tagged with its source for full traceability
Real-time and daily values from USGS National Water Information System. Instantaneous values via the NWIS IV Web Service.
Historical water quality data from EPA's Water Quality Exchange portal. Provides longitudinal data back to 2020.
Initial dataset derived from published averages and modeled values for station commissioning. Being progressively replaced by measured data.
Field measurements entered by WRRI researchers and trained student technicians. Includes grab samples and portable meter readings.
Data Dictionary
Complete definitions for every parameter collected, including measurement methods, valid ranges, and regulatory standards
Water Temperature
field: temperatureTemperature of water at sampling depth. Affects dissolved oxygen capacity, metabolic rates, and aquatic organism survival.
Dissolved Oxygen
field: dissolved_oxygenConcentration of molecular oxygen dissolved in water. Critical for fish and macroinvertebrate survival. Levels below 5 mg/L indicate stress; below 2 mg/L is hypoxic.
pH
field: phMeasure of hydrogen ion concentration (acidity/alkalinity). Affects nutrient availability, metal toxicity, and biological processes.
Turbidity
field: turbidityMeasure of water clarity caused by suspended particles. High turbidity reduces light penetration, affecting photosynthesis and can indicate erosion or pollution.
Specific Conductance
field: conductivityAbility of water to conduct electrical current, indicating total dissolved ion concentration. Elevated values may indicate road salt, sewage, or industrial discharge.
E. coli
field: ecoli_countFecal indicator bacteria. Indicates potential presence of pathogens from sewage or animal waste. Primary indicator for recreational water safety.
Nitrate-Nitrogen
field: nitrate_nInorganic nitrogen form readily used by algae and plants. Excess leads to eutrophication, algal blooms, and oxygen depletion.
Total Phosphorus
field: phosphorusLimiting nutrient for freshwater algal growth. Major contributor to eutrophication in the Anacostia. Sources include fertilizer runoff, wastewater, and erosion.
Sampling Protocols
Standard operating procedures for field data collection
Continuous Monitoring Stations
- Multi-parameter sondes (YSI 6600 / EXO2) deployed at fixed stations
- 15-minute recording interval for temperature, DO, pH, turbidity, conductivity
- Sensors calibrated monthly using NIST-traceable standards
- Anti-fouling wipers activated before each measurement cycle
- Data transmitted via cellular telemetry to USGS NWIS database
- Backup manual readings during maintenance windows
Grab Sampling (E. coli, Nutrients)
- Samples collected mid-channel at 0.3m depth (wadeable) or from bridge with weighted sampler
- Sterile 500mL Whirl-Pak bags for bacteriological samples
- Acid-washed HDPE bottles for nutrient analysis (pre-rinsed 3×)
- Samples stored on ice (4°C) and transported to lab within 6 hours
- E. coli processed within 24 hours per EPA Method 1603
- Nutrient samples filtered (0.45µm) and preserved with H₂SO₄ for nitrogen/phosphorus
Stormwater BMP Monitoring
- Paired influent/effluent sampling at green infrastructure installations
- Flow-weighted composite sampling during storm events (ISCO 6712)
- Minimum 3 first-flush events captured per quarter
- Runoff volume measured via calibrated flumes and pressure transducers
- Pre/post performance metrics: pollutant removal efficiency (%)
- Rainfall intensity recorded by co-located tipping bucket gauge
Field QC Requirements
- Field duplicate collected every 10th sample (≥10% frequency)
- Equipment blank run at start of each sampling event
- Trip blank accompanies every cooler of samples
- Field meter calibration documented in log book before each use
- Chain of custody form signed at each transfer point
- GPS coordinates recorded at each sampling location (±3m accuracy)
Quality Assurance / Quality Control
Procedures ensuring data reliability and fitness for research use
Automated Validation (Ingest Pipeline)
- Physical range checks reject impossible values (e.g., pH > 14, negative DO)
- USGS -999999 sentinel values filtered before storage
- Duplicate timestamp detection per station (same source + time = skip)
- Validation warnings logged and returned in API response
- All rejected values recorded with reason for audit trail
Manual Review Procedures
- Monthly data review by WRRI research staff
- Time-series plots inspected for sensor drift or fouling artifacts
- Cross-parameter consistency checks (e.g., high temp + low DO = expected)
- Lab duplicate RPD must be ≤25% for acceptance
- Flagged values annotated but retained for transparency (not deleted)
Validation Ranges (Automated Rejection Thresholds)
| Parameter | Min | Max | Unit | Action if Out-of-Range |
|---|---|---|---|---|
| Water Temperature | -5 | 45 | °C | Value set to NULL + warning logged |
| Dissolved Oxygen | 0 | 20 | mg/L | Value set to NULL + warning logged |
| pH | 0 | 14 | Standard units | Value set to NULL + warning logged |
| Turbidity | 0 | 4000 | NTU | Value set to NULL + warning logged |
| Specific Conductance | 0 | 10000 | µS/cm | Value set to NULL + warning logged |
| E. coli | 0 | 100000 | CFU/100mL | Value set to NULL + warning logged |
| Nitrate-Nitrogen | 0 | 100 | mg/L | Value set to NULL + warning logged |
| Total Phosphorus | 0 | 50 | mg/L | Value set to NULL + warning logged |
Ingestion History
Log of all data ingestion events — when data was fetched, how many records were added, and any errors encountered
How to Cite This Data
Recommended citations for academic publications and reports
Dataset Citation (APA 7th)
API Endpoint for Programmatic Access
CSV and JSON exports include machine-readable citation metadata. All exports include the source field for each reading.