Structure of GLAMR

GLAMR — a centralized, public repository of ecosystem metabolism estimates and related data for lakes, ponds, reservoirs, and other lentic water bodies around the world.

Photo: Cary Institute

GLAMR includes five primary types of information:

High-frequency automated sensor measurements of dissolved oxygen concentration, temperature profiles, wind speed, and other variables needed to estimate rates of ecosystem metabolism.
Modeled time series of daily ecosystem metabolism, with uncertainties.
Characteristics of the lake and its watershed including area and volume, watershed area and land use, latitude, elevation, etc.
Observed time series of nutrient concentrations, zooplankton abundances, and other limnological characteristics, where available.
Rich metadata in compliance with the Ecological Metadata Language (EML) standard and including, where available, lake identifiers to facilitate links with other publicly available datasets such as HydroLakes (global) and LAGOS-US (United States).

We use the Apache Parquet column-based file format for data types 1 and 2 (high-frequency sensor measurements and daily ecosystem metabolism estimates with uncertainties), to facilitate handling of these large (gigabytes) files. Parquet files are easily readable into R or Python, and have been used previously in similar data repositories (McAfee et al. 2024). Data types 3 and 4 (lake characteristics and time series of supplementary variables) involve smaller files, which are stored as .csv. Data type 5 (metadata) complies with the Ecological Metadata Language (EML) standard.

Breadcrumb

Structure of GLAMR

GLAMR: The Global LAke Metabolism Respository >>

Stay on top of Cary science & events