All datasets have been collected by professional scientists or research agencies who have kindly shared their data with Hudson Data Jam. Each dataset contains a link to a Google Sheet with the data and a brief introductory section describing the research. If you want to use the data in Microsoft Excel you can download as an Excel sheet from within the Google file.
For a Google Drive folder of sample graphs to guide student exploration, teachers can contact firstname.lastname@example.org.
Tuva & the Hudson Valley Data Portal
We’ve created a partnership with TuvaLabs, Inc. to host all of our DJ datasets on their interactive graphing platform. Students can drag and drop the variables right onto the axes and build graphs in seconds without the complexity of manipulating a spreadsheet.
Hudson Valley Data Portal
Dataset levels are derived by looking at the number of factors in the dataset and by the sheer amount of data collected. Most middle schoolers will be successful with a Level 1 or 2 dataset, and the appropriate level for your high schoolers depends on their data experience and determination. Drop us a line if you need help selecting an appropriate dataset for your student.
Level 1= Easy
Level 2= Moderate
Includes an additional PDF with background information and extra resources. These topics are a good starting place for students who are new to data analysis.
Looking at air temperature records can tell us about the climate of a certain location. In this case, we are looking at Poughkeepsie, a city in the Hudson Valley that is located right near the Hudson River
These data come from the National Oceanic and Atmospheric Administration (NOAA) Battery Park monitoring station in New York City, and cover the years 1856-2014.
These data show water quality levels for dissolved oxygen and fecal coliform bacteria at Manhattan.
Researchers at the Cary Institute set up sample plots on the Cary Institute grounds in Millbrook, NY. Researchers searched the following substrates within the plots: live trees, dead trees, leaf litter, and rocks.
These data show the annual average water temperature for the Hudson River at Poughkeepsie, NY from 1946-2012.
Hurricanes are a type of tropical cyclone or severe tropical storm. These catastrophic storms can produce significant thunderstorms, heavy rainfall, floods, and wind gusts exceeding 155 miles per hour.
This dataset will allow you to explore connections between tick populations, their mouse hosts, and the acorns that feed the mice.
Data show a 123-year record (1885-2008) of first arrival date of select migratory birds in Dutchess County, NY.
These data show the salinity (salt) levels at seven sites along the Hudson River.
Zebra mussels were first detected in the Hudson in 1991. By 1992 they had spread throughout the freshwater and slightly brackish parts of the estuary.
Healthy aquatic habitats usually have dissolved oxygen levels at or above 80% saturation. Most fish and other organisms cannot live below 30% dissolved oxygen saturation, which is considered hypoxic.
Dataset representing wildlife encounters recorded by trail cameras during the late summer and fall, 2015-2016. [Location: Cary Institute, Millbrook NY]
In this dataset, you can explore how trends have changed related to milk production and sales in New York over time, as well as compare the environmental impact of milk production vs. apple production.
This dataset contains information on the number of European honey bee colonies, the use of pesticides, and the acres of Bt Corn planted in the USA since 1939.
The annual Riverkeeper Sweep is a day of service for the Hudson River. In 2016, a select number of sites began to classify and count each piece of trash they pick up.
Register for Data Jam
April 16 – Competition registrations and parent/guardian consent forms due.
April 30– Project submissions due online by 11:59pm EST.
May 21 – Virtual Data Jam Expo and Awards Ceremony. All prizes awarded during the event.