All datasets have been collected by professional scientists or research agencies who have kindly shared their data with Hudson Data Jam. Each dataset contains a link to a Google Sheet with the data and a brief introductory section describing the research. If you want to use the data in Microsoft Excel you can download as an Excel sheet from within the Google file.
For a Google Drive folder of sample graphs to guide student exploration, teachers can contact firstname.lastname@example.org.
Tuva & the Hudson Valley Data Portal
We’ve created a partnership with TuvaLabs, Inc. to host all of our DJ datasets on their interactive graphing platform. Students can drag and drop the variables right onto the axes and build graphs in seconds without the complexity of manipulating a spreadsheet.
Hudson Valley Data Portal
Dataset levels are derived by looking at the number of factors in the dataset and by the sheer amount of data collected. Most middle schoolers will be successful with a Level 1 or 2 dataset, and the appropriate level for your high schoolers depends on their data experience and determination. Drop us a line if you need help selecting an appropriate dataset for your student.
Level 1= Easy
Level 2= Moderate
Includes an additional PDF with background information and extra resources. These topics are a good starting place for students who are new to data analysis.
Air quality refers to the health and safety of the atmosphere and is determined based on the amount of pollutants in the air. In this dataset, students can explore how air pollution has changed over time in the USA and in New York.
There are many monitoring sites along the Hudson River. These sites collect data, such as barometric pressure, precipitation, relative humidity, air temperature, surface water temperature, wind direction, and wind speed.
Zebra mussels were first detected in the Hudson in 1991. By 1992 they had spread throughout the freshwater and slightly brackish parts of the estuary and had a biomass greater than the combined biomass of all other consumers.
Air pollution from traffic can be a major problem in many parts of the world. This dataset examines how traffic congestion and associated pollutants are related to the demographics of the populations that live near traffic.
When scientists do a 'budget' of a water source, it helps to think of a bank account. You want to know how much goes in, and how much goes out, of your bank account.
Zebra mussels were first detected in the Hudson in 1991. By 1992 they had spread throughout the freshwater and slightly brackish parts of the estuary.
This dataset contains information on the number of European honey bee colonies, the use of pesticides, and the acres of Bt Corn planted in the USA since 1939.
Samples were collected from the East Branch of the Wappinger Creek on Cary Institute grounds in Millbrook, NY.
Trapa is a floating invasive species that was introduced to the Hudson River. Vallisneria is a submersed (underwater) native species in the Hudson River.
In this dataset, students can explore how the prevalence of Lyme disease has changed over time in the Northeast.
Organic matter that is washed onto the shore, or "wrack," is an important part of shoreline ecosystems because it provides habitats for macroinvertebrates and nutrients for both terrestrial and aquatic ecosystems.
Wastewater enters the Hudson River from point sources including municipal and industrial wastewater treatment plants, combined sewer overflows, urban storm water, and tributaries of the Hudson River such as Fishkill Creek.
May 20 – Competition registrations and parent/guardian consent forms due. (extended deadline)
May 27– Project submissions due online by 11:59pm EST.
June 19 – Virtual Data Jam Expo and Awards Ceremony. All prizes awarded during the event.