National Parks Scraping Project
Welcome
This website was created by Gunnar Griffith and Elijah Barnes as part of our Data Science Process Final Project.
Project Overview
This project explores long-term attendance trends across U.S. National Parks. It combines data from official APIs and custom web scraping to build an updated, reproducible dataset.
- A reproducible Python package: nationalparksdata
- A full data pipeline for generating updated annual datasets
- Interactive visual analysis through a Streamlit application
Resources
GitHub Repository
Access the full source code and documentation here:
GitHub Repository
Python Package
We built a pip-installable package called nationalparksdata that allows users to reproduce and update the dataset annually.
A full tutorial for installation and usage is included in the repository.
Interactive App
We developed a Streamlit application to visualize park attendance trends and explore the dataset interactively.
Written Report
Link to the written report summarizing our findings and methodology.
Written Report
Date Last Updated
April 22, 2026