The Undergraduate Forestry Data Science (or, as we lovingly call it: “UFDS”) program is a joint collaboration between the US Forest Service Rocky Mountain Research Station, Forest Inventory and Analysis Program (FIA), Bucknell University, and Reed College that engages undergraduate students with authentically motivated research questions in statistics, data science, and forestry. The co-directors of the program are George Gaines (FIA), Kelly McConville (Bucknell), and Grayson White (Reed). In summer 2026, the UFDS program will be held at Reed College, but in the past the program has taken place at Whitman College, Swarthmore College, Reed College, Harvard University, and Bucknell University.
FIA is responsible for monitoring the status and trends in forested ecosystems throughout the US. As part of UFDS, students work on research questions generated by the FIA program and collaborate directly with research statisticians and foresters at FIA. With the rise of new data sources, such as satellite imagery and large scale photography, and with the explosion of new statistical learning tools, a wealth of estimation techniques are available to consider. But with these new methods also come a load of important statistical questions related to robustness, bias, efficiency, and more!
Here’s a rough timeline of the summer research process:
FAQs
We have so many answers that one of us even wrote a whole article about this question! Here’s the shortened version:
Learning by doing data science: Practicing data science can really help you develop as a data scientist.
Communication skills: You will have multiple opportunities to share your work (both in writing and orally) to your peers, your mentors, the stakeholders, and novices. You will receive feedback to help you hone your communication skills.
Professional identity and belonging: Research can help strengthen your connection to the discipline of statistics.
Graduate school and career preparation/clarity: The experience will demystify what research is and help you decide if you want to pursue an advanced degree. Plus, grad school or not, the tools and skills learned will help prepare you for your professional life after undergrad.
And, it is fun: The data are messy! The questions are vague! The answers are unknown! What more could you want?
At the start of a project, it is very difficult to predict whether or not it will result in a publication. And, for some projects, a journal article may not be the most useful final product. So, there is no certainty about whether or not your work will be published, but there will be many ways to share your work. For example, the group will present their findings to FIA researchers and will be expected to participate in any relevant campus research presentation events. We will also strongly encourage you to submit the final technical report to the Undergraduate Statistics Research Project Competition and/or a video presentation to the Electronic Undergraduate Statistics Research Conference (eUSR). One of the 2022 projects won “Best Video Presentation” at this year’s eUSR! On top of all that, we will also look for relevant statistics and data science conferences for you to share the work.
Deliverables from previous projects have included journal articles, peer-reviewed technical reports, dashboards, and software development (links include an example of each).
The work will be highly collaborative. We will start most days with a team meeting where everyone presents their progress, discusses issues, and talks through their next steps. For the rest of the day, your time will likely be split between your projects and will be a mix of coding, writing, problem-solving, and dealing with merge conflicts in GitHub.
All our work will be done using R/RStudio and git/GitHub. Previous experience with R is strongly recommended but previous experience with git is not.
The projects will vary in terms of the computational and statistical skills needed, but each research fellow should have prior experience coding in R and building statistical models. Useful courses to have taken include an intro stats course, a coding course, and a modeling course. If you haven’t taken these sorts of courses, you are still encouraged to apply but should address your level of proficiency in R and your experience with statistical modeling in your application. If you don’t have prior experience in R but can code in another language (such as Python), make sure to mention this in your application. Those who are new to R will be expected to learn and use R during UFDS.
No! While some of the former UFDS students have been statistics majors, others have majored in other disciplines such as Economics or English. Coming from a different field often brings a very valuable and unique perspective!
Tree art!
During the summer of 2022, one of our research students, Jing Shang, created an artistic rendering of every team member’s favorite tree.