Improving transit services using ORCA data

A graphic depicting traffic interaction between city neighborhoods

ORCA can tell us the level of interaction between city neighborhoods and the characteristics of those trips. These can be matched to census data about equity.

Project lead: Mark Hallenbeck, senior data science fellow, director of the Washington State Transportation Center

Project mentor: Michael Wolf

Data scientist leads: Jake VanderPlas (primary), Bryna Hazelton (secondary)

DSSG fellows: Mayuree Binjolkar, Daniel Dylewsky, Andrew Ju, Wenhao Zhang

Project summary: Seven regional transportation agencies in the greater Puget Sound region use a common electronic fare payment system, called One Regional Card for All (ORCA). ORCA data provide travel behavior information that can be used to improve regional transportation system planning and decision making. The Data Science for Social Good (DSSG) program will be using two nine-week ORCA data sets, each consisting of over 20 million transit boarding records, to determine the changes in transit behavior that occurred when light rail stations were opened in the Seattle Capitol Hill and University District neighborhoods.

For example, we know that transfers from the Sounder commuter train to Link light rail more than doubled in spring 2016 when two new light rail stations opened. How many of these individuals are headed to the University District? Are they new transit riders? Or did they previously take buses to the University of Washington (UW)? If they took buses, did they take the limited direct bus service, or take the bus routes that operate more frequently to downtown, and then transfer to buses headed to the UW? We will also look at service characteristics before and after the light rail opens (e.g., are transfers to and from trains faster and more reliable than transfers to the UW bound buses?) to get a better understanding of why these behavior changes are occurring. The project goal is to not only describe the changes in travel behavior, but to develop scalable algorithms and procedures from our analysis that can be applied throughout the region. Can we develop reliable models that predict the different behaviors based on the different service levels of each type of trip? And if so, how different are those models from the ones currently used by Puget Sound Regional Council to predict transit use?

Strengthening capacities, knowledge and data sharing platforms for sustainable development

Three men take measurements in a outdoor setting.

Photo courtesy of Vital Signs.

Project leads: Matt Cooper, data manager, Vital Signs and Tabby Njunge, technical operations manager, Vital Signs

Data scientist leads: Anthony Arendt (primary) and Joe Hellerstein (secondary)

DSSG fellows: Cara Arizmendi, Mitchell Goist, Krista Jones, Robert Shaffer

Project summary: To meet the food security and nutrition challenges of today — with nearly one billion chronically hungry people worldwide — and tomorrow will require an estimated 70 – 100% increase in food production. Millions of small-holder farmers will need to play an important role in meeting this need, particularly across Africa. Unfortunately, agricultural activities are degrading ecosystems and the benefits they provide for people faster now than ever before. We need to find new ways of growing food that can simultaneously deliver food security, environmental sustainability, and economic opportunity. There is an urgent need for better data and risk management approaches to guide sustainable agricultural development and ensure healthy and resilient ecosystems and livelihoods. Vital Signs aims to meet this need for informed policy by providing better data and risk management tools to optimize agricultural development decisions for the needs of the human beings they serve and the ecosystems upon which they depend. Headquartered in Nairobi, Kenya, Vital Signs has worked in Kenya, Ghana, Tanzania, Rwanda and Uganda.  

Vital Signs collects data on the ground using a peer-reviewed monitoring system, integrates data from national governments and third-party data sources, and builds online platforms for data exploration and decision support. This monitoring system collects data on agricultural practices and yields, the environment and biodiversity, land cover, soil health and human well-being, in several 10 x 10 km landscapes in each country. This data is analyzed to show spatial and temporal trends, as well as to create multivariate models. Vital Signs does all of this in close collaboration with national and multinational stakeholders and policymakers. The data is visualized on platforms like, and is freely available for download on the Vital Signs website, as it is intended to be a global public good and a resource for any interested party.

Can traffic sensor data detect vehicle cruising?

Seattle, looking downtown and out towards Alki from North Capitol Hill. Photo credit: Timothy Durkan

Seattle, looking downtown and out towards Alki from North Capitol Hill. Photo credit: Timothy Durkan

Project lead: Stephen Barham, data scientist, Seattle Department of Transportation

Data scientist leads: Valentina Staneva (primary) and Vaughn Iverson (secondary)

DSSG fellows: Brett Bejcek, Anamol Pundle, Orysya Stus, Michael Vlah

Project Summary: Vehicles that have arrived at their destination but are driving around for a place to park, and for-hire and transportation network company vehicles that are queued in traffic, have a significant impact on congestion. The Cruising Traffic Analysis project will develop algorithms to quantify aggregated levels of vehicle traffic cruising. The research intends to apply data science techniques to a sample of anonymous travel sensor data, paid parking transaction information, and parking occupancy surveys conducted by the City of Seattle. We hope to generate heat maps depicting relative prevalence of cruising and propose measurement standards for cruising activity, such as a “cruising index” that could pertain to various methods of data collection and processing.

We will attempt to differentiate between the aggregated footprint of vehicles trying to find on-street parking and the amount due to trip deadheading. If successful, this research could help transportation agencies, technology companies, and car companies predict the availability of parking and more accurately direct travelers with online, mobile, and connected tools, thereby reducing congestion impacts, emissions, and fuel costs.

The 'Equity Modeler': examining just development in Seattle

Index of displacement vulnerability, from the Seattle Comprehensive Plan Equity Analysis, 2016, page 17

Index of displacement vulnerability, from the Seattle Comprehensive Plan Equity Analysis, 2016, page 17

Project leads: Rachel Berney, PhD, assistant professor, Department of Urban Design and Planning and Gundula Proksch, associate professor, Department of Architecture

Data scientist leads: Bernease Herman (primary) and Amanda Tan (secondary)

DSSG fellows: Hillary Dawkins, Jacob Kovacs, Yahui Ma, Jacob Rich

Project Summary: In the past years, Seattle has seen unprecedented population growth, record construction activity, and an increase in housing cost, creating an affordability crisis for a large portion of the urban population. The “Equity Modeler” team is investigating the ongoing gentrification process and inequitable access to opportunities across many of Seattle’s neighborhoods. The project uses publicly available data for GIS-based mapping of equity indicators – related to housing and development, income, mobility, and education – on the city and neighborhood scale. It will develop a structural equation model to establish and predict relationships between indicators and analyze policies intended to initiate positive change.

The team’s goal is to create a tool that brings clarity and direction to an impassioned public discussion and allows stakeholders in the city’s development process to analyze, model, and visualize existing trends and the impact of potential changes in the built environment.