Introduction

The availability of automated data sources, such as transit smartcard data, enables us to zoom in to the individual level, and examine how individual user travels through the transportation system over time. This project explores the use of data visualization to help public transit agencies and transportation planners understand individual travel behavior and engage passengers.

The data used in this project is Oyster card records of an anonymous passenger in 2013, which were kindly provided by Transport for London.

Stations

All the TfL stations the user visited are shown on the map, each as a circle, although the National Rail stations are not included. The radius of the circle is proportional to the logarithm of the number of visits the user made to the station. The type of the visits can be specified as "entry", "exit" or "both".

Each station is marked with a unique color. For the top 10 stations (by number of entry visits) the colors are constant, while for the rest of the stations they are randomly generated (so every refresh will produce different colors). Nevertheless, the colors, even the randomly generated ones, are used consistently through different diagrams, making it easy for cross-reference and comparison.

Origin-Destination Pairs

The individual origin-destination matrix is visualized as a chord diagram. Each station is represented as an arc, and the trips between stations as a chord. An arc is characterized by a starting angle, an ending angle, an inner radius and an outter radius. The angle difference (between the starting angle and the ending angle) of the arc is proportional to the total number of entry visits the user made to the station, and the color of the arc matches the unique color assigned to each station. Each chord is a closed shape connecting two arcs with quadratic Bézier curves, showing the flows between two stations, in both directions. The angle difference of either end arc is proportional to the number of outgoing trips from the attached station to the other. The color of the chord matches the station generating more trips.

Travel Sequences

Each travel sequence is defined as the sequence of stations the user visited (both entry and exit) during a day. All the travel sequences the user made during a whole year is visualized as a sunburst sequence diagram.

All the variations of travel sequences can be encoded as a tree, where each node is a subsequence of stations the user visited starting from the beginning of the day. For a given node, its parent node is the subsequence excluding the last visited station. For example, subsequence "A-B-C" is the parent of subsequence "A-B-C-D". Each subsequence is shown as an arc in the diagram. Its angle difference is proportional to the number of occurrences the user went through subsequence, and its color matches the unique color assigned to the last visited station in the subsequence. The arcs that are within the same angle range and have larger radiuses are its children subsequences.