1. How are the numbers of displaced and lost rides estimated?

These numbers are estimated using a ranking-based choice model, which is a statistical model of how an arriving customer chooses where to pick up a bike from among a list of ranked preferences. By fitting historical data (of weekdays between 2016/06/01 and 2016/08/31) to the model, we can then use it to predict how a rider may behave in the event that his/her preferred stations have run out of bikes. This allows us to estimate, retrospectively, how many rides might have been displaced or lost within the 3-month period due to station stock-outs.

2. What behavioral assumptions are made in the model?

We assume that each individual rider's decision—whether to choose a pick-up station, if available, or to give up otherwise—is completely determined by his/her own ranking of station choices. This ranking may reflect convenience of access, closeness to the intended destination, and other exogenous factors such as weather.

For example, an MIT graduate student commuting to the main campus from his dorm on Albany St. may rank two nearby choices in order of walking distance: [ MIT Pacific St., MIT Vassar St. ]; a BU student living in Kenmore Square going to Cambridge might have a different ranking in mind: [ Kenmore Sq., Beacon St. and Mass. Ave. ]. If the first-choice station has no bike available, the student will either choose the next available station on the list (the pick-up is displaced), or give up using the service if there is no alternative (the pick-up is lost).

In the model, we assume that each rider will only consider at most two alternatives within 1km from his first choice. Also, that these behaviors are homogenous throughout the time period of study. Based on this setup, our algorithm then infers a probabilistic distribution over all possible station choice rankings that best explains the data.

3. Can a rider’s drop-off (bike-returning) behaviors be similarly modeled?

Absolutely. Our statistical model can be adapted to drop-off as well. However, we focus on bike pick-ups for this visualization because drop-off behaviors are less complex—most bikes on loan are eventually returned, so there is no equivalent of lost pick-ups in this case.

4. What insights can be drawn from this visualization?

We find several interesting angles that may be worth exploring further:

Try exploring the tool yourself and tell us what you find!

5. More questions?

Contact Chong Yang Goh and Chiwei Yan.