Welcome to the web blog
Bit pusher at Spotify. Previously Interactive News at the New York Times, U.S. Digital Service, and Code for America.
I just got back from the exceptionally well done Open Vis conference which featured a number of truly excellent talks and speakers. If you are interested in data visualization, design, color theory, or a host of other topics, I highly recommend that you watch the website for when the conference videos are posted.
Inspired by Open Vis, I thought I’d take a look at the recently posted trip-level data dumps from Citibike, available here. I decided to take a look at the most recently available month of data. I was mostly curious about the relationships between different stations and seeing how people used the network. Because the data that actually powers the visual is pretty big, I’ve put it on a [separate page]((https://bensmithgall.com/projects/citibike-trips.html), and have pasted screenshotsin this post instead.
In order, the top five stations for trip origins are:
Let’s take a closer look at top origin station.
It’s a bit unsurprising that this station is such a common origin station – it’s geographically very central to the station network:
This station has destinations of almost every other station in the system (save a chunk of Brooklyn that is not near a bridge).
Lafayette’s top five destinations are:
This pattern is interesting as well, because it looks as though all of the top destinations for this station are clustered geographically.
Also interesting are the top destination stations. The top five destinations on Citibike are:
This is very interesting, because it shows that the same stations are both common as destinations and as starting points. This information might be valuable in tinkering with the Citibike rebalancing project.
While these top-level stats are interesting, I also wanted to be able to explore the citibike network and see relatonships between the different stations. I put together this visualization of the February Citibike trips to try to explore the data. The code is available as a gist and on bl.ocks.
The Citibike trips data set is fairly comprehensive and interesting. There have already been some really great work done with it, like this awesome visualization of rides over a 48-hour period. You can get the data from the Citibike website, and I encourage you to check it out!