Opal trip datasets now available


#1

New Opal trip datasets for train, bus, ferry and light rail have been made available on the Open Data Hub!

Explore and download the datasets in our Data Catalogue.

Remember to tell us how you use the data with a “Brag” post!


#2

jumps up and down in excitement


#3

Suggestions… would love:

  • identical dataset for tap-offs (including a metric for “failure to tap-off”)
  • breakdown by bus stop

#4

This is good.

However, it would be great to have a dataset at a more granular level. Perhaps transactional data at a customer level for each month of the last 6 months, along with the journey start datetime+location and journey end datetime+location and, of course, the fields in the current dataset as well.

If there are any plans to do this then do let us know.

Cheers.


#5

Hey guys, we’re glad you are so enthusiastic about the data! Keep the feedback coming.

I just wanted to quickly respond to a couple of points to set some expectations.

Anything very granular (e.g. bus stop level or customer level data) becomes difficult for us to provide, as we need to ensure the safety and privacy of our customers.

Do let us know what the usage of the data is so we can help prioritise any data we’re ‘unlocking’ and possibly present the data in ways which may help your use cases without compromising the safety or privacy of our customers.


#6

Understood @yvonne.lee

From a journey planning perspective, it would be useful to display to a customer what times a particular station is usually most busy. That way, they can plan their journey to use the station at a less busy time if they aren’t in a hurry.

Re bus stop data, I can see how it can be a privacy concern. I’m basically looking for a lower level of detail than contract zones, which would allow us to give customers an idea of how busy an area is at a certain point in time. Perhaps data can be given at the BTS TZ level?


From a demography analysis perspective, it would be great if the Opal data can be used to produce something similar to the JTW statistics, Table 11.

Obviously, there are privacy risks if the data is released at customer level even if it’s anonymised… but perhaps, a statistical sample of Origin and Destination TZ pairs for Opal journeys can be produced?
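To make the idea concrete, here is a rough sketch of how such a sample could be produced. It is only an illustration: the function name, the `(origin_tz, destination_tz)` record shape, the sampling fraction and the minimum-count threshold are all assumptions of mine, not anything the Opal data or the Open Data Hub actually defines.

```python
import random
from collections import Counter


def sample_od_pairs(journeys, fraction, min_count=5, seed=0):
    """Count origin/destination TZ pairs in a random sample of journeys.

    journeys  -- iterable of (origin_tz, destination_tz) tuples (hypothetical shape)
    fraction  -- probability of keeping each journey in the sample
    min_count -- pairs seen fewer times than this are suppressed (assumed threshold)
    """
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    # Keep each journey independently with probability `fraction`.
    sampled = [pair for pair in journeys if rng.random() < fraction]
    counts = Counter(sampled)
    # Drop rare pairs, since a handful of trips could identify an individual.
    return {pair: n for pair, n in counts.items() if n >= min_count}
```

Only aggregated pair counts leave the function, and rare pairs are dropped entirely, so no individual journey is published.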


#7

For a study that we’re doing at UNSW, station-by-date-by-hour summary data would be very useful. This still affords full privacy but enables a much broader set of analyses.
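As a rough sketch of the kind of summary I mean, assuming tap records arrive as `(station, ISO timestamp)` pairs: the record shape, the function name and the small-count threshold are my own assumptions for illustration, not the actual Opal schema.

```python
from collections import Counter
from datetime import datetime


def hourly_summary(taps, min_count=5):
    """Aggregate taps into counts per (station, date, hour).

    taps      -- iterable of (station, iso_timestamp) tuples (hypothetical shape)
    min_count -- cells with fewer taps than this are suppressed (assumed threshold)
    """
    counts = Counter()
    for station, timestamp in taps:
        t = datetime.fromisoformat(timestamp)
        # Bucket each tap by station, calendar date and hour of day.
        counts[(station, t.date().isoformat(), t.hour)] += 1
    # Suppress small cells so that quiet hours cannot single out a traveller.
    return {cell: n for cell, n in counts.items() if n >= min_count}


# Example with made-up records for a hypothetical "Station A".
taps = [("Station A", "2016-08-01T08:15:00")] * 6 + [("Station A", "2016-08-01T09:05:00")] * 2
print(hourly_summary(taps))
```

No card or customer identifier is needed at any point, which is why a table like this should be releasable without the privacy issues of transactional data.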