How much refining does GTFS transit data need before being published?

Preparing a data set for a public release is an important part of the job, as there are strict rules to comply with. For instance:

  • Format compliance
  • Overall size or maximum size for each set
  • Anonymization of names
  • Proper copyright indications
  • Removal of useless or empty rows
  • A proper way to hand it over to the recipient or publisher

This is the “clean data” phase, which comes after the acquisition and processing of the set. It can take some time, depending on the size of the data set and the complexity of each point in this checklist.
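
Two of the checklist items above, format compliance and the removal of empty rows, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, and REQUIRED_FILES is a minimal subset of the files the GTFS Schedule reference lists as required, so the current specification should be checked before relying on it.

```python
import csv
import io

# Assumption: a minimal subset of the files the GTFS Schedule reference
# marks as required. Check the current specification for the full list.
REQUIRED_FILES = {"agency.txt", "routes.txt", "trips.txt", "stop_times.txt"}


def missing_required_files(file_names):
    """Return the required GTFS files absent from the feed, sorted by name."""
    return sorted(REQUIRED_FILES - set(file_names))


def drop_empty_rows(csv_text):
    """Return the CSV text without the rows whose fields are all blank."""
    reader = csv.reader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for row in reader:
        # A blank line yields an empty list; a line like ",," yields only
        # empty strings. Both are skipped.
        if any(field.strip() for field in row):
            writer.writerow(row)
    return out.getvalue()
```

For a feed delivered as a zip archive, the list of names could come from the standard library's zipfile.ZipFile(path).namelist(), and each .txt member could be passed through drop_empty_rows before repackaging.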

This text was originally published on Quora.

A question? A comment?

Please send it to me by email or on Twitter.