Clément Renaud

How much refining does GTFS transit data need before being published?

Preparing a data set for a public release is an important part of the job as they are strict rules to comply with. For instance :

  • Format compliance
  • Overall size or maximum size for each set
  • Anonymity of names
  • Proper copyright indications
  • Remove useless it empty rows
  • Ensure right way to handle it to the recipient or publisher

This is the "clean data " phase that goes after the acquisition and processing of the set. It can take some time according to the size and the complexity of each point in this checklist