How much refining does GTFS transit data need before being published?
Preparing a data set for a public release is an important part of the job as they are strict rules to comply with. For instance :
- Format compliance
- Overall size or maximum size for each set
- Anonymity of names
- Proper copyright indications
- Remove useless it empty rows
- Ensure right way to handle it to the recipient or publisher
This is the "clean data " phase that goes after the acquisition and processing of the set. It can take some time according to the size and the complexity of each point in this checklist