Cairo Transport Authority Data
Mohammad Tayseer (http://mtayseer.net/) writes about PDF data from Governorate of Cairo.
What's bad?
- The file is in PDF format, which is very hard to parse.
- There is another copy in Excel format, but it asks for credentials!
- Stations are jammed into a single cell, sometimes separated by dashes, sometimes by underscores.
- Stations are written in many different ways. Sometimes with the complete name. Sometimes written as shortcuts. Sometimes there are spelling mistakes.
- Sometimes the names of stations are hidden
- No consistent numbering of lines
- A lot of lines are defined by start & end stations only, not mentioning the intermediate lines, making it impossible for anyone to know the route of the bus.
- The data is not updated for more than 3 years