by britta on 9/2/14, 5:54 PM with 50 comments
by tatsiana on 9/2/14, 7:49 PM
by ZanyProgrammer on 9/2/14, 9:01 PM
My solution, since I didn't want to do any screen scraping or make trying to identify individual busses/trains a project in and of itself, was to use Portland's TriMet API. That API acutally return specific route numbers, and estimated and scheduled times for each stop (interpolated in the case of non time points). I'm originally from the Portland Area, so I'm pretty familiar with the geography and roads.
From what I remember in the 511.org Google developer group, people have raised this exact issue, i.e. Caltrain train numbers. The guy responding from the MTA said they'd try and integrate it in the future, but these posts were like back in 2012 (IIRC).
by rakoo on 9/2/14, 7:25 PM
by deepsun on 9/3/14, 1:47 AM
by guard-of-terra on 9/2/14, 6:58 PM
Why won't it match the real schedule?
by bfung on 9/3/14, 4:06 AM
RE: scraping - instead of putting logic in your scraper, just download the entire section you need, store it in file format. Then parse and shove into database whenever you feel like it. You could rerun the parsing since you'll have all the historically scraped website data on disk.
by ZanyProgrammer on 9/2/14, 9:04 PM
by tzm on 9/2/14, 9:17 PM