Category: Good Transit
Frequency is Relative
Five years ago, I wrote a blog post about frequency-ridership spirals, mentioning as a side comment that the impact of mass transit frequency on ridership can be lumped together with the trip time. I’d like to develop this point here, and talk about how it affects various kinds of public transportation, including intercity trains.
The rule of thumb I’ve advocated for in ETA reports (for example, on commuter rail) is that the maximum headway should be no more than half the trip time. Untimed transfers reset the clock, since passengers have to wait another time every time they make such a transfer; timed transfers do not, but are rare enough that local public transportation doesn’t usually need to consider them in service planning. Intercity transportation can follow the same rule of thumb, but can also get away with worse frequency since passengers time themselves to arrive shortly before the train does; in particular, hourly trains between cities that are three hours apart are frequent enough that increasing service is valuable only insofar as it provides more capacity, and is unlikely to lead to higher ridership through shorter waits.
Wait and transfer penalties
In the literature on modeling public transportation ridership, it is universal that passengers prefer spending time on board a moving vehicle to waiting for a vehicle, walking to the station, or walking between platforms. This preference is expressed as a factor, called the waiting or transfer penalty. different models have different levels for these penalties; passengers also likely have different penalties depending on circumstances, such as familiarity with the route or how much luggage they’re carrying.
The papers I’ve seen have penalties ranging from 1.75 (in the MTA’s model) to 3 (the higher end cited in Lago-Mayworm-McEnroe). I usually model with 2, in Teulings-Ossokina-de Groot. The factor of 2 has the advantage of consistency with an assumption that passengers don’t have a wait penalty but do assume a worst-case scenario for waits, so that the generalized travel time is equal to the maximum headway plus the in-vehicle travel time.
Update 4-19: I was just alerted to a new study by Yap-Wong-Cats using London ridership, finding an out-of-vehicle penalty factor of 1.94 pre-pandemic and 1.92 post-pandemic.
The impact of frequency relative to trip time
If the elasticity of ridership with respect to the generalized travel time, summing the headway and in-vehicle travel time but not walking time to and from the station, is e, then we can compute the elasticity of ridership with respect to frequency as a fraction of e. If the current headway is a proportion r of the in-vehicle trip time, that is to say a fraction r/(r+1) of the generalized travel time, then the elasticity is er/(r+1).
In Lago-Mayworm-McEnroe, the value of e appears to be 0.8. This means that if r = 0.5, the elasticity of ridership with respect to frequency is 0.267. The paper doesn’t quantify elasticities relative to different levels of r but only relative to absolute frequencies, but 0.267 is within the range it finds for different frequencies, dipping to 0.22 for high-frequency lines. Other papers have different figures of e, often higher in the long run as passengers adjust, but those go up to around 1 or, reasoning backward from a VTPI report, a little higher.
Of notes:
- The value of r is not constant across different uses of the same line. A commuter traveling from near the outer end of a subway line to city center faces much lower r than a traveler going a short distance, within city center or within a large outlying neighborhood. In particular, r is generally lower for commutes than for non-commute trips, which is why the latter are more sensitive to frequency.
- Systems that rely on extensive transfers can have very high values of r with short in-vehicle trips. New York averages 13.5 minutes per unlinked subway trip, with many trips facing an effective off-peak headway of 10 or even 12 minutes, at which point e is high enough that increasing off-peak frequency could pay for itself through higher paying ridership (see analysis in a blog post and an ETA report). This, again, depends on the type of trip – commuters may pick an apartment or a job based on ease of travel, reducing the need to transfer, but their non-commute trips are usually a collection of irregular trips to various destinations and are likelier to involve a transfer.
- The cost of higher frequency depends on mode (it’s higher on buses than trains) and time of day (it’s very low on off-peak trains until it matches peak frequency). Together with points 1 and 2, this argues in favor of raising the off-peak frequency on urban and inner suburban trains, potentially to the point of matching peak frequency. On longer-range commuter trains, the impact of frequency on ridership is lower, and thus the marginal cost may be such that a ratio of peak to off-peak service larger than 1 is desirable.
Intercity trains
The papers I’m citing aim to fit elasticity factors to observed ridership on local and regional public transportation. Intercity rail has its own set of models, with different assumptions. Frequency again matters, but because passengers time themselves to arrive at the departing station shortly before the train leaves, its impact is reduced.
I don’t know the elasticity of intercity rail ridership with respect to frequency; Cascetta-Coppola have the elasticity of ridership with respect to in-vehicle trip time as about -2, while Börjesson has it at -1.5 for business travel and -1 for non-business travel with a rough rule of thumb trying to approximate the impact of frequency. At the level of the sanity check, the low frequency of TGV services is not visible in TGV ridership between the provinces and Paris, compared with Japan (which charges higher fares) and as far as I can tell from a few data points Germany. TGV ridership between the provinces is bad, but that involves trains with service gaps that are much larger than the trip times, reaching six hours between Marseille and Lyon. In contrast, those three-hour gaps in service between Paris and cities three hours away by TGV don’t seem to impact ridership visibly.
What this means is that intercity trains do need a certain baseline frequency. The German system of a train every two hours on every city pair is wise, in light of the typical intercity rail travel distances in a large country with slow trains. Higher frequency is warranted if the cities are bigger and therefore require more service, or if they are closer together in time through either a short geographical distance or higher speeds. New York and Philadelphia are about 1:10 apart by rail, and high-speed rail could cut this to about 45 minutes; half-hourly frequencies in the current situation are sufficient that more service would have a second-order effect, and even with high-speed rail, a train every 15 minutes is more than enough for all purposes except capacity (the current offer is 3-4 trains an hour with irregular spacing). Frequency is freedom, but this depends on trip times; what works for four-station subway trips is not what works for trips between cities 140 km apart, let alone 360 km, and vice versa.
Subway-Intercity Rail Connections
Something Onux said in comments on yesterday’s post, about connecting Brooklyn to intercity rail, got me thinking more about how metro lines and intercity rail can connect better. This matters for mature cities that build little infrastructure like New York or Berlin, but also for developing-world cities with large construction programs ahead of them. For the most part, a better subway system is automatically one that can also serve the train station better – the train station is usually an important destination for urban travel and therefore, usually the same things that make for a stronger subway system also make for better subway-intercity rail connections.
Subways and commuter trains
Like gender, transit mode is a spectrum. There are extensive systems that are clearly metro and ones that are clearly commuter rail, but also things in between, like the RER A, and by this schema, the Tokyo and Seoul subways are fairly modequeer.
The scope of this post is generally pure subway systems – even the most metro-like commuter lines, like the RER A and the Berlin S-Bahn, use mainline rail rights-of-way and usually naturally come to connect with intercity train stations. Of note, RER A planning, as soon as SNCF got involved, was modified to ensure the line would connect with Gare de Lyon and Gare Saint-Lazare; previous RATP-only plans had the line serving Bastille and not Gare de Lyon, and Concorde and not Auber. So here, the first rule is that metro (and metro-like commuter rail) plans should, when possible, be modified to have the lines serve mainline train stations.
Which train stations?
A city designing a subway system should ensure to serve the train station. This involves nontrivial questions about which train stations exactly.
On the one hand, opening more train stations allows for more opportunities for metro connections. In Boston, all intercity trains serve South Station and Back Bay, with connections to the Red and Orange Lines respectively. In Berlin, north-south intercity trains call not just at Hauptbahnhof, which connects to the Stadtbahn and (since 2020) U5, but also Gesundbrunnen and Südkreuz, which connect to the northern and southern legs of the Ringbahn and to the North-South Tunnel; Gesunbrunnen also has a U8 connection. In contrast, trains into Paris only call at the main terminal, and intercity trains in New York only stop at Penn Station.
On the other hand, extra stations like Back Bay and delay trains. The questions that need to be answered when deciding whether to add stations on an intercity line are,
- How constructible is the new station? In New York, this question rules out additional stops; some of the through-running plans involve a Penn Station-Grand Central connection to be used by intercity trains, but there are other reasons to keep it commuter rail-only (for example, it would make track-sharing on the Harlem Line even harder).
- How fast is the line around the new station? More stations are acceptable in already slow zones (reducing the stop penalty), on lines where most trips take a long time (reducing the impact of a given stop penalty). Back Bay and Südkreuz are in slow areas; Gesundbrunnen is north of Hauptbahnhof where nearly passengers are going south of Berlin, so it’s free from the perspective of passengers’ time.
- How valuable are the connections? This depends on factors like the ease of internal subway transfers, but mostly on which subway lines the line can connect to. Parisian train terminals should in theory get subsidiary stations because internal Métro transfers are so annoying, but there’s not much to connect to – just the M2/M6 ring, generally with no stations over the tracks.
Subway operations
In general, most things that improve subway operations in general also improve connectivity to the train station. For example, in New York, speeding up the trains would be noticeable enough to induce more ridership for all trips, including access to Penn Station; this could be done through reducing flagging restrictions (which we briefly mention at ETA), among other things. The same is true of reliability, frequency, and other common demands of transit advocates.
Also in New York, deinterlining would generally be an unalloyed good for Penn Station-bound passengers. The reason is that the north-south trunk lines in Manhattan, other than the 4/5/6, either serve Penn Station or get to Herald Square one long block away. The most critical place to deinterline is at DeKalb Avenue in Brooklyn, where the B/D/N/Q switch from a pattern in which the B and D share one track pair and the N and Q share another to one in which the B and Q share a pair and the D and N share a pair; the current situation is so delicate that trains are delayed two minutes just at this junction. The B/D and N/Q trunk lines in Manhattan are generally very close to each other, so that the drawback of deinterlining is reduced, but when it comes to serving Penn Station, the drawback is entirely eliminated, since both lines serve Herald Square.
If anything, it’s faster to list areas where subway service quality and subway service quality to the train station specifically are not the same than to list areas where they are:
- The train station is in city center, and so circumferential transit, generally important, doesn’t generally connect to the station; exceptions like the Ringbahn exist but are uncommon.
- If too many lines connect to the one station, then the station may become overloaded. Three lines are probably fine – Stockholm has all three T-bana lines serving T-Centralen, adjacent to the mainline Stockholm Central Station, and there is considerable but not dangerous crowding. But beyond that, metro networks need to start spreading out.
- Some American Sunbelt cities if anything have a subway connection to the train station, for example Los Angeles, without having good service in general. In Los Angeles, the one heavy rail trunk connects to Union Station and so does one branch of the Regional Connector; the city’s problems with subway-intercity rail connections are that it doesn’t really have a subway and that it doesn’t really have intercity rail either.
Is There Room for Optimism About New York Construction Costs?
This year, there have been some positive signs about things changing in New York on subway construction – and yet, I’m uncertain about them. There are some signs that construction costs for Second Avenue Subway Phase 2 are coming under control. The New York Post broke in January that the MTA is eying smaller station designs, to reduce costs, to the tun of $300 million; an article released a few hours ago adds that there may be another $600 million in potential savings. So, in theory, costs are going down, and they’re going down as the MTA implements something we’ve been screaming about at the Transit Costs Project, so we should be happy.
And yet, I’m uncertain – not negative, but still somewhat pessimistic about whether this portends an era in which New York can finally build more subways. The main reason isn’t even some mistrust in the MTA at this point – the reduction in station footprints is a genuinely good thing, and to the extent it’s incomplete, it’s because it’s a longstanding project with older designs. Rather, it’s a combination of what this means for future projects, and how it interacts with federal funding. In brief, federal funding is at the level of the project rather than agency, and this makes it hard for cost savings to be plugged into the most straightforward benefit – namely, being able to build more on the same budget.
How is the money being saved?
The New York Post is relying on an MTA presentation from January that defends the cost structure but talks about how to reduce station costs through reducing back-of-house space. Phase 1 of Second Avenue Subway built two deep-mined intermediate stations, at 72nd and 86th Streets; the platforms are 610′ (187 meters) long, and there are no serious prospects of ever running longer trains since the line is an extension of older lines, but the station caverns are, respectively, 398 and 295 meters long, where the norm in the European comparison cases we’ve seen is that the station dig is 3-15% longer than the platforms, not twice as long.
Both stations have extensive back-of-house space, which New York City Transit demanded so that each department using the station would have its own space; 72nd also has a crossover inherited from older designs that would have permitted some trains from the south to terminate there on a third station track, which was later removed from scope to reduce costs. (The terminal station, the cut-and-cover 96th Street, is a 485 meter long dig, but that’s an artifact of block-level geology: the northern end had to go as far as 99th in order to connect with an older tunnel built in the 1970s, whereas the southern end had to go as far as 92nd because the underground geology changes abruptly there and it was easier to start boring at 92nd than at 96th.)
The plan for Phase 2 initially included much longer digs than the platform, for the same reason. However, it has since changed, and now the digs are substantially reduced. The MTA’s presentation looks like the overage at 125th Street is reduced from somewhat more than 100% in the 2004 plan to about 40-50% in the 2023 plan, and the overage at 106th and 116th is somewhat less than that, maybe 30-40%. While the 125th Street station dig is still as deep as in prior plans, the deep-mined station will also extend less far up, reducing the extent of space required to be outfit with systems.
The MTA could shrink the stations’ footprint further and save more money, but it’s fairly late in the design, and thus the opportunity to take full benefit of this improvement is for future projects. If this establishes precedent for future station construction, then it’s an unmixed blessing.
Money is saved. So what?
The broader issue is that the savings from shrinking the stations’ footprint – totaling potentially $1 billion out of a budget of $7 billion – don’t have much to go. The rub is that the project already has a Full Funding Grant Agreement. If the MTA manages to do it for less, then the most obvious, and most pessimistic, answer to where the money goes is “preventing future overruns.” The savings, in the worst case, then transfer waste from one basket, namely oversize stations, to other baskets, which could be future conflict with contractors, last-minute design changes, or betterments to the neighborhood.
That said, there are plans to spend it on something useful. But the problem is that this is limited by the scope of the project itself. Second Avenue Subway in its current iteration dates to the 1990s, and is reusing some infrastructure from the 1970s. The intention in the 1990s was to do the entire thing, or at least Phases 1 and 2, together, and the project was only chopped into four phases due to high costs. There was design work done 20 years ago or more, and the environmental impact statement is roughly that old.
I suspect the reason the cost saving from shrinking the stations is $1 billion and not much more – we estimated that Phase 1’s cost would have been halved if the stations had been only as long as the platforms – is that the designs are already spoken for, with 20 years of optimization involved. Thus the change is reducing the hard costs of construction, but not the soft costs. It will not surprise me if a postmortem will reveal an elevated ratio of soft to hard costs, purely because the cost savings are happening at such a late stage; in this case, and only in this case, it is important to forgive a high ratio of soft to hard costs, since it portends that future designs will be cheaper, and future cost savings larger. Normally, a high soft-to-hard cost ratio suggests red tape and waste involving consultants, but in this one case, it would suggest something else; I highlight this so that watchdogs for government waste, including the New York Post, realize what is going on and avoid hitting the project if it turns out to indeed have a high soft-to-hard cost ratio as I expect.
Current plans include potentially continuing the tunnel boring under 125th Street. Governor Hochul expressed some interest in a subway extension under 125th Street, extending Phase 2 from 125th/Lexington to the Hudson, with stops at the intersections with the subway lines, at Lenox (2/3), St. Nicholas (A/B/C/D), and Broadway (1 and also potential commuter rail). Such an extension was long on the wishlist of New York-area railfans, and an operations planner mentioned it to me as a future desire more than 10 years ago, unprompted. But there is no way to just reallocate $1 billion to this line; that’s not how federal funding works. At best, it will be possible to continue boring the tunnel all the way to the west, and leave the systems and stations to a future project.
My pessimism is that the cost figure given for the 125th Street extension is $7.6 billion, around $3.3 billion per kilometer. Even taking into account future inflation, it’s costlier than Second Avenue Subway Phases 1 and 2. Now, this is an early number, one that hasn’t really made it into any plans. I hope that the current cost savings are then plugged into the 125th Street extension plan, and that this project is pursued seriously at a much lower cost figure; since all stations would have to be complex digs underneath older north-south subway lines, the benefits of shrinking station footprint are especially large. But I worry that this will not happen; I’ve had hopes dashed before – for example, FRA reform did not lead American commuter rail agencies to start buying alt-compliant vehicles. We’ll see what happens if there’s more detailed work on the 125th Street extension proposal.
Funding projects vs. funding agencies
The current way federal funding works for public transportation in the United States is that the government funds specific capital projects. The MTA can ask for funding for big-ticket items like Second Avenue Subway Phase 1, Phase 2, a future 125th Street subway, the Interborough Express, or any similar such line. It can also ask for a rolling program of improvements, for example installing elevators at stations to make them accessible in line with the Americans with Disabilities Act. But the federal government does not make it easy to move money between such projects.
Phase 2 is just one project, but imagine that there are three subway lines funded concurrently: Phase 2, a subway extension under Nostrand, and a subway extension under Utica. If the MTA finds 25% cost savings, it can’t easily flex the money to a fourth line, say under 125th. It would need to start the planning process early, which is so cumbersome and expensive that it wouldn’t do so for a project it wasn’t certain it wanted to do; there is no shelf of subway extensions that are approved and are just waiting for money.
This makes the incentives for cost savings uncertain: cost savings could be used to establish the agency’s bona fides with a distrustful public, or to establish a warchest guarding against future cost overruns, or to trial new ways of working that could lead to bigger cost savings in the future, but the most straightforward benefit of cost savings – building more infrastructure on the same budget – is not generally available. For Phase 2, the best that can happen is, again, continuing boring the tunnel to the western end of 125th Street, which could be connected with the current project because the geology under the street is the same from Second Avenue to Broadway so might as well future-proof it.
And unfortunately, in the United States, the current examples of funding agencies rather than projects have been lacking. The programs in Los Angeles and Seattle, funded by sales taxes, in effect fund the agencies. There are long lists of projects in both metro areas that are funded from them, but they come from the same pool of money in each region. The situation in Los Angeles is that there’s a decided priority list, with money allocated through the end of the 2050s; if Los Angeles figures out how to cut costs significantly, all of the opening dates will be moved closer and additional lines could be planned and built with the money (the planning and environmental review process takes years, not decades, so by 2050 they will have reviews for the additional subways they could build).
And yet, the same process that’s produced lush agency-level funding in both regions has also led to bad prioritization. New York may have the world’s highest construction costs, but at least what it’s building is what it should be building: Second Avenue Subway Phase 2 is indeed the highest priority right now, and among the next priorities, the Interborough Express and 125th Street are solid choices, according to most area rail advocates two of the top five, and potentially even the top two (the other three are Nostrand, Utica, and an elevated extension of the Astoria Line to LaGuardia). In contrast, Los Angeles is prioritizing the wrong projects. The same ballot proposition process there that produces agency-level funding also requires the agency to bribe local actors who care little for public transportation or for ideological politics with lines to their own subregions of the county, not because they or their constituents will ride the trains, but because they will be able to tell their constituents “I managed to force the county to give us infrastructure money.” This way, each region of the county gets a light rail extension, no matter how lightly-ridden, while the core of the system receives little investment: while the busiest bus corridor in Los Angeles, Wilshire, is getting a subway in the D (Purple) Line Extension, the next two busiest, Vermont and Western, are not getting any rail through the 2050s, despite calls from advocates to built a line on Vermont to turn the B (Red) Line into a north-south line rather than a branch with the Purple Line.
Incentives for the future
The MTA is clearly capable of saving money. The question is now how to incentivize doing more of it. First of all, I urge New York-area advocates to pursue the 125th Street extension, and demand that the cost savings identified for Phase 2 apply to it too. The savings may even potentially be relevant to the Interborough Express, though with at-grade and above-ground stations, the impact is greatly reduced. The Phase 2 savings are reactive; applying them to future lines is proactive.
Second, I urge both the MTA and advocates to look for cost savings in areas where it is easier to flex money – namely, ADA accessibility. Being able to make a station accessible not for the current budget of about $70 million per station with contingency but the $25 million of Boston or $10 million of Madrid would enable New York to have an all-accessible subway system not in the 2050s but in the early 2030s.
Finally, at the federal level, it is useful to figure out how to fund agencies with a positive track record and not just specific projects. Potentially, agencies could be encouraged to submit wishlists of future projects that may be cleared in case money becomes available on short notice; this is useful not just in the case of cost savings, but also in the case of an unexpected infrastructure stimulus – neither the scope of the 2009 stimulus in the early Obama era nor that of 2021 under Biden had been telegraphed until shortly before, and so agencies have not always been able to take maximum advantage of the additional funds.
Northeast Corridor Travel Markets and Fares
I’m optimistic about the ability of fast trains on the Northeast Corridor to generate healthy ridership at all times of day. This is not just a statement about the overall size of the travel market, which is naturally large since the line connects three pretty big metro areas and one very big one. Rather, it’s also a statement about how the various travel markets on the corridor fit atypically well together, creating demand that is fairly consistent throughout the day, permitting trains with a fixed all-day schedule to still have high occupancy – but only if the trains are fast. Today, we’re barely seeing glimpses of that, and instead there are prominent peak times on the corridor, due to a combination of fares and slowness.
Who travels by high-speed rail?
High-speed intercity rail can expect to fill all of the following travel markets:
- Business trips
- Tourism and other recreational trips (including visits to friends and family)
- Commute trips
The key here is that the first two kinds of trips can be either day trips or multiday trips. I day-tripped to the Joint Mathematics Meeting in 2012 when I lived in Providence and the conference was in Boston, about an hour away by commuter rail, but when there was an AMS sectional conference in Worcester in 2011 (from which my blog’s name comes – I found Worcester unwalkable and uploaded an album to Facebook with the name “pedestrian observations from Worcester”) I took trains. I probably would have taken trains from New York to Worcester for the weekend rather than day-tripped even if they were fast – fast trains would still be doing it in around 2:30, which isn’t a comfortable day trip for work.
Non-work trips are the same – they can be day trips or not. Usually recreational day trips are uncommon because train travel is expensive, especially at current Amtrak pricing – the Regional costs twice as much per passenger-km as the TGV and ICE, and the Acela costs three times as much. But there are some corner cases: I moved to Providence with a combination of day trips (including two to view apartments and one to pick up keys) and multiday trips. Then, at commuter rail range, there are more recreational day trips – I’ve both done this and invited others to do so on Boston-Providence and New York-New Haven. Recreational rail trips, regardless of length, are more often taken by individuals and not groups, since car travel costs are flat in the number of passengers and train costs are linear, but even groups sometimes ride trains.
We can turn this into a two-dimensional table of the various kinds of trips, classified by purpose and whether they are day trips or multiday trips:
| Purpose | Day trip | Multiday trip |
| Business | Work trips between city centers, usually in professional services or other high-value added firms, such as lawyers traveling between New York and Washington, or tech workers between New York and Boston; academic seminar trips | Conference trips, more intensive work trips often involving an entire team, and likely all business trips between Boston and Washington or Boston and Philadelphia |
| Tourism | Trips to a specific amenity, which could be urban, such as going to a specific museum in New York or Washington or walking around a historic town like Plymouth, or not, such as taking the train to connect to a beach or hiking trail | Long trips to a specific city or region, stringing many amenities at once |
| Other non-work | College visits, especially close to the train stations; meetings with friends within short range (such as New York-Philadelphia); advocacy groups meeting on topics of interest; apartment viewings; some trips to concerts and sports games | Visits to friends and family; some moving trips; some trips to concerts and sports games |
| Commute | Long-range commutes | — |
Day trips and different peaks
The different trips detailed in the above table peak at different times. The commute trips peak at the usual commute times: into the big cities at 9 in the morning, out of them at 5 in the afternoon. But the rest have different peaks.
In particular, multiday tourism trips peak on weekends – out on Friday, back on Sunday night – while multiday business trips peak during the week. This is consistent enough that airlines use this to discriminate in prices between different travelers: trips are cheaper if booked as roundtrips over Saturday nights, because such trips are likely to be taken by more price-sensitive leisure travelers rather than by less price-sensitive business travelers.
Nonetheless, even with this combination of patterns, the most common pattern I am aware of on intercity rail, at the very least on the Northeast Corridor and on the TGV network, is that the trains are busier during the weekend leisure peak than during the weekday business peaks (except during commute times). Regardless of the distribution of business and leisure trips, leisure trips have a narrow peak, leaving within a window just a few hours long on Friday, whereas business trips may occur on any combination of weekdays; for this reason, leisure-dominated railroads, such as any future service to Las Vegas, are likely to have to deal with very prominent peaking, reducing their efficiency (capacity has to be built for all hours of the week).
But then day trips introduce additional trip types. The leisure day trip is unlikely to be undertaken on a Friday – the traveler who is free during the weekend but not the week might as well travel for the entire weekend. Rather, it’s either a Saturday or Sunday trip, or a weekday trip for the traveler who can take a day off, is not in the workforce, or is in the region on a leisure trip from outside the Northeast and is stringing multiple Northeastern cities on one trip. This day trip then misses the leisure peak, and usually also misses the commuter peak.
The business day trip is likely to be undertaken during the week, probably outside the peak commute time – rather, it’s a trip internal to the workday, to be taken midday and in the afternoon. It may overlap the afternoon peak somewhat – the jobs that I think are likeliest to take such trips, including law, academia, and tech, typically begin and end their workdays somewhat later than 9-to-5 – but the worst peak crowding is usually in the morning and not the afternoon, since the morning peak is narrower, in the same manner that the Friday afternoon peak for leisure travel is narrower than other peaks.
The day trips are rare today; I’ve done them even at the New York-Providence range, but only for the purpose of moving to Providence, and other times I visited New York or even New Haven for a few days at a time. However, the possibility of getting between New York and Providence in 1:20 changes everything. The upshot is that high-speed rail on the Northeast Corridor is certain to massively boost demand on all kinds of trips, but stands to disproportionately increase the demand for non-peak travel, making train capacity more efficient – running the same service all day would incur less waste in seat-km.
The issue of fares
All of this analysis depends on fares. These are not too relevant to business trips, and I suspect even to business day trips – if you’re the kind of person who could conceivably day trip between New York and Washington on a train taking 1:45 each way to (say) meet with congressional staffers, you’re probably doing this even if the fares are as high as they are today. But they are very relevant to all other trips.
In particular, non-work day trips are most likely not occurring during any regular peak travel time. Thus, the fare system should not attempt to discourage such trips. Even commute trips, with their usual peak, can be folded into this system, since pure intercity trips peak outside the morning commute period.
The upshot is that it is likely good to sell intercity and commuter rail tickets between the same pair of stations for the same fare. This means that people traveling on sections currently served by both intercity and commuter rail, like Boston-Providence, New York-New Haven, New York-Trenton, and Baltimore-Washington, would all be taking the faster intercity trains. This is not a problem from the point of view of capacity – intercity trains are longer (they serve fewer stations, so the few stations on the corridor not long enough for 16-car trains can be lengthened), and, again, the travel peaks for multiday trips are distinct from the commute peak. Commuter trains on the corridor would be serving more local trips, and help feed the intercity trains, with through-ticketing and fare integration throughout.
Serving Metro-North Fordham Station
In the Bronx, the Metro-North Harlem Line runs north-south, west of the 2/5 subway lines on White Plains Road and east of the 4 on Jerome Avenue and the B/D on Grand Concourse. It makes multiple stops, all served rather infrequently, currently about every half hour, with some hourly midday gaps, at premium fare. The north-south bus lines most directly parallel to the line, the Bx15 and Bx41, ranked #20 and #24 respectively in ridership on the New York City Transit system in 2019, though both have lost considerable ground since the pandemic. Overall, there is serious if not extremely high demand for service at those stations. There is already a fair amount of reverse-peak ridership: while those half-hourly frequencies can’t compete with the subway for service to Manhattan, they are the only non-car option for reverse-peak service to White Plains, and Fordham gets additional frequencies as well as some trains to Stamford. A city report from 2011 says that Fordham has 51 inbound passengers and 3,055 outbound ones boarding per weekday on Metro-North. Figuring out how to improve service in the Bronx requires a paradigm shift in how commuter rail is conceived in North America. Fordham’s reverse-peak service is a genuinely hard scheduling question, which we’re having to wrestle with as we’re proposing a (much) faster and smoother set of timetables for Northeast Corridor trains. Together, they make for a nontrivial exercise in tradeoffs on a busy commuter line, in which all options leave something to be desired.
Harlem Line local service
The bulk of demand from Fordham is local service, mostly toward Manhattan. The area is a bedroom community: within 1 km of the station at Park and Fordham there are 35,338 employed residents and only 22,515 jobs as of 2019; the largest destination is Manhattan (12,734 commuters), followed by the Bronx (7,744), then the rest of the city (8,069 across three boroughs), and only then Westchester (2,207), Long Island (1,660), and Connecticut (220). But to an extent, the station’s shed is larger for reverse-commute service, because people can connect from the Bx12 bus, which ranked second in the city behind the M15 in both 2019 and 2022; in contrast, Manhattan-bound commuters are taking the subway if they live well east or west of the station along Fordham. Nonetheless, the dominance of commutes to city destinations means that the most important service is to the rest of the city.
Indeed, the nearest subway stations have high subway ridership. The city report linked in the lede cites ridership of 11,521 on the B/D and another 12,560 on the 4 every weekday, as of 2012; both stations saw declines by 2019. The West Bronx’s hilly terrain makes these stations imperfect substitutes for each other and for the Metro-North station, despite the overlap in the walk sheds – along Fordham, Park is 600 meters from the Concourse and 950 from Jerome. Nonetheless, “roughly the same as either of the Fordham stations on the subway” should be a first-order estimate for the ridership potential; better Metro-North service would provide a much faster option to East Harlem and Grand Central, but conversely require an awkward transfer to get to points south, which predominate among destinations of workers from the area, who tilt working-class and therefore peak in Midtown South and not in the 50s:

Adding up all the Bronx stations on the line – Wakefield, Woodlawn, Williams Bridge, Botanical Garden, Fordham, Tremont, Melrose – we get 170,049 employed residents (as always, as of 2019), of whom 62,837 work in Manhattan. The line is overall in a subway desert, close to the 4 and B/D but along hills, and not so close to the 2/5 to the east; several tens of thousands of boardings are plausible if service is improved. For comparison, the combination of Westchester, Putnam, and Dutchess Counties has 115,185 Manhattan-bound commuters, split across the Harlem Line, the Hudson Line, and the inner New Haven Line. The Bronx is thus likely to take a majority of Manhattan-bound ridership on the Harlem Line, though not an overwhelming one.
To serve all this latent demand, it is obligatory to run more local service. A minimum service frequency of a train every 10 minutes is required. The current outbound schedule is 20 minutes from Grand Central to Fordham, and about four of those are a slowdown in the Grand Central throat that can be waived (the current speed limit is 10 miles per hour for the last mile; the infrastructure can largely fit trains running three times that fast almost all the way to the bumpers). Lower frequency than this would not really make use of the line’s speed.
Moreover, using the track chart as of 2015 and current (derated) M7 technical performance, the technical trip time is 18 minutes, over which 20 minutes is not too onerously padded; but removing the derating and the four gratuitous minutes crawling into and out of Grand Central, this is about 14 minutes, with some pad. The speed zones can be further increased by using modern values of cant and cant deficiency on curves, but the difference isn’t very large, only 40 seconds, since this is a section with frequent stops. It’s fast, and to reinforce this, even higher frequency may be warranted, a train every 7.5 or 6 or even 5 minutes.
There is room on the tracks for all of this. The issue is that this requires dedicating the local tracks on the Harlem Line in the Bronx to local service, instead of having trains pass the platforms without stopping. This, in turn, requires slowing down some trains from Westchester to make more local stops. Current peak traffic on the Harlem Line is 15 trains per hour, of which 14 run past Mount Vernon West, the current northern limit of the four-track section, and 13 don’t stop in the Bronx at all. The line has three tracks through past Crestwood, and the stations are set up with room for a fourth track, but a full 10 trains per hour, including one that stops in the Bronx, run past Crestwood. In theory it’s possible to run 12 trains per hour to Mount Vernon West making all stops, and 12 trains past it skipping Bronx stops; this slows down the express trains from White Plains, which currently skip seven stops south of White Plains to Mount Vernon West inclusive, but higher speeds in the Bronx, speeding up the Grand Central throat, higher frequency, and lower schedule padding would together lead to improvements in trip times. However, this introduces a new set of problems, for which we need to consider the New Haven Line too.
Harlem Line express service and the New Haven Line
Currently, the New Haven Line runs 20 trains per hour into Grand Central at the peak. This number will go down after Penn Station Access opens in 2027, but not massively; a split of 6-8 trains per hour into Penn Station and 16-18 into Grand Central, with the new service mildly increasing total throughput, is reasonable.
Today, New Haven Line locals stop at Fordham, and nowhere else in the Bronx. This is inherited from the trackage rights agreement between the New York Central and the New York, New Haven and Hartford Railroad, allowing the latter to make only one stop in the Bronx on the former’s territory; it used to be Woodlawn, the branch point, but has been moved to Fordham, which has busier reverse-peak traffic. The two railroads merged in 1969, and all service is currently run by Metro-North, but the practice persists. This is not necessarily stupid: the New Haven locals are long – Stamford is 53 km from Grand Central, 50% farther than White Plains – and a system in which the New Haven trains are more express than the Harlem trains is not by itself stupid, depending on other system constraints. Unfortunately, this setup introduces all manners of constraints into the system:
- Fordham is a local-only station, and thus New Haven locals have to use the local tracks, using awkwardly-placed switches to get between the express and the local tracks. In fact, all stations up to and including Woodlawn are local-only; the first station with platforms on the express tracks is Wakefield, just north of the split between the two lines.
- If there are 12 Harlem Line trains per hour expressing through the Bronx, then the New Haven Line is limited to about 12 trains per hour as well unless the local trains make all the Bronx local stops.
- The Hudson Line has a flat junction with the Harlem Line at Mott Haven Junction, which means that any regular schedule has to have gaps to let Hudson Line trains pass; current peak Hudson Line traffic is 11 trains per hour, but it was 14 before corona.
This leads to a number of different options, each problematic in its own way.
Maximum separation
In this schedule, all Harlem Line trains run local, and all New Haven Line trains run express in the Bronx. This is the easiest to timetable – the junction between the two lines, unlike Mott Haven, is grade-separated. This also requires splitting the Hudson Line between local and express tracks, so delays will still propagate in any situation unless the Hudson Line is moved to the Empire Connection (6-8 trains per hour can stay on the current route); but in a future with Penn Station Access West, building such service, it does allow for neat separation of the routes, and I usually crayon it this way in very high-investment scenarios with multiple through-running tunnels.
But in the near future, it is a massive slowdown for Harlem Line riders who currently have express service from White Plains to Manhattan. The current peak timetable has a 15-minute difference between local and express trains on this section; this figure is padded but not massively so, and conversely, higher speeds on curves increase the express train speed premium.
It also severs the connection between the Bronx and the New Haven Line, unless passengers take east-west buses to the Penn Station Access stations. It is possible to add infill at Wakefield on the New Haven Line: this is north of the junction with the Harlem Line, but barely, so the separation between the lines is short, and a transfer station is feasible. But it wouldn’t be a cross-platform transfer, and so Fordham-Stamford service would still be degraded.
Locals run local
The New Haven Line locals can make local stops in the Bronx. It’s a slowdown of a few minutes for those trains – the current outbound timetable is 18 minutes from Grand Central to Fordham, two minutes faster than on Harlem Line locals while skipping Melrose and Tremont. Overall, it’s a slowdown of around six minutes; the current speed zones are 60 and 75 mph, and while raising the speed limits increases the extent of the slowdown (most of the track geometry is good for 160 km/h), getting new trainsets with better acceleration performance decreases it, and overall it’s likely a wash.
From there, the service pattern follows. New Haven Line locals to Grand Central have little reason to run more frequently than every 10 minutes at peak – the local stations are 30-60 minutes out of Grand Central today, and this is massively padded, but with the timetables Devin produced, fixing the Grand Central throat, New Rochelle would still be 20 minutes out on a local train (stopping at Fordham only as is the case today) and Stamford would be 45 minutes out. What’s more, there will be some local trains to Penn Station starting in three years, boosting the effective frequency to a train every five minutes, with a choice of Manhattan station if express trains can be made to stop at New Rochelle with a timed connection.
Now, if there are six local trains per hour in the Bronx going to Stamford, then the Harlem Line locals only take six trains per hour of their own, and then 12 trains should run express from Wakefield to Harlem. What they do to the north depends. The simplest option is to have all of them make all stops, which costs White Plains 7-8 minutes relative to the express stopping pattern. But if the line can be four-tracked to Crestwood, then half the trains can run local to White Plains and half can run nonstop between Wakefield or Mount Vernon West and White Plains. Two local stations, Scarsdale and Hartsdale, are in two-track territory, but timetabling a local to follow an express when both run every 10 minutes and there are only two local stops’ worth of speed difference is not hard.
The New Haven Line, meanwhile, gets 12 express trains per hour. Those match 12 express Harlem Line trains per hour, and then there’s no more room on the express tracks; Hudson Line trains have to use the local tracks and somehow find slots for the northbound trains to cross both express tracks at-grade.
Status quo
The status quo balances Bronx-Stamford connectivity with speed. Bear in mind, the New Haven Line today has truly massive timetable padding, to the point that making trains make all six stops in the Bronx and not just Fordham would still leave New Rochelle locals faster than they are today if the other speedup treatments were put into place. But the status quo would allow New Rochelle, Larchmont, Rye, and Greenwich to take maximum advantage of the speedup, which is good. The problem with it is that it forces New Haven Line locals to take slots from both the express tracks and the local tracks in the Bronx.
In this situation, the New Haven Line still runs six locals and 12 express trains to Grand Central per hour. The Harlem Line is reduced to six express trains through the Bronx and has to run 12 trains local. Transfers at Wakefield allow people in suburbs south of White Plains to get on a faster train, but this in effect reduces the effective peak-of-peak throughput from the suburbs to Manhattan to just six trains per hour.
Express Fordham station
Rebuilding Fordham as an express station means there’s no longer any need to figure out which trains stop there and which don’t: all would. Then the New Haven Line would run express and the Harlem would either run local or run a mix.
The problem is that Fordham is in a constrained location, where such a rebuild is hard:

The line is below-grade, with a tunnel from Fordham Road to 189th Street. The platforms are short and narrow, and partly overlap the tunnel. Any conversion has to involve two island platforms north of the tunnel, where there is room but only if the right-of-way is expanded a little, at the expense of some parkland, and possibly a lane of Webster Avenue. The cost would not be pretty, independently of the inability of the MTA to build anything on a reasonable budget.
That said, the timetables on the Northeast Corridor require some infrastructure intervention to smooth things, like grade-separating some junctions for hundreds of millions of dollars each. New Rochelle, which has only a local platform southbound, should almost certainly be rebuilt as a full express stop. So rebuilding Fordham is not out of the question, even if the cost is high (which it is).
In this situation, all New Haven Line trains should use the express tracks. Thus, as in the status quo alternative, the Harlem Line gets six express trains, the other trains having to run local. Potentially, there may be a schedule in which the New Haven Line runs 16 trains to Grand Central and eight to Penn Station, and then the Harlem Line can get eight local and eight express trains; but then the local trains have to be carrying the load well into Westchester, and four-tracking the line to Crestwood is likely obligatory.
The New Haven Line and intercity trains
The above analysis elides one important factor: intercity trains. The current practice in the United States is a three-way separation of urban rail, suburban commuter rail, and intercity rail, with fares designed to discourage riders from taking trains that are not in their sector. However, in much the same way the best industry practice is to charge mode-neutral fares within cities, it is also valuable to charge mode-neutral fares between them. In other words, it’s useful to look at the impact of permitting people with valid commuter rail tickets to take intercity trains, without seat reservations.
To be clear, this means that at rush hour, there are going to be standees on busy commuter routes including Stamford-New York and Trenton-New York. But it’s not necessarily bad. The intercity trip time in our timetables between Stamford and New York is around 29 minutes without high-speed bypasses; the standing time would be less than some subway riders endure today – in the morning rush hour the E train departs Jamaica Center full and takes 34 minutes to get to its first Manhattan stop. And then there’s the issue of capacity: commuter trains on the New Haven Line are eight cars long, intercity trains can be straightforwardly expanded to 16 cars by lengthening the platforms at a very small number of stations.
And if the intercity trains mop up some of the express commuter rail traffic, then the required service on the New Haven Line at rush hour greatly decreases. An intercity train, twice as long as a commuter train (albeit with somewhat fewer seats and less standing space per car), could plausibly displace so much commuter traffic that the peak traffic on the line could be reduced, say to 18 trains per hour from today’s 20. Moreover, the reduction would be disproportionately at longer distance: passengers west of Stamford would not have any replacement intercity train unless they backtracked, but passengers east of Stamford would likely switch. This way, the required New Haven Line traffic shrinks to 12 local trains per hour and six express trains; half the locals run to Penn Station and half to Grand Central, and all express trains run to Grand Central.
In that situation, we can rerun the scenarios for what to do about Fordham; the situation generally improves, since less commuter traffic is required. The maximum separation scenario finally permits actual separation – the Hudson Line would run on the express tracks into Grand Central and have to cross the southbound Harlem Line locals at Mott Haven Junction, with predictable gaps between trains. The locals-run-local scenario gives Harlem Line express trains more wiggle room to slot between New Haven Line express trains. The status quo option lets the Harlem Line run six local trains and 12 express trains, though that likely underserves the Bronx. Converting Fordham to an express stop straightforwardly works with zero, six, or 12 express Harlem Line trains per hour.
Or maybe not. It’s fine to assume that letting passengers get on a train that does Stamford-Penn Station in 29 minutes and New Haven-Penn Station in 57 for the price of a commuter pass is going to remove passengers from the express commuter trains and put them onto longer intercity trains. But by the same token, the massive speed improvements to the other stops could lead to an increase in peak demand. The current trip time to Stamford is 1:12 on a local train; cutting that to 45 minutes means so much faster trips to the suburbs in between that ridership could increase to the point of requiring even more service. I’m not convinced on this – the modal split for peak commutes to Manhattan is already very high (Metro-North claimed 80% in the 2000s), and these suburbs are incredibly NIMBY. But it’s worth considering. At the very least, more local service is easier to add to the timetable than more express service – locals to Grand Central don’t share tracks with intercities at all, and even locals to Penn Station only do on a controllable low-speed section in Queens.
New York-New Haven Trains in an Hour
Devin Wilkins and I are still working on coming up with a coordinated timetable on the Northeast Corridor, north to south. Devin just shared with me the code she was running on both routes from New Haven to New York – to Grand Central and to Penn Station – and, taking into account the quality of the right-of-way and tunnels but not timetable padding and conservative curve speeds – it looks like intercity trains would do it in about an hour. The current code produces around 57 minutes with 7% timetable pad if I’m getting the Penn Station throat and tunnel slowdowns right – but that’s an if; but at this point, I’m confident about the figure of “about an hour” on the current right-of-way.
I bring this up to give updates on how the more accurate coding is changing the timetable compared to previous estimates, but also to talk about what this means for future investment priorities.
First, the curve radii I was assuming in posts I was writing last decade were consistently too optimistic. I wrote three months ago about how even within the highest speed zone in southern Rhode Island, there’s a curve with radius 1,746 meters (1 degree in American parlance), which corresponds to about 215 km/h with aggressive cant and cant deficiency. At this point we’ve found numbers coming straight from Amtrak, Metro-North, and MBTA, letting us cobble together speed zones for the entire system.
But second, conversely, I was being too conservative with how I was setting speed zones. My principle was that the tightest curve on a section sets the entire speed limit; when writing commuter rail timetables, I would usually have each interstation segment be a uniform speed zone, varying from this practice only when the interstation was atypically long and had long straight sections with a tight curve between them. When writing intercity timetables, I’d simplify by having the typical curves on a line set the speed limit and then have a handful of lower speed limits for tighter curves; for example, most curves on the New Haven Line are 873 meters, permitting 153 km/h with aggressive high-speed rail cant and cant deficiency, and 157 km/h with aggressive limits for slower trains, which can run at slightly higher cant deficiency, but those sections are punctuated by some sharper curves with lower limits. Devin, using better code than me, instead lets a train accelerate to higher speed on straight sections and then decelerate as soon as it needs to. Usually such aggressive driving is not preferred, and is used only when recovering from delays – but the timetable is already padded somewhat, so it might as well be padded relative to the fastest technical speed.
The upshot of all of this is that the speed gains from just being able to run at the maximum speed permitted by the right-of-way are massive. The trip time today is 1:37 on the fastest trains between New York and New Haven. Commuter trains take 2:10, making all stops from New Haven to Stamford and then running nonstop between Stamford and Manhattan; in our model, with a top speed of 150 km/h, high-performance regional trains like the FLIRT, Talent 3, or Mireo should do the trip in about 1:15-1:20, and while we didn’t model the current rolling stock, my suspicion is that it should be around six minutes longer. The small difference in trip time is partly because Penn Station’s approach is a few kilometers longer than Grand Central’s and the curves in Queens and on the Hell Gate Bridge are tight.
What this means is that the highest priority should be getting trains down to this speed. In the Swiss electronics-before-concrete schema, the benefits of electronics on the Northeast Corridor are massive; concrete has considerable benefits as well, especially on sections where the current right-of-way constrains not just speed but also reliability and capacity, like New Haven-Kingston, but the benefits of electronics are so large that it’s imperative to make targeted investments to allow for such clean schedules.
Those investments do include concrete, to be clear. But it’s concrete that aims to make the trains flow more smoothly, in support of a repetitive schedule with few variations in train stopping patterns, so that the trains can be timetabled in advance not to conflict. At this point, I believe that grade-separating the interlocking at New Rochelle, popularly called Shell Interlocking and technically called CP 216, is essential and must be prioritized over anything else between the city limits of New York and New Haven Union Station. Currently, there’s very high peak traffic through the interlocking, with a flat junction between trains to Penn Station and trains to Grand Central.
On the electronics side, the timetables must become more regular. There are currently 20 peak trains per hour on the New Haven Line into Grand Central; of those, four go to branches and 16 are on the main line, and among the 16, there are 13 different stopping patterns, on top of the intercity trains. It is not possible to timetable so many different trains on a complex system and be sure that everything is conflict-free, and as a result, delays abound, to which the response is to pad the schedules. But since the padded schedules still have conflicts, there is a ratchet of slowdowns and padding, to the point that a delayed train can recover 20 minutes on less than the entire line. Instead, every train should either be a local train to Stamford or an express train beyond Stamford, and there should only be a single express pattern on the inner line, which today is nonstop between Harlem and Stamford and in the future should include a stop at New Rochelle; this means that, not taking intercity trains into account, the main line should have at most four stopping patterns (local vs. express, and Penn Station vs. Grand Central), and probably just three, since express commuter trains should be going to Grand Central and not Penn Station, as passengers from Stamford to Penn Station can just ride intercity trains.
Also on the electronics side, the way the line is maintained currently is inefficient to an extent measured in orders of magnitude and not factors or percents. Track inspection is manual; Metro-North finally bought a track geometry machine but uses it extremely unproductively, with one report saying it gets one tenth as much work done as intended. Normally these machines can do about a track-mile in an overnight work window, which means the entire four-track line can be regraded and fixed in less than a year of overnights, but they apparently can’t achieve that. Whatever they’re doing isn’t working; the annual spending on track renewal in Connecticut is what Germany spends on once-in-a-generation renewal. The endless renewal work includes a plethora of ever-shifting slow zones, and at no point is the entire system from New York to New Haven clear for trains, even on weekdays. The excessively complex schedule, on tracks that constantly shift due to segment-by-segment daytime repairs, is turning a trip that should be doable on current rolling stock in perhaps 1:23 into one that takes 2:10.
The billions of dollars in the Bipartisan Infrastructure Law that are dedicated to the Northeast Corridor and have not yet been spent can reduce trip times further. But the baseline should be that the bucket of money is a few hundreds of millions smaller and the base case trip time from New York to New Haven is an hour and not 1:37; this is what the system should be compared with.
Security Theater on Metros
In comments, Sarapen asked me about security on urban rail. It’s common in developing Asia to require people to go through metal detectors to get to the platforms; I’ve seen this in Bangkok, she mentions this in the context of Manila, and it’s also the case on Indian metros and Chinese ones. Seung Y. Lee, a BART digital media worker and indispensable commentator on American and Asian metro history who sadly doesn’t blog enough, has an excellent post about it, talking about the use of security as a tool of social control, for example in Hong Kong.
But Sarapen is asking about the need for security to deter terrorism and violence, which do exist in the Philippines.
For this, I naturally went to the country facing terrorism and violence that I know most intimately. I haven’t visited in more than 11 years, and so I’ve never taken the light rail in either Tel Aviv or Jerusalem (which was open on my last visit but I didn’t visit the city then). So I went ahead and asked on my Discord server, and got this:

The person covered up by the two shorter white rectangles also posted a picture of a platform in Tel Aviv, with a vault for bomb disposal:

Israel Railways does have security theater – one has to open one’s bags in front of a security guard and go through a metal detector, and this being Israel, there is extensive racial profililng. But the light rail, including the underground portions in Tel Aviv, do not. There’s a lot of visible security presence, including cameras, security personnel, and K-9 units, but no metal detectors. This is in a country that, in the 1990s and early 2000s, underwent a wave of suicide bombings on buses, and still didn’t put metal detectors on them, because it’s infeasible to install such technology on surface transit, and too expensive and frankly not too useful on metros (a security line is an attractive bombing target).
This shouldn’t be too surprising, in a way. Western Europe did not install security theater on trains in response to far-left terrorism in the 1970s and 80s. Israel’s way of building and running public transportation is intended as a pan-European medley, using consultants who have done work in Europe, and with media discourse that looks up to Europe’s urban transportation systems (on other matters Israel looks up to the United States, but Israelis understand American public transportation is not good). If nobody in Europe (or at least Western Europe) does something, it’s unlikely Israel will do it, not on civilian public infrastructure.
Curves in Fast Zones
I wrote nearly five years ago that the lowest-hanging fruit for speeding up trains are in the slowest sections. This remains true, but as I (and Devin Wilkins) turn crayon into a real proposal, it’s clear what the second-lowest hanging fruit is: curves in otherwise fast sections. Fixing such curves sometimes saves more than a minute each, for costs that are not usually onerous.
The reason for this is that curves in fast zones tend to occur on a particular kind of legacy line. The line was built to high standards in mostly flat terrain, and therefore has long straight sections, or sections with atypically gentle curves. Between these sections the curves were fast for the time; in the United States, high standards for the 19th century meant curves of radius 1,746 meters, at which point each 100′ (30.48 meters) of track distance correspond to one degree of change in azimuth; 30.48*180/pi = 1,746.38. For much more information about speed zones, read this post from four weeks ago first; I’m going to mention some terminology from there without further definitions.
These curves are never good enough for high-speed rail. The Tokaido Shinkansen was built with 2,500 meter curves, and requires exceptional superelevation and a moderate degree of tilting (called active suspension) to reach 285 km/h. This lateral acceleration, 2.5 m/s^2, can’t really be achieved on ordinary high-speed rolling stock, and the options for it always incur higher acquisition and operating costs, or buying sole-sourced Japanese technology at much higher prices than are available to Japan Railways. In practice, the highest number that can be acheived with multi-vendor technology is around 2.07 m/s^2, or at lower speed around 2.2; 1,746*2.07 = 3,614.22, which corresponds to 60.12 m/s, or 216 km/h.
If a single such curve appears between two long, straight sections, then the slowdown penalty for it is 23.6 seconds from a top speed of 300 km/h, 35.5 from a top speed of 320 km/h, or 67 seconds from a top speed of 360 km/h. The curve itself is not instantaneous but has a few seconds of cruising at the lower speed, and this adds a few seconds of penalty as well.
Case in point: the curve between Kingston and Wickford Junction in South County, Rhode Island is such a curve. The red line below denotes a 4 km radius curve, good for a little more than 320 km/h (360 with tilt), deviating from the black line of the existing curve.

The length of the existing curve is about 1.3 km, so to the above penalties, add 6 seconds if the top speed is 300 km/h, 7 if it’s 320, or 8.7 if it’s 360 (in which case the curve needs to be a bit wider unless tilting is used). If it’s not possible to build a wider curve than this, then from a top speed of 360 km/h, the 320 km/h slowdown adds 10 seconds of travel time, including a small penalty for the 1.3 km of the curve and a larger one for acceleration and deceleration; thus, widening the curve from the existing one to 4 km radius actually has a larger effect if the top speed on both sides is intended to be 360 and not 320 km/h.
The inside of this curve is not very developed. This is just about the lowest-density part of the Northeast Corridor. I-95 has four rather than six lanes here. Land acquisition for curve easement is considerably easier than in higher-density sprawl in New Jersey or Connecticut west of New Haven.
The same situation occurs north of Providence, on the Providence Line. There’s a succession of these 1,746 meter curves, sometimes slightly tighter, between Mansfield and Canton. The Canton Viaduct curve is unfixable, but the curves farther to the south are at-grade, with little in the way; there are five or six such curves (the sixth is just south of the viaduct and therefore less relevant) and fixing all of them together would save intercity trains around 1:15.
For context, 1.3 km of at-grade construction in South County with minimal land acquisition should not cost more than $50 million, even with the need to stage construction so that the new alignment can be rapidly connected to the old one during the switchover. Saving more than a minute for $50 million, or even saving 42 seconds for $50 million, is around 1.5 orders of magnitude more cost-effective than the Frederick Douglass Tunnel in Baltimore ($6 billion for 2.5 minutes); there aren’t a lot of places where it’s possible to save so much time at this little cost.
It creates a weird situation in which while the best place to invest in physical infrastructure is near urban stations to allow trains to approach at 50-80 km/h and not 15-25, the next best is to relieve 210 km/h speed zones that should be 300 km/h or more. It’s the curvy sections with long stretches of 120-160 km/h that are usually more difficult to fix.
We Gave a Talk About New York Commuter Rail Modernization
Blair Lorenzo and I gave the talk yesterday, as advertised. The slide deck was much more in her style than in mine – more pictures, fewer words – so it may not be exactly clear what we said.
Beyond the written report itself (now up in web form, not just a PDF), we talked about some low-hanging fruit. What we’re asking for is not a lot of money – the total capital cost of electrification and high platforms everywhere and the surface bottlenecks we talk about like Hunter Junction is around $6 billion, of which $800 million for Portal Bridge need to happen regardless of anything else; Penn Reconstruction is $7 billion and the eminently cancelable Penn Expansion is $17 billion. However, it is a lot of coordination, of different agencies, of capital and operations, and so on. So it’s useful to talk about how to, in a way, fail gracefully – that is, how to propose something that, if it’s reduced to a pilot program, will still be useful.
The absolute wrong thing to do in a pilot program situation is to just do small things all over, like adding a few midday trains. That would achieve little. There is already alternation between hourly and half-hourly commuter trains in most of the New York region; this doesn’t do much when the subway or a subway + suburban bus combination runs every 10-12 minutes (and should be running every six). The same can be said for CityTicket, which incrementally reduces fares on commuter rail within New York City but doesn’t integrate fares with the subway and therefore produces little ridership increase.
Instead, the right thing to do is focus on one strong corridor. We propose this for phase 1, turning New Brunswick-Stamford or New Brunswick-New Rochelle into a through-line running every 10 minutes all day, as soon as Penn Station Access opens. But there are other alternatives that I think fall into the low-hanging fruit category.
One is the junction fixes, like Hunter as mentioned above (estimated at $300 million), or similar-complexity Shell in New Rochelle, which is most likely necessary for any decent intercity rail upgrade on the Northeast Corridor. It costs money, but not a lot of it by the standards of what’s being funded through federal grants, including BIL money for the Northeast Corridor, which is relevant to both Hunter and Shell.
The other is Queens bus redesign. I hope that as our program at Marron grows, we’ll be able to work on a Queens bus redesign that assumes that it’s possible to connect to the LIRR with fare integration and high frequency; buses would not need to all divert to Flushing or Jamaica, but could run straight north-south, leaving the east-west Manhattan-bound traffic to faster, more efficient trains.
Setting Speed Zones
At the Boston meetup two days ago, I was asked about what tools I use to generate timetables, for example for my New York commuter rail posts. The answer is that I use speed zones and then run this code on them – but then the question is how to figure out speed zones. I hope that this sequence of steps will help advocates who are interested in rail modernization.
Generating curve radii
The most difficult element to fix on mainline rail is the right-of-way geometry. Most other things that can restrict a train’s speed can be fixed with more modern maintenance, but right-of-way geometry doesn’t change without physical construction, often in constrained areas – if they weren’t constrained, the curves would have been built wider in the first place.
The best case scenario is that there exist track maps with exact curve radii. I have these for large chunks of the Northeast Corridor, but not all. For example, here is Metro-North (with thanks to The Korot). Curves on such maps are denoted as circles or bumps deviating from a line, with the direction of the circle indicating the direction of the curve’s curvature. On this and other American maps, the radius is listed in degrees, and the cant (see the section below) in inches.
To convert the radius from degrees to more usual units, set one degree to be 1,746 meters, and note that degrees measure curvature and not radius, so a two degree curve has half the radius of a one degree curve. More precisely, the formula is that degrees measure the change in azimuth over 100 feet; 100 feet are 30.48 meters, and converting 30.48 from degrees to radians gives 1,746.37536… meters.
For example, on the Metro-North chart, let’s look at Harrison, New York. It’s on PDF-p. 24 of the chart; Harrison is sandwiched between two curves with opposite orientations, with the platforms on tangent (uncurved) track. The curve just west of Harrison has radius 1° 58′ 30″, which is 884.24069… meters; the curve just east has radius 2° 2′ 15″, which is 857.11674… meters.
Converting curve radii to speeds
The formula for the speed of a train, in SI units, is
I wrote about lateral acceleration, cant, and cant deficiency two and a half years ago. In short, lateral acceleration, in m/s^2, is the centrifugal force coming from the action of the train rounding the curve at speed. For the purposes of the formula, it is measured in the horizontal plane. To reduce the centrifugal force felt by the passengers (for comfort and safety) as well as that felt by the train body (for safety and maintenance costs), the tracks will typically be banked so that the inner rail is lower than the outer rail, which is called cant or superelevation, and is written in units of distance, such as mm or inches.
The speed of a train on canted track is typically higher than the perfect balancing speed, where the force of gravity counteracts that of centrifugal force; thus, on a fast train there is a residual force pointing to the outside of the train, which can be written down as lateral acceleration in the plane of the tracks (in m/s^2), but is more typically written down in the same units as superelevation, representing the additional superelevation required for the speed to perfectly balance, which is called cant deficiency or underbalance.
The conversion rate between cant (or cant deficiency) is the track gauge measured between the middle of the two rails, divided by the gravitational constant (9.8 m/s^2). Track gauge is typically given as inner rail to inner rail; standard gauge is 1,435 mm inner rail to inner rail. The relevant quantity to superelevation calculations is a few cm more; on standard gauge, it’s taken to be about 1,470-1,500 mm, so the conversion rate is 1 m/s^2 = 150 mm of cant or cant deficiency. The two quantities, cant and cant deficiency, are additive.
The American track charts that I have specify the actual cant. However, the values tend to be too conservative. Again with the example of Harrison, the slightly wider western curve has 5″ cant and the slightly tighter eastern curve has 4.125″ cant. Regulations for maximum cant depend on the country and maintenance standards. The absolute maximum cant I am aware of on any standard-gauge railway is 200 mm on the Tokaido and Tohoku Shinkansen. The reasons not to raise cant further include maintenance difficulties and the risk of a train running at lower speed or even stopping on the track. On lines that are not captive to just high-speed trains, the highest cant I am aware of is 180 mm, in Germany, and this is rare; 160 mm is more common. The American limit is 7″, but frequent inspections are required at that point to ensure that the tracks don’t get bent out of shape to produce higher cant; 8″ is a do-not-exceed level, and in practice track irregularities may lead to exceeding it if there isn’t regular track maintenance.
In practice, raising the cant is usually easy – it can be done with a track geometry machine automatically. However, in one case, it is not: that of S-curves, which have exactly the shape implied by the letter S. Our example of Harrison has two reverse curves in close proximity, but is not an S-curve, as there are hundreds of meters of tangent track between the two curves. Other places do have S-curves, and there, the maximum cant must be lowered somewhat; regulations vary on this, but in Europe, the maximum change in cant is 30-55 mm per second depending on the country (there’s a secondary regulation on mm per meter, but on the Northeast Corridor, the binding rule is mm/s, not mm/m). For example, if we take 45 mm/s, and 180 mm of cant, then it takes four seconds to reverse a curve; note that it is four and not eight, because half of the increase in cant, called a superelevation spiral, is within the curve. Harrison’s current curves impose a hard limit of about 150 km/h, at which point the hundreds of meters of tangent track make it trivially easy to have full superelevation. However, other places, most infamously among Northeastern railfans Elizabeth, there is an actual S-curve, forcing lower cant and lower speeds.
Finally, the maximum cant deficiency depends on the track, the train, and the regulations. Traditionally, American regulations limited most passenger trains to 3″ of cant deficiency, which is ridiculously conservative; in the 2000s, a waiver allowing 5″ in some cases was derided as the “magic high-speed rail waiver” because it applied not just to higher-speed tracks but also to lower-speed lines that had through-service to higher-speed tracks. Since then, FRA regulations have changed, and now the practical limit in the US, with extensive testing, is 6″, or 150 mm cant deficiency, at most speeds. European limits tend to be around 130-150 mm; high-speed trains are at the lower end of this range unless they are tilting trains, which nearly all trains are not. Cant deficiency, like cant, requires its own superelevation spiral on S-curves, but the limits are in practice looser than for cant, and in some cases trains can change superelevation abruptly, with no spiral, for example on switches.
The upshot is that in the typical case, the most aggressive assumption should be 180 mm cant, 150 mm cant deficiency, for a lateral acceleration in the horizontal plane of 2.2 m/s^2. Most lines will not have this pair of aggressive assumptions: 180 mm is only viable when it’s guaranteed that trains will not stop on a canted curve, which is a reasonable assumption on a reliable high-speed line and even on a German high-speed line. Moreover, if there is any freight on the line, superelevation must fall drastically: slower trains would be at cant excess, and freight trains have high center of mass (diesel locomotives and double-stacked containers both have higher center of mass than electric passenger trains) and therefore have tight cant excess limits. Aggressive assumptions are viable on the Northeast Corridor and on controllable commuter lines with no or almost no freight, such as the LIRR, but not everywhere else.
Finding curve radii
In some cases, curve radii are spelled out in a chart. In others, they are not, and must be figured out. Our program’s schedule writer, Devin Wilkins, tells me she has just found a track chart for SEPTA, but otherwise, I have no such charts south of New York. There, the dirty, imperfect method of estimating curve radii must be used.
For that, I use Google Earth. Nowadays, Google Earth Pro is free, and comes with a circle tool. In theory, I can fidget with the radius of a circle until I find that it approximates the arc of a curve well. This requires paying special attention to how the drawn curve compares with not just the broad outline of the curve but also the exact arc of each track or even each rail: the drawn curve should be at the same relative position to the rails, such as following one rail of one track, or right in the center of one track, or right between the track centers, and so on.
In cases of uncertainty, it’s also possible to use Google Earth line tools, which state the azimuth of each line. If I can find the exact start and end points of each curve, and the azimuths of the tangents on both side, then I can draw the chord with a line tool, verifying that its azimuth is the exact arithmetic mean of the azimuths of the two tangents; if it is not the mean, then either I made an error (more likely) or the curve is not a perfect circle (possible but less likely). The radius of the curve is approximately the length of the chord times 180/pi divided by change in azimuth; more precisely, the radius is
This method is error-prone, especially for short, sharp curves. Computing the start and end points of the curve will always have errors, and if the change in azimuth is small, then these will lead to large errors. The circle tool method suffers from the same drawback: it’s easier to use it to estimate the radius of a curve with 60 degree change in azimuth than that of one with 10 degree change.
Over time I’ve gotten this method down to the point that my errors from what I later find with track charts such as that of Metro-North are fairly small, and not very biased in the larger-radius direction. But it takes time and practice and ideally you should avoid it for short, sharp curves.
Update 10-28: Ari Ofsevit has a third method, using chord lines.
Other speed limits
Speed limits on intercity trains mostly come from curves. But there are other things to keep in mind, not all of which are fixable:
- Tunnels increase air resistance to the point that unless the tunnels are constructed with large enough radius to have a lot of free air (modern tunnels are, legacy ones aren’t), or unless the trains are pressurized, the speed limit has to be lower just to avoid popping passengers’ ears at entry and exit.
- Switches generally have low speed limits – they have tight curves and no superelevation – making all complex junctions and major stations slow.
- Terminal stations have another set of speed limits coming from the bumper tracks. American limits are very conservative – 10 mph where a ramp down from 40 or 50 km/h is more normal in Europe – but even 50 km/h is not 200 km/h.
- Some pieces of infrastructure are so shoddy that they limit the dynamical axle load of the train, which is derived from both static axle load, which is a function of train mass, and speed. As usual, American limits on this are conservative, assuming high static axle load and fixing a low speed on some very old bridges, instead of permitting lighter trains to run faster. But it is sometimes a real problem.