- Jul 5, 2009
- 2,440
- 4
- 0
Being unemployed has left me with some time on my hands to work on something that I've been thinking about for a while. What I've done is collect the final GC results for every rider since 1985 including their total finish time and average speed. Going year by year, I took each rider's speed and "normalized" it by the average for that year. So, for example, if you raced the Tour at the average speed, your normalized speed is 1.0, and if you raced it 1.5% faster than average, your normalized speed is 1.015.
I did this normalization so that data from different years could be compared, since each race is a bit different in course, weather, etc. I then took this normalized data and looked at the population distribution for each year. My initial assumption was that the distribution of speeds would be gaussian (bell-shaped) and centered on the average. I also expected to at some point see a "two speeds" phenomenon due to EPO and other drugs. This would look like a distribution with two peaks instead of one.
What I actually saw surprised me. Even back in 1985 there was a two speed phenomenon, but the peaks were very close together, ~1% apart. This is true for every year, except there is a very linear (r^2 > 0.86) increase in the gap between those peaks of 0.1% per year. The gap between the fast and slow groups now stand at ~5%. This is puzzling because the average Tour speed has been flat since 1998, after the 50% hematocrit rule was in full effect. So this means that every year the fast riders are getting faster and the slow riders are getting slower, without an overall change in speed.
Even more bizarre is that as of 2003, there is an obvious third group in the middle of the population. So now you have, a fast group, an average group, and a slow group. This somewhat coincides with the introduction of the UCI's EPO test in 2001. And now in the last few years, the spread between fast and slow is large enough that the population distribution is fairly smeared out with indistinct peaks for the three groups (i.e., not enough riders/data).
One other interesting thing I noticed is the relative size of the groups. From 1985 to 1995, the ratio of slow to fast riders was steady at 2:1. From 1996 onwards, the ratio is very steady at 3:1 (the slow group includes the middle/average group).
I would love to post graphs, etc but attachments aren't allowed. I tried and got the "Sorry, the board attachment quota has been reached" message. If anyone would like an excel copy of the raw data, the cleaned data, the analysis, or graphs just let me know via message or email me at john@bikephysics.com
Also, if there's any specific analysis you'd like me to do, just say the word.
John Swanson
I did this normalization so that data from different years could be compared, since each race is a bit different in course, weather, etc. I then took this normalized data and looked at the population distribution for each year. My initial assumption was that the distribution of speeds would be gaussian (bell-shaped) and centered on the average. I also expected to at some point see a "two speeds" phenomenon due to EPO and other drugs. This would look like a distribution with two peaks instead of one.
What I actually saw surprised me. Even back in 1985 there was a two speed phenomenon, but the peaks were very close together, ~1% apart. This is true for every year, except there is a very linear (r^2 > 0.86) increase in the gap between those peaks of 0.1% per year. The gap between the fast and slow groups now stand at ~5%. This is puzzling because the average Tour speed has been flat since 1998, after the 50% hematocrit rule was in full effect. So this means that every year the fast riders are getting faster and the slow riders are getting slower, without an overall change in speed.
Even more bizarre is that as of 2003, there is an obvious third group in the middle of the population. So now you have, a fast group, an average group, and a slow group. This somewhat coincides with the introduction of the UCI's EPO test in 2001. And now in the last few years, the spread between fast and slow is large enough that the population distribution is fairly smeared out with indistinct peaks for the three groups (i.e., not enough riders/data).
One other interesting thing I noticed is the relative size of the groups. From 1985 to 1995, the ratio of slow to fast riders was steady at 2:1. From 1996 onwards, the ratio is very steady at 3:1 (the slow group includes the middle/average group).
I would love to post graphs, etc but attachments aren't allowed. I tried and got the "Sorry, the board attachment quota has been reached" message. If anyone would like an excel copy of the raw data, the cleaned data, the analysis, or graphs just let me know via message or email me at john@bikephysics.com
Also, if there's any specific analysis you'd like me to do, just say the word.
John Swanson