Comprehensive Climbers Ranking

Page 3 - Get up to date with the latest news, scores & standings from the Cycling News Community.
Feb 7, 2026
18
45
80
Indurain is easily a Top 5 climber in history (I think his Tour weight was around 78kg). Even though it has to be said that his style of just setting a hard pace (a bit like Evenepoel) inflates his watts slightly. He also sometimes liked to attack over the top of climbs just like Evenpoel.

His '95 Tour, without having a 100+ performance, may be the best in terms of climbing consistency at a high level ever. He basically pumped out 95+ performances every mountain stage and easily matched/surpassed Riis and the ONCE squad. Riis in that Tour already had the same general shape as in '96, he just had a bad day on La Plagne.
 
  • Like
Reactions: Krzysztof_O
Aug 13, 2024
731
840
4,180
@Pozzovivo

Sormano '24: Altitude -3, Stage Hardness +4, Approach/Clear Start -2, Follow-up +3, Regularity +2 --> +4 Adjustment
Ganda '25: Altitude -2, Stage Hardness +3, Approach -3, Follow-up +2 --> 0 Adjustment

My model "does not like" Ganda because of the easy approach (descent followed by false flat downhill is the best/easiest possible approach for me).
Sormano also gets a +2 for regularity because there is 1km of flat (or even slight downhill) in the middle of the climb.

All these adjustments are of course debatable, but they fit quite well also with the observed performance of the next riders (Evenepoel etc.).
Thank you for taking the time to explain, @Peyresourde.
I understand that some degree of subjectivity is unavoidable, and I am not trying to challenge your judgement(or model). I would howver like to know more about the methodological approach behind it.

Importantly, how did you develop the model and the categories itself? Was there an exploratory and confirmatory process, for example by splitting the data?

Did you test alternative model specifications with different weightings on a subset of climbs, and then evaluate how well they predicted performance in another dataset with the same riders? In other words, did you arrive at the current structure because it provided the best fit across samples?

I ask because I want to be cautious about the risk of models being shaped, even unintentionally, by prior expectations or narratives about particular riders or races, and I am interested in understanding how you worked to guard against that.
 
Feb 7, 2026
18
45
80
Thank you for taking the time to explain, @Peyresourde.
I understand that some degree of subjectivity is unavoidable, and I am not trying to challenge your judgement(or model). I would howver like to know more about the methodological approach behind it.

Importantly, how did you develop the model and the categories itself? Was there an exploratory and confirmatory process, for example by splitting the data?

Did you test alternative model specifications with different weightings on a subset of climbs, and then evaluate how well they predicted performance in another dataset with the same riders? In other words, did you arrive at the current structure because it provided the best fit across samples?

I ask because I want to be cautious about the risk of models being shaped, even unintentionally, by prior expectations or narratives about particular riders or races, and I am interested in understanding how you worked to guard against that.
My starting point was that I was unhappy with the existing metrics.
LR uses ASLP, which only incoorporates altitude, but is IMO too extreme and also undervalues short climbs.

W2W adjusts for stage hardness and altitude. But it devalues TTs extremely. Especially in mixed TTs like PDBF 2020, it is even harder to perform on the climb than after a 180k road stage. W2W just takes the overall kilojoules and slams a fat minus adjustment on the effort.
In contrast, it seems to overvalue long stages. E.g. on the Lombardia climbs almost everyone does a PB which is not likely to be true.

My first premise is to rather adjust too little than too much. There are studies for the effect of altitude and heat, but I also take these with a grain of salt because some of the effects seem very high to me.

My next premise was that the approach to a climb is very important (even more important than how hard the overall stage has been). As for how to incorporate this? I did not use any scientific method, I basically just winged it and did what seemed plausible to me.

Certain prior expectations play a role in my process. E.g. Carapaz did a really high level performance in the Giro last year on the stage he won (on Pietra di Bismantova). It was a small climb and not very steep, no one else analyzed it. Then you catch yourself thinking: Carapaz normally never has high watts, this is a complete outlier --> There must have been a tailwind or maybe the segment was wrong etc.

And for a famously good climber, it may be the other way around: he won and gapped everyone by 30 seconds, yet the performance seems low --> Maybe the tarmac was worse than I thought, or there was a headwind?

I do this as a hobby, and I intentionally do a lot of the adjustments manually and by feel. Otherwise I would have no fun doing it. I also think this way may be more accurate than some automated system using a unified method (where would you even get accurate data for all parameters? Strava is an option, but it did not exist back in the day and many riders don't post power).

So certainly, for some performances I input the data once, it seems good and I never look at it again. While for some others, I put a lot of effort in to get it right/plausible.
 
Last edited:
Aug 13, 2024
731
840
4,180
My starting point was that I was unhappy with the existing metrics.
LR uses ASLP, which only incoorporates altitude, but is IMO too extreme and also undervalues short climbs.

W2W adjusts for stage hardness and altitude. But it devalues TTs extremely. Especially in mixed TTs like PDBF 2020, it is even harder to perform on the climb than after a 180k road stage. W2W just takes the overall kilojoules and slams a fat minus adjustment on the effort.
In contrast, it seems to overvalue long stages. E.g. on the Lombardia climbs almost everyone does a PB which is not likely to be true.

My first premise is to rather adjust too little than too much. There are studies for the effect of altitude and heat, but I also take these with a grain of salt because some oft he effects seem very high to me.

My next premise was that the approach to a climb is very important (even more important than how hard the overall stage has been). As for how to incorporate this? I did not use any scientific method, I basically just winged it and did what seemed plausible to me.

Certain prior expectations play a role in my process. E.g. Carapaz did a really high level performance in the Giro last year on the stage he won (on Pietra di Bismantova). It was a small climb and not very steep, no one else analyzed it. Then you catch yourself thinking: Carapaz normally never has high watts, this is a complete outlier --> There must have been a tailwind or maybe the segment was wrong etc.

And for a famously good climber, it may be the other way around: he won and gapped everone by 30 seconds, yet the performance seems low --> Maybe the tarmac was worse than I thought, or there was a headwind?

I do this as a hobby, and I intentionally do a lot of the adjustments manually and by feel. Otherwise I would have no fun doing it. I also think this way may be more accurate than some automated system using a unified method (where would you even get accurate data for all parameters? Strava is an option, but it did not exist back in the day and many rider don't post power).

So certainly, for some performances I input the data once, it seems good and I never look at it again. While for some others, I put a lot of effort in to get it right/plausible.
Great reply, very reasonable and credit to you. Still I do think that doing a path analysis with a "split data" approach could be of high value to find the best model fit. Maybe racing data is just too noisy idk.

I understand that when considering the performance on a climb, the lead in to the climb is very very improtant. So, for instance, Superbagneres in last years TDF should be massively upgraded in your view compared to other sources? The official segment starts after doing like 1,5km of 9% for some reason. Is this Felix Gall's best perfomance?

Anyway, looking forward to seeing the next lists you plan on posting, and how it will prove what we knew in our hearts - that Pozzovivo was in fact the best climber of all time ;) Any standout performances from him in your dataset btw?
 
Last edited:
Sep 10, 2016
444
840
11,480
Thanks for that informative presentation, Peyresourde! It's amazing how those climbing times of Pantani still stand. It lasted twenty years until those times were matched, in a completely different era, with completely different material.
Thirty years actually
 
Feb 7, 2026
18
45
80
Great reply, very reasonable and credit to you. Still I do think that doing a path analysis with a "split data" approach could be of high value to find the best model fit. Maybe racing data is just too noisy idk.

I understand that when considering the performance on a climb, the lead in to the climb is very very improtant. So, for instance, Superbagneres in last years TDF should be massively upgraded in your view compared to other sources? The official segment starts after doing like 1,5km of 9% for some reason. Is this Felix Gall's best perfomance?

Anyway, looking forward to seeing the next lists you plan on posting, and how it will prove what we knew in our hearts - that Pozzovivo was in fact the best climber of all time ;) Any standout performances from him in your dataset btw?
I plan the start posting the next list later today.

And yes, certain aspects of analysis can still be improved. it is by no means a completely finished product and I adjust things from time to time. The more often a certain climb is used, the more feel and context you also get.

One interesting climb is Oude Kwaremont, because it is raced twice in the same race (actually 3 times total) and Pogacar more or less goes full gas on both. In 2025 he went much faster the first time and still had 4 guys on his wheel. 40 km later he was slower but dropped everyone. So you could argue the correct adjustment would be to make both performances equal or even make the second one better.

But then I would have to adjust the second effort by an insane amount like +30 or even +40 points (basically equivalent to 1 W/kg!!!)

Superbagneres is indeed Gall's best performance with 89 (+7). For historic reasons (to be able tp compare with 1986 and 1989) I take the whole 42 minute climb from Luchon. If I just took some segment starting higher up, the Index would probably be the same, but with much higher adjustment. E.g.: 89 (+15)


For Pozzovivo: In my dataset I tried to include every effort of 80 or higher. For other climbs with lower performances, I often just take the winner/highest performer. That is to say that I have only 3 entries for Pozzovivo im my dataset and his best effort is probably not even in there.
The 3 I have:

72 (-1) on Lago Laceno/Mollela (Giro 2012)
72 (-2) on Zoncolan (Giro 2018)
69 (+12) on Fedaia (Giro 2008)