Should Event Categories Be Based on Power to Weight or FTP?

For flat and hilly routes, success in real life racing and group rides often depends more on raw power (FTP) than power to weight (w/kg). GCN has an excellent video on this.

If this is true on Zwift as well, should group events continue to be based on power to weight instead of FTP?

To answer this question, we need to determine the following:

Does power to weight create a significant advantage for heavier, more powerful riders?

Would FTP create a significant advantage for lighter but equally powerful riders?

Which method is better?

Are there alternatives?

Does Power to Weight Create a Significant Advantage for Heavier Riders?

We analyzed ZwiftPower race results by comparing the finishing order of 2,000+ racers in 17 Zwift races. We grouped racers into power groups, recorded their power to weight and percentile finish in the race (lower is better) and charted the results (see below). If power to weight is a fair method, you’d expect the graph lines to be flat indicating similar race results (Y axis) across all power groups (X axis). But, instead, at virtually every power to weight across every race category (A, B and C) the graph lines are descending, clearly indicating that the heavier, more powerful racers, have significantly better results.

Percentages are the average race result from the top for each power group (e.g. 40% means that, on average, that power group finished in the top 40% of all racers). Power groups represent the normalized power plus 20 watts (e.g. the 260 power group includes all racers with normalized power between 260-280 watts).

Speed/time tests were then conducted to determine just how muchfaster heavier riders are at the same power to weight. Three volunteers chose either a flat or hilly loop and performed three identical Zwift workouts at different weights and raw power but the same power to weight. Not surprisingly, over a one-hour event, the heavier riders were between 1:00-1:31 minutes faster for every 10 kg weight difference depending on various factors such as gradient, power and weight.

Based on this speed/time test and our weight study (see appendix), these results can significantly impact events.

In a 100-person event, a C rider of average weight is 10-20 kg lighter than approximately 16 people… a 1:00-3:00 minute time disadvantage in a one-hour event.

In a 100-person event, the lightest 16 C riders are 10-40 kg lighter than approximately half the field… a 1:00-6:00 minute time disadvantage in a one-hour event.

Would FTP Create a Significant Advantage for Lighter but Equally Powerful Riders?

Although events based on power to weight favor heavier riders, should events be based on FTP instead? Not necessarily. Since we expect lighter riders to be faster than heavier riders at the same power, we need to test this speed advantage and compare the results to the power to weight test.

For the FTP test, three volunteers once again chose either a flat or hilly loop and performed three identical workouts at different weights but the same raw power. As expected, over a one-hour event, lighter riders were between 1:35-2:22 minutes faster for every 10 kg weight difference.

Which Method is Better?

In the power to weight test, heavier riders were between 1:00-1:31 minutes faster for every 10 kg weight difference. In our FTP-based test, lighter riders were between 1:35-2:22 minutes faster for every 10 kg weight difference.

Based on these results, it becomes clear that events based on power to weight give an advantage to heavier riders, but events based on FTP would give a greater advantage to lighter riders.

Are There Alternatives?

So, it turns out that power to weight, for all its faults, is slightly better than FTP. Is there a more effective method for closing the wide time gaps we see during events? Below are the pros and cons of two alternatives:

1: Hybrid Power to Weight

Same as currently but with minimum and maximum FTP limits. Riders above the maximum move up and riders below the minimum move down.

Pros: Shorter event time gaps; easy to implement.

Cons: At the same power to weight, the remaining heavier riders still have an advantage.

2: Results-Based

You race, you get results and you are placed in a category based on those results.

Pros: A fair, merit-based system which is used in real life racing.

Cons: Someone has to create and administer a system that has the backing of Zwift and the entire Zwift community.

Regardless of some imperfections, Zwift events are still the backbone of the platform which has made indoor cycling not just tolerable but social, fun and addicting. Hopefully as the platform matures more effective event classification can be implemented, resulting in a better experience for all involved.

Appendix

Fun facts about weight from our study of 2,454 Zwift racers (A=629, B=894, C=728, D=203… too few to be included in our race results analysis):

The average Zwift racer weighs 76.8 kg (169.3 lb) with a standard deviation of 9.2 kg (20.3 lb)
At one standard deviation, 68% of the racers weigh between 67.6-86.0 kg (149.0–189.6 lb)
At two standard deviations, 95% of the racers weigh between 58.4-95.2 kg (128.8–209.0 lb)

“A” racers weigh the least and have the lowest weight variation. Each successive group weighs more and has a higher weight variation.
The average A racer weighs 72.3 kg (159.4 lb) with a standard deviation of 7.4 kg (16.3 lb).
The average B racer weighs 75.7 kg (166.9 lb) with a standard deviation of 8.6 kg (19.0 lb).
The average C racer weighs 79.7 kg (175.7 lb) with a standard deviation of 10.4 kg (22.9 lb).
The average D racer weighs 85.7 kg (188.9 lb) with a standard deviation of 12.9 kg (28.4 lb)

In Zwift races, the average weight difference between the heaviest rider and lightest rider is 44.1 kg (97.2 lb).
The average minimum weight in a Zwift race is 55.7 kg (122.8 lb)
The average maximum weight in a Zwift race is 99.8 kg (220.0 lb)

Fun facts from our study about normalized (weighted average) power:

The average Zwift racer puts out 269 watts with a standard deviation of 34 watts.
At one standard deviation, 68% of the racers put out between 235-303 watts.
At two standard deviations, 95% of the racers put out between 201-337 watts

Each category averages about 40 watts more or less than its closest category. All of the categories have about the same power variation.
The average A racer puts out 318 watts with a standard deviation of 36 watts.
The average B racer puts out 277 watts with a standard deviation of 35 watts.
The average C racer puts out 237 watts with a standard deviation of 35 watts.
The average D racer puts out 197 watts with a standard deviation of 34 watts.

The average minimum and maximum power for all races is 190 and 343 watts, respectively, for a difference of 154 watts.
The average min and max power in an A race is 244 and 398 watts, respectively, for a difference of 153 watts.
The average min and max power in a B race is 189 and 352 watts, respectively, for a difference of 163 watts.
The average min and max power in a C race is 160 and 310 watts, respectively, for a difference of 150 watts.
The average min and max power in a D race is 127 and 257 watts, respectively, for a difference of 130 watts.

Fun facts from our study about race times. Keep in mind, these results are AFTER Zwiftpower disqualified or moved riders up from the real-time Zwift results. In addition, we eliminated the bottom 5% of racers in each race Category to prevent severely lagging racers from skewing the results.

The average Zwift racer finished about 3:20 minutes behind the winner with a standard deviation of 2:03 minutes.

In each successive category, the average racer finished further behind the winner.
The average A racer finished 1:30 seconds behind the winner.
The average B racer finished 3:01 seconds behind the winner.
The average C racer finished 4:50 seconds behind the winner.
The average D racer finished 6:19 seconds behind the winner.

On average, the last place racer finished 8:17 seconds behind the winner.
The average last place A racer finished 4:54 behind the winner.
The average last place B racer finished 7:16 behind the winner.
The average last place C racer finished 10:07 behind the winner.
The average last place D racer finished 16:44 behind the winner.

I would be more concerned with people calibrating their non-direct drive trainers properly. I worked at a bike shop and the trainer there was set up with the wrong tension and I was able to hit 1200 W with sneakers and khakis on. I would suggest only direct drive trainers for A races.

You may have yours set up correctly, or maybe not. That’s the point. Many of them are notoriously inaccurate and give an advantage whether people are aware of it or not. It’s not as bad as Zpower ofcourse but it’s not good either. Low HR, high average power is commonplace. If they don’t own a power meter in real outdoor riding they just don’t know the difference I guess? 50-70+ watts discrepancy matters.

Totaly agree! FTP or W/Kg both run into problems that when you have a good performance you get DQF’d for being too good. Promote/demote based on results (like IRL). Now only the A race shows the best rider in that class.

Great article!
I am at the bottom end of B coz I am apparently a tad too strong for C. I still get clobbered by both B and some C riders tho. That’s racing!
Perhaps 2-3 more categories would help even things out plus accounting for actual race results, as mentioned earlier?
Age is important consideration too. Younger riders can have huge performance gains over time. Whereas older riders particularly 60+ will only have marginal gains if at all therefore are not likely to improve their W/kg.

Great article! I still find it slightly hard to get my head around the fact that on a relatively flat course, lighter riders are at a disadvantage. This is counter intuitive as IRL (In Real Life) a lighter rider is probably/possibly smaller than a heaver rider, therefore creates less drag (more aerodynamically efficient), which is one of the main contributors to speed on the flat.
I recently took part in a WBR 2.5 hour event on a flat(ish) course, and I noticed that I was having to ride at approx 0.2W/K more than most others around me.
I would say that FTP is a better measure, as it better reflects the situation in RL, and if a lighter rider puts out the same power as a heavier one, they should go a little bit faster.
Like the discussions around the (so called) ‘double draft’ (or better named ‘true draft’) effect, I find it difficult to understand why people are resistant to making Zwift as realistic to RL as possible – but maybe that’s because I’m a lighter rider without much ‘top end’ power & I want to weight things in my direction 🙂

The heavier rider just puts out way more watts do to their weight than a lighter rider, although yes in real life smaller riders create significantly less drag. It’s something that could be implemented to help make it more fair.

Newsletter

Leave this field empty if you're human:

About this Site

Zwift Insider is independent of Zwift corporate (www.zwift.com), although Zwift corporate now provides funding to help defray site costs. This community-driven site is maintained by Eric Schlange and a team of Zwift enthusiasts.

This site contains affiliate links to Amazon, Wahoo, and other brands. Zwift Insider makes a small commission on qualifying purchases made from these links, so please shop through them to support our efforts.