VO2Max is a powerful metric of longevity: every 1 ml/kg/min increase in VO2 max reduces risk of death by 9 percent.
Clinical testing of VO2Max typically uses a respiratory mask during intense exercise, such as a treadmill or cycling. But wearables like Apple Watch estimate cardio fitness (their version of VO2 max) to within an accuracy of 1.2 ml/kg/min without requiring a mask.
How do they do that? This article explains the science behind VO2 max, how the Apple Watch’s cardio fitness algorithm works (based on published details from 2021 and 2023 papers), and concrete data from the study used to validate its accuracy. Along the way we’ll cover how to combine physiological ODEs (ordinary differential equations) with flexible neural networks, 9-panel plots used in VO2 max testing, and more.
For details on how to actually use VO2Max, its link to longevity, and how to get a cardio fitness measurement, see our guide to Apple Watch’s cardio fitness metric.
Primer: how VO2Max is measured in the lab
Before we get into how Apple Watch algorithm estimates VO2Max, we have to first define how VO2Max is actually measured in the lab.
VO₂ Max is defined as the maximum volume of O₂ (oxygen) the body can utilize during intense exercise. It’s measured during a cardiopulmonary exercise test (CPET). During a CPET, you wear a respiratory mask while exercising (e.g., cycling or treadmill) at a steadily increasing intensity. As the work intensity rises, you consume more O₂ (oxygen), expell more CO₂ (carbon dioxide), and your heart rate rises.
At some point, the intensity of the workout becomes high enough that your muscles need more oxygen than your heart and lungs can supply. Your muscle cells are then forced to switch from aerobic to anerobic energy production, so that CO₂ continues increasing while O₂ pleateaus. At least, this is the “idealized” test situation—in practice, O₂ may not actually reach a plateau and the test may end when the the ratio of CO₂ to O₂ reaches 1.1, when the patient feels breathless, when an ECG shows signs of stress, etc.
This whole process produces a nine-panel plot, which shows various combinations of O₂ consumed, CO₂ expelled, intensity of exercise (in watts), heart rate, breathing rate, blood oxygen saturation, and so on.
In panel 3, you can see how both O₂ (red) and CO₂ (blue) increase as the exercise intensity ramps up (black). There’s a point around 11 minutes where the two lines diverge (this is the anaerobic threshold). Panel 5 shows that heart rate increases roughly in proportion to O₂.
Cardio fitness: Apple Watch’s maskless approximation of VO2Max
If you wanted to estimate VO2Max without actually measuring O₂ and CO₂, how would you do it? You can make use of two facts:
- Heart rate and O₂ are proportionate: as your heart rate increases, you’re consuming more oxygen.
- You can estimate work (in watts) if you know the person’s pace and body weight.
Various methods have been proposed in the medical literature, and as we’ll see, Apple’s algorithm appears to be a more sophisticated version of those using neural networks combined with biophysical ordinary differential equations (ODEs).
The Rockport cardio fitness test: a simple method from 1987
The Rockport fitness walking test was proposed in 1987 to estimate VO2Max. Subjects performed multiple 1-mile walk tests as fast as possible. The heart rate and completion time were placed into a linear regression, which yielded this formula to calculate VO2Max based on each subject’s weight, age, sex, fastest time, and heart rate:
VO2max = 6.9652 + (0.0091weight) - (0.0257age) + (0.5955sex) - (0.2240time) - (0.0115*heart_rate)
The accuracy was good (r=0.92 in the initial study). As we’ll see in the next section, Apple appears to be using a similar scientific basis but with a more sophisticated algorithm.
Apple Watch estimates cardio fitness using heart rate and GPS as inputs
Apple Watch requires two things to produce a cardio fitness estimate:
- Outdoor walking/running/hiking workout on flat ground (5% grade or less).
- Heart rate needs to increase more than 30% of the way between your resting heart rate and max heart rate. For example, if your resting heart rate is 60bpm and your max heart rate is 180bpm, you’d need to reach a heart rate of 96bpm or above to get a VO2Max estimate.
These suggest they’re estimating both work and heart rate response to exercise—similar in its basic science to the Rockport test.
Estimating VO2Max with physiological ODEs and neural networks
Note: some background in neural networks is helpful to understand the following section. If you don’t want the details, feel free to just skip to the next section.
In 2023, researchers from Apple published a paper on using biophysical ODEs for fitness estimation using wearables, alongside an open source repo with some Torch neural networks. This gives a climpse into how modern deep learning algorithms can be used to improve upon the basic science of the Rockport test.
The paper starts by defining heart rate and oxygen response to exercise using a set of ordinary differential equations (ODEs):
Here, I(t) is exercise intensity at time t, f(I(t)) is the instantaneous demand for oxygen, D(t) is oxygen demand at time t, and HR(t) is heart rate at time t. A, B, alpha, beta are learned parameters.
The first equation tells us how the oxygen demand changes over time to match the instantaneous demand from the intensity of the workout. The second equation tells us how drive heart rate toward the pace required to meet that oxygen demand. It includes factors that describe how fast the heart can adapt (A), how difficult it is to reach the maximum heart rate (beta), and how much time it takes to rest down to the resting heart rate (alpha).
Each person’s fitness at a given time is characterized by seven physiological parameters: A, B, alpha, beta, f, HRmin, and HRmax.
Rather than estimate these parameters directly, the researchers trained a neural network that predicts a person’s entire heart rate response to exercise given a latent vector z. The paper doesn’t specify z’s dimension, but in the open source code z is a 32-dimensional vector. They thain trained additional neural networks to predict each physiological parameter as a function of that latent fector: A(z), B(z), alpha(z), and so on.
These physiological parameters were then fed into a linear regression to predict VO2 Max. The error was about 5% (not bad). What’s interesting is that the model is interpretable as a result of going through the physiological ODE. For example, you can see an obvious correlation between the predicted A, B, alpha, and beta and VO2 Max in the plots below:
Apple Watch’s cardio fitness algorithm was validated in a study with 221 participants
Apple itself ran a study validating the algorithm, with 534 participants for development and 221 for validation. The algorithm was surprisingly accurate, with a mean error of 1.2 to 1.4 ml/kg/min and a reliability coefficient (ICC) around 0.86–0.89.
You can see a plot of actual VO2Max (x axis) vs the Apple Watch’s estimate (y axis). It’s pretty good!
However, the algorithm does have limitations for people with certain health conditions and on certain medications. From Apple’s own report:
Other medical conditions can also decrease the accuracy of VO2 max estimates on Apple Watch. These include medical conditions or devices that decouple heart rate from movement or exercise (for example, pain, arrhythmias, pacemakers, or cardiac-assist devices); medical conditions that severely limit exercise tolerance, preventing patients from reaching heart rates close to their predicted maximum heart rate (for example, peripheral arterial disease); and medical conditions that significantly increase the difficulty of ambulation, such as skeletal or neuromuscular conditions causing gait inefficiency (for example, multiple sclerosis or cerebral palsy).
What to do if you get a low cardio fitness notification
VO2Max is most commonly improved through zone 5 (high intensity) training. If you want to improve your VO2Max, download Empirical Health for Apple Watch or check out our ultimate guide to Apple Watch health metrics.