
The Wearables That Actually Predict Your Longevity (And the ones Selling You Theater)
CES 2026 just happened. Let's talk about what you actually need to measure.
The floor of that convention center was stacked with devices promising to "revolutionize your training," "decode your stress," and "unlock your optimization potential." Marketing language aside, I watched the announcement stream with one question running in the background: Which of these things correlates to whether I'm alive in twenty years?
The answer is: fewer than three metrics. And most of the devices measuring them are doing it wrong.
I say this as someone who spent two years obsessively monitoring his own recovery data after a catastrophic L5-S1 disc herniation at 39. Flat on my back, then slowly rebuilding. I needed data that actually meant something—not steps, not calorie burn, not a proprietary "Body Battery" score with no published methodology. I needed signal. Everything else was noise that made me feel productive while changing nothing.
Here's what I learned, and what the longevity literature says.
The Vanity Metrics Trap
Step count is the original sin of consumer wearables.
Not because steps are useless—they're not—but because they create an illusion of precision that doesn't actually map to longevity outcomes. A device showing you hit 10,347 steps feels objective. It feels like data. But the research on step count and all-cause mortality is significantly weaker than the research on cardiorespiratory fitness, heart rate variability, and resting heart rate. You can hit your step goal every day and still have the cardiovascular profile of someone twenty years older if your aerobic capacity is in the basement.
Same problem with calorie burn. Consumer wearables have error rates on energy expenditure that range from 27% to over 90% depending on the study and activity. The Stanford validation work (Shcherbina et al., 2017, JPLM) found that even the most accurate devices in their sample were off by 27% on calorie estimates—and the worst exceeded 90% error. That's not rounding error. That's noise masquerading as signal. Worse, the error skews in one direction: devices tend to over-count, which means your 500-calorie-deficit day might actually be 200 calories, or zero.
Then there are the proprietary stress scores—Garmin's Body Battery, Fitbit's Daily Readiness, WHOOP's Recovery Score. I'm not saying these are worthless, but they're black boxes. No two devices agree on "your stress." There's no published, peer-reviewed methodology you can audit. You're trusting an algorithm you can't inspect to tell you how recovered you are. For some people, that's fine. For people who actually want to use wearable data to make training decisions, it's a problem.
The question isn't whether these numbers feel good. It's whether they predict anything. And for the 40+ cohort specifically, the longevity literature has been pretty clear about what actually predicts all-cause mortality risk.
The Three Metrics That Actually Matter
1. Heart Rate Variability (HRV)
HRV measures the variation in time between successive heartbeats. Higher variability—counterintuitively—means your autonomic nervous system is functioning well. Lower HRV is consistently associated with higher all-cause mortality risk in longitudinal research, including analyses of Framingham Heart Study cohort data (Tsuji et al., Circulation, 1994/1996).
The mechanism makes sense from a systems perspective: HRV is a proxy for parasympathetic nervous system function, which is your "rest and repair" mode. When that system is chronically suppressed—from overtraining, poor sleep, high psychosocial stress, or simply aging without intervention—your body loses the capacity to recover efficiently. Over time, that shows up in cardiovascular outcomes.
The practical implication for training: HRV is one of the few real-time signals that tells you whether yesterday's session actually helped you or just depleted you. During my L5-S1 rehab, tracking HRV was the only thing that kept me from making the classic mistake of training through systemic fatigue. My HRV would drop 15-20% on days when my body was prioritizing spinal repair. Those were days to walk, not deadlift.
2. Resting Heart Rate (RHR)
RHR is cardiovascular efficiency measured in one number. The lower it is (within reason—below 40 bpm in non-athletes is worth investigating), the more efficiently your heart is pumping blood per beat.
Multiple large longitudinal studies—including data from the Framingham cohort and European cardiovascular research—have identified elevated resting heart rate as an independent predictor of cardiovascular events, even in apparently healthy populations. This isn't new data; cardiologists have used RHR as a clinical marker for decades. But most people don't track it longitudinally, which is where the value actually lives. A RHR trend moving up by 5-8 bpm over three months means something is wrong. It could be overtraining, poor sleep quality, unmanaged stress, or early-stage illness. It's a signal you shouldn't miss.
What's interesting about RHR is that it's one of the few metrics that wearables actually measure reasonably well—because it's taken at rest, optical heart rate sensors perform much better than they do during dynamic exercise.
3. VO2 Max
VO2 max is the maximum rate at which your body can consume oxygen during exercise. It's the gold-standard measure of aerobic fitness. And the decades of longitudinal data accumulated through Cooper Institute research—including Blair et al.'s landmark 1989 JAMA study and subsequent follow-up work—make for uncomfortable reading.
Being in the lowest quintile for cardiorespiratory fitness roughly doubles your mortality risk compared to being in the next quintile up. Not compared to elite athletes—compared to being merely average. Moving from "low" to "below average" in aerobic fitness is one of the highest-return longevity interventions available to sedentary adults over 40. More impactful, per that data, than most supplements or dietary protocols.
Your smartwatch manufacturer doesn't lead with this research because the actionable conclusion is train harder cardiovascularly—not buy a new device. The research points to Zone 2 runs and VO2 max intervals. It doesn't require a $400 ring.
VO2 max can't be directly measured by a wrist device—that requires a lab with a metabolic cart. What Garmin, Apple, and Polar do is estimate VO2 max from heart rate response to exercise, using algorithms validated against lab data. These estimates are imprecise (±10-15% is typical for validated devices), but directionally useful for tracking change over time.
The Accuracy Problem Nobody Wants to Talk About
Here's the engineering problem with most consumer wearables: they're built for the walking-to-brunch use case, not the training use case.
Optical heart rate sensors—the green-light sensors on the underside of every wrist wearable—work by shining light into your capillaries and measuring blood volume changes. This works reasonably well at rest or during steady-state cardio. It falls apart during strength training, high-intensity intervals, cold exposure, and anything involving wrist movement that isn't rhythmically correlated with your heart rate.
The error rates during high-intensity work are significant. Multiple published validation studies—including research by Gillinov et al. (J Am Heart Assoc, 2017) and others—have documented optical HR errors of 15-25 beats per minute or more during lifting or sprint intervals. That's not a rounding error—that's a completely different heart rate zone. If you're using wrist HR data to determine training intensity during a strength session, you're making decisions based on noise.
The HRV accuracy problem is even worse. Every device calculates HRV differently. WHOOP uses a proprietary algorithm. Garmin uses a different one. Oura Ring uses another. There is no standardized methodology, and when researchers have compared devices measuring HRV simultaneously on the same person, the numbers frequently don't align. Your Garmin's HRV number is not comparable to your friend's Oura number. Your Oura number from last year is not necessarily comparable to this year's if they updated firmware.
This doesn't make HRV useless. It makes absolute HRV numbers useless. What's useful is your personal trend—which is why establishing a baseline matters more than the number itself.
Wearing the device wrong kills your data completely. Too loose a band = poor optical contact = garbage readings. Wrong wrist position during HRV measurement = artifact-contaminated data. Most people have no idea their nightly readings are corrupted by this.
What the Research Actually Says
I keep pointing to specific studies because I think it's important that we talk about real data rather than device marketing:
Framingham Heart Study cohort (HRV): Analyses of this long-running cohort—including Tsuji et al. in Circulation—linked reduced HRV to higher all-cause mortality risk. The mechanism: autonomic dysregulation is both a marker and a driver of accelerated cardiovascular aging. This association has been replicated across multiple independent cohorts.
RHR and cardiovascular risk: The relationship between elevated resting heart rate and cardiovascular event risk is one of the more consistent findings in cardiology literature, documented across multiple large population studies. It's predictive independent of traditional risk factors in apparently healthy populations—not just in people with diagnosed heart disease.
Cooper Institute longitudinal research (VO2 Max): Starting with Blair et al.'s landmark 1989 JAMA paper and spanning subsequent decades of follow-up work, this body of research established the fitness-mortality dose-response curve as one of the most robust findings in exercise science. The uncomfortable conclusion: moving from the lowest fitness category to merely average aerobic fitness may be more impactful on longevity than most other interventions people obsess over.
Why don't wearable manufacturers lead with this research in their marketing? Because the actionable conclusion is train cardiovascularly—not buy a new device. The research doesn't require a $400 ring.
How to Actually Use This: The Engineer's Protocol
Wearables are useful as a dashboard for your training system—not as the training system itself. The device doesn't improve your fitness. It gives you data that, if you know how to read it, can help you make better decisions.
Here's how I structure it:
Establish your personal baseline first. Spend the first two weeks doing nothing but measuring. Same time every day—before coffee, before getting out of bed. Record your HRV and RHR. This gives you your numbers. Absolute values matter less than your personal trend from baseline.
Daily HRV: pre-coffee, pre-movement, supine. The measurement protocol matters as much as the device. Follow your device's app specification exactly. Three to five minutes of quiet breathing. Do this the same way every day or your data is comparing apples to filing cabinets.
RHR: first reading of the morning. Most wearables track this automatically through the night. Review the morning number before you start moving.
VO2 Max: quarterly reassessment. This changes slowly. Checking it weekly is noise. Checking quarterly tells you whether your aerobic training is actually working. Run or bike at a consistent effort with your device tracking HR, and let the algorithm estimate. The trend is what you're after.
When to actually pay attention:
- HRV drops more than 10% below your baseline for three or more consecutive days: red flag. Means your system is under systemic stress—overtraining, illness onset, sleep disruption, or significant life stress. Reduce training intensity, prioritize sleep, don't add volume.
- RHR climbs 5-8 bpm above your established baseline for a week or more: same red flag, different signal. Something is wrong. Investigate before you train through it.
- VO2 max trend is flat or declining over two quarterly assessments: your aerobic training stimulus is insufficient. Add Zone 2 volume or VO2 max intervals.
These are the signals worth responding to. Everything else—your daily stress score, your sleep stage breakdown, your "body battery"—is interesting at best, noise at worst.
Spring 2026 Buyer's Guide: The Short Version
Garmin (Fenix 8, Epix lineup): Best for athletes who want sport-specific tracking, GPS accuracy, and breadth of metrics. HRV status tracking is solid. VO2 max estimation methodology is well-documented. Weakness: the wrist HR accuracy problems during high-intensity work affect all optical sensors equally.
Oura Ring Gen 4: Best for sleep and recovery-focused tracking. The ring form factor solves the wrist-movement artifact problem during sleep, making nighttime HRV data more reliable than most wrist devices can achieve. Weakness: no display, limited real-time training feedback, ongoing subscription cost.
WHOOP 5.0: Best for recovery-focused athletes who don't need GPS. The 24/7 monitoring approach and the recovery/strain framework are coherent. Weakness: proprietary HRV algorithm, subscription-only model, and the "Recovery Score" is a black box.
Apple Watch (current lineup): Best for Apple ecosystem integration and general-purpose health monitoring. The ECG feature and irregular rhythm notifications are genuinely useful for cardiovascular surveillance. Weakness: optical HR accuracy during strength training and HIIT is a known limitation shared across wrist-based devices—if your primary training is strength-focused, treat the workout heart rate data accordingly.
What you don't need: Anything that leads with a proprietary "stress score," "body battery," or "optimization index" with no published methodology. You're buying theater.
Setup that actually matters:
- Wear the device one finger-width above your wrist bone—not on the wrist bone.
- Tight enough that you can't spin it freely; loose enough that circulation isn't compromised.
- Update firmware before you establish baseline. Different firmware = different algorithms = incomparable numbers.
- Follow your device's guidance on dominant vs. non-dominant wrist. Most are calibrated for non-dominant.
The Actual Bottom Line
The peer-reviewed longevity literature points to three numbers: HRV trend, RHR trend, VO2 max. The devices that track these with at least some methodological accountability are Garmin, Oura, and WHOOP—with significant caveats about optical HR accuracy during high-intensity work, and the impossibility of comparing absolute HRV numbers across manufacturers.
Everything else coming out of CES 2026? It's either incremental improvement on the above, or it's theater designed to create a sense of optimization without the metabolic discomfort that actually drives adaptation.
The uncomfortable truth: a $400 wearable tracking your VO2 max estimate quarterly will tell you less than two 30-minute Zone 2 runs per week would change. The device is a dashboard. The training is the system. Get the order of operations right.
During my L5-S1 rehab, the wearable wasn't what fixed me. It was the data giving me permission to not train on days when my nervous system was clearly prioritizing repair—and evidence that my cardiovascular capacity was slowly, measurably returning. The device served the system. The system was the work.
Use it that way.
Marcus Voss is a certified strength coach and recovering systems architect in Boulder, CO. He's been tracking HRV since before it was a LinkedIn content category.
