Simon McGrath looks at new objective evidence of abnormal response to exercise in ME/CFS patients, and the questions that researchers are still trying to answer …

Exercise testing at Dr. Keller’s lab
Given the doubt, scepticism and even denial of benefits that often confronts ME/CFS patients, it’s not surprising that many patients crave clear-cut, objective evidence of physiological problems in the illness. Preferably something that will explain at least some of the perplexing symptoms. Something that will say: “this is real”.
And finally it seems researchers might be closing in on this.
Two-day maximal exercise tests promise to provide objective evidence for the most striking and unusual feature of the illness, the exhaustion and flare of symptoms following exercise, called post-exertional malaise.
A couple of studies by researchers at Workwell showed distinctive abnormalities on these tests, and now another researcher, Dr. Betsy Keller of Ithaca College, has found them in an independent study.
But, as usual, science is messy, and these studies raise almost as many questions as they answer. While all three studies find substantial abnormalities, key details vary between them.
Furthermore, safety is a big unknown. These tests require patients to push well beyond normal limits twice in two days — exactly the kind of thing patients have found can cause lasting relapses. As yet, there is no hard safety data.
Even so, a little-noticed Social Security ruling in April states that while there is not yet a ‘definitive’ laboratory test, an abnormal exercise stress test, consistent with other evidence in the case record, proves a ‘medically determinable impairment’ in those with ME/CFS. And Keller reports that insurers increasingly accept her tests as evidence of incapacity. The two-day test could help secure a financial lifeline for patients.
Results from the latest exercise study
Inability of myalgic encephalomyelitis/chronic fatigue syndrome patients to reproduce VO2peak indicates functional impairment – Betsy A Keller 2014
Keller used a similar two-day maximal exercise test to Workwell. ME/CFS patients exercised to complete exhaustion on a bicycle while their oxygen consumption, work output (pedal power), and numerous other measures were recorded. Then the 22 patients came back one day later to do it all again.
The most interesting measures were oxygen consumption and work output, a key measure of what people can actually do.
Work and oxygen consumption were measured at two key points: peak (usually just before people stopped from exhaustion), and at the anaerobic threshold. This is the point where normal aerobic (oxygen-burning) metabolism can’t keep up, so the body has to draw more heavily on anaerobic metabolism to make up the shortfall, resulting in accumulation of lactate in the blood.
For healthy people at least, the anaerobic threshold is roughly the point where it’s still just possible to talk normally while exercising. (Strictly speaking, Keller used the “ventilatory threshold,” calculating it from the ratio of oxygen used to carbon dioxide exhaled, but that’s approximately the same as the anaerobic threshold.)
The patients showed an average fall in day-two oxygen consumption of 14% at their peak and 16% at their anaerobic threshold. Work output fell by similar amounts: patients could do less on day two.
There were no controls in this study, but there are well-established studies going back to the 1950s showing that healthy people can reliably reproduce their performance (within 7%) on two-day tests. The far greater drop in ME/CFS patients indicates an underlying physiological defect, says Keller.
In fact, the results are even more striking for individual patients: while some show little to no change, some show very large drops, including one patient dropping 55% on day-two peak oxygen consumption. (It’s worth taking a look at the results diagram, Figure 3A in the paper, to see what’s really going on).
What’s more, those patients who didn’t show a clinically significant fall in peak oxygen consumption instead showed a significant fall in oxygen consumption at anaerobic threshold: everyone showed abnormalities.
All patients showed a clinically significant drop on day-two, either for peak or anaerobic threshold oxygen consumption.
A one-day exercise test wouldn’t be enough to reliably show these patients’ true impairment. Using a standard measure of impairment developed for heart and lung patients based on a single exercise test, 13 of the 24 ME/CFS patients were classified as having little or no impairment. But while heart and lung patients can reproduce their exercise performance on day two, the ME/CFS patients could not.
The difference was big enough that half of the ME/CFS patients in this study would be classified as having lower functional ability on day-two results, and five of the thirteen that didn’t show any impairment on day one did on day two. This just highlights something very unusual going on with ME/CFS patients.
The results were so unusual, in fact, that it raised the possibility of faulty equipment. Though Keller didn’t run a control group, separate testing in her lab during the same period found normal results for healthy people, indicating equipment worked effectively.
Workwell included controls in their studies, and Chris Snell said this was mainly to prove the equipment was working (not to demonstrate that healthy controls can replicate performance on day two).
Science is messy
Perhaps these inconsistent results are not so surprising. A complex disease like ME/CFS is unlikely to yield its secrets so easily, and science is messy, especially in the early days.
A good example of messy findings came in the most important discovery in biology of the twentieth century: the structure of DNA.
X-ray diffraction images of DNA provided key evidence about its structure. Original images of DNA were clear in some parts but blurry in others, making them hard to interpret. This changed when X-ray expert Rosalind Franklin discovered DNA came in two types: A-DNA and B-DNA. (B-DNA simply has more water attached to it.)
The original images turned out to be for a mixture of A and B DNA. Separate images for A and B revealed the extra information needed to solve the structure of DNA.
![The very different x-ray diffraction patterns of A & B DNA. By I.C. Baianu et al. [GFDL (http://www.gnu.org/copyleft/fdl.html) or CC-BY-SA-3.0 (http://creativecommons.org/licenses/by-sa/3.0/)], via Wikimedia Commons](http://phoenixrising.me/wp-content/uploads/abdna.jpg)
The different X-ray diffraction patterns of A and B DNA
Different differences
Unfortunately, when you compare the results of the various studies, as Keller did in her paper, the story becomes considerably murkier.
First, let’s look at oxygen consumption. Keller’s result, in summary, was that every one of her twenty-two patients had a drop in oxygen consumption either at peak or at anaerobic threshold, but not necessarily both.
The first wave-making 2007 Workwell study found drops in both for all six patients. A 2010 Dutch study found drops in both as well — but they were too small to be confident that they were meaningful.
Last year, a larger Workwell study (51 patients, 10 controls) found that the oxygen consumption at anaerobic threshold dropped significantly (by 11%), but the drop in peak oxygen consumption wasn’t big enough to be statistically or clinically significant (only 5%).
However, this last study also found a huge drop of 55% in work output on day two at the anaerobic threshold for patients.
This was especially surprising because the drop in oxygen consumption was far smaller at 11%, meaning that the patients produced much less work for each unit of oxygen. This inefficiency could point to some major metabolic change.
In contrast, Keller’s new study found a less dramatic drop of 21% in work. While that’s a clinically interesting result, it is broadly in line with the 16% drop in oxygen consumption at anaerobic threshold. It doesn’t suggest that the body is producing less work per unit of oxygen, the way the larger Workwell study did.
The Dutch study found a non-significant drop in work output of 7%, while work wasn’t measured in the first Workwell study.
So three of four published studies find substantial drops in day-two performance (the fourth Dutch study was on the margins of finding a difference), but no clear pattern has emerged as to which are the most important measures. Is it peak oxygen consumption, anaerobic threshold oxygen consumption, or work output at anaerobic threshold?
Moving beyond messy science
Researchers have uncovered a very promising area for study. What’s needed to build on these findings and find out what’s really going on is further, more thorough investigation. Keller says we sorely need a much bigger study — and by bigger she means involving hundreds of subjects, which will come with a hefty price tag.
The bigger study will help clarify exactly what changes in response to exercise and for whom. It could even establish thresholds for measures like peak oxygen consumption that could then be used by clinicians in diagnosis. Such guidelines will be crucial if more patients want to take advantage of the test to help with disability claims, as currently only three labs in the United States carry out this work.
Keller is clearly disappointed by the lack of interest in these extraordinary findings from exercise physiologists working outside ME/CFS. She’s not sure why but thinks it may go back to training.
As in so many medical fields, exercise physiology undergraduates learn next to nothing about ME/CFS, while young researchers are deterred by a lack of funding for this area. As things stand, moving the field forward will fall on the shoulders of the existing small group of researchers.
What could be going on physiologically?
Given the mixed findings, it’s hard to know what is driving the problem. The reduced oxygen consumption suggests two possibilities.
Keller thinks that too little blood could be being delivered to the muscles, thus not supplying enough oxygen. There is some indirect evidence to support this, including a modestly reduced heart rate on day two in this study, indicating less blood is being pumped around the body.
Poor autonomic response, says Keller, could be behind reduced blood flow, and there is plenty of research showing autonomic problems in ME/CFS, at least for subgroups.
An alternative explanation is that muscles are simply unable to burn more oxygen, and a study by a Dutch group suggested muscle uptake of oxygen was the problem. The muscle problem in turn could be down to mitochondrial or other energy metabolism defects. Again, it may need careful identification of subgroups before researchers can really get a handle on what’s going wrong.
Is it unique?
We don’t know if these findings are unique to ME/CFS, though it’s certainly unusual. This kind of maximal testing has been used frequently in diseases affecting heart and lungs (where all published results indicate they can reproduce results the second day).
Yet no published data exists on other fatiguing illnesses such as multiple sclerosis, HIV and rheumatoid arthritis. In a pilot study, however, Workwell reported normal reproducibility for one MS and one HIV patient.
The safety question
There will inevitably be concerns from some patients about taking a test that by definition pushes them to their absolute limits, and then does it again twenty-four hours on in the midst of post-exertional malaise.
Pushing beyond normal limits has triggered many a relapse, and sometimes patients never bounce back. But how big are the actual risks for a two-day test?
Keller says that most people in their tests report recovering to baseline in seven to 21 days, though some take longer. She also points out there is a lack of published data on long-term effects even for a one-day test, let alone two-day tests.
What’s needed, she says, is long-term follow-up looking at both symptoms and activity levels, to establish how long it takes patients to recover to baseline symptoms and activity. And she wants to use actometers (activity sensing devices) to ensure objective measurements of patient activity. This would also allow researchers to see if changes in patient activity matched the physiological changes seen on day two.
If these tests do cause extended relapses in some patients, they may never be appropriate to use for all patients in the clinic, even if they fulfil their promise to unravel some of the mechanisms of the disease. And in that case, there’s a cruelty in the new disability ruling. The only way that patients can prove their disability might be by risking making it worse.
The promise of exercise stress tests
To me, these are extremely interesting findings, because they have found substantial effects linked to the central feature of ME/CFS, abnormal response to exertion.
It’s not just these two-day tests either: in Utah, the Lights’ fascinating gene expression studies are particularly interesting as they only uses a single, moderate test to reveal differences between patients and controls. This is closer to real-world exertion by patients, as well as probably carrying less risk of relapse. The Lights’ initial findings have apparently been replicated in a larger NIH-funded study, but have not yet been published.
When researchers take different approaches to the same problem like this, and both find abnormalities, that’s usually a good sign. Clearly this isn’t a done deal yet, but the biological response to exercise is becoming an important new front in helping to unravel ME/CFS.
Thanks to Dr Betsy Keller for her patience in answering numerous questions about her study
Simon McGrath tweets on ME/CFS research:
Shop via one of these links and Phoenix Rising gets up to 8% of your spend at no cost to you!
Phoenix Rising’s US Amazon Store Phoenix Rising’s UK Amazon Store
Phoenix Rising's store with all our big-name affiliates
{ 61 comments… add one }
It's great that you have improved, but I am sceptical about whether any kind of pushing can bring about improvement. I think it's more likely that other interventions are improving physical function, enabling you/one to do more.
I achieved fat loss, muscle gain and improved coordination without any deliberate increase in activity, but through diet and supplements. Once the body is functioning better, it tells you that you can do more. With me I could feel the feedback of my muscles working better.
Improvement isn't consistent, and there are better periods and worse periods. Impossible to tell what is going on without tests.
As I write this I’m in a ‘crashed’ state – PEM, due to reading and paperwork this past weekend.
No prior defined exercise other than walking around the house.
I wonder how the researchers interpolate this and similar phenomena into their testing? I would
indubitably be in worse shape if I attempted their testing regimen. (I know from doing so in past years
that my highs on up days became lower as well as my lows on low days. They never seemed to return
to pre-push levels on each crash – instead my baseline rose, e.g. to 7+ on a scale of 1, best, 10 -worst! I now try ever so much harder not to push too hard on good days (not always successful)!)
As with other PWCs this demise can come on from simply having a friend drop by and engaging in conversation. It comes on as well pursuant to anxiety, e.g. a neighbor's dog barking too long, or thinking
too hard (on a problem's resolution, etc.).
Thanks @voner for the reference which I followed and read the abstract. Since I am not a pub med author I
could not gain access to the complete article. However, I got the point and was surprised by the Androgen
correlation, which I guess explains the females' side.
While there perusing I did come across another article in pub med where the full text was offered. While it's 5 years old it does offer credence to the role of HPA activation in PWC's.
Moderate exercise increases expression for sensory, adrenergic, and immune genes in chronic fatigue syndrome patients but not in normal subjects.
Light AR1, White AT, Hughen RW, Light KC.
I am so sorry to hear of your terrible experience, and hope that your health improves.
I have been seriously ill a couple of times from over-exertion, but the exertion had been continued/repeated over a period of time, with no rest in-between. It took a year or so to get back to my previous level of 'ME health'.
We seem to have variable ability to recover from exertion. I've been ill for 19 years but am not significantly worse than I was near the start, despite being 19 years older. I do realise that I probably made things worse at the very start by trying to get fit (e.g. cycling) but since it set in properly I don't feel worse overall. I have had changes in the pattern of symptoms.
So I think I would be safe doing the 2-day CPET, but acknowledge that it could be very dangerous for some, and it must always be completely voluntary.
This figure from the study makes it very clear indeed:
It is pretty likely that either there are distinctly different illnesses under our label of ME, or that, like strokes, the underlying cause is the same but the effects depend on severity and locality. My hesitation in mentioning my response to exercise is exactly due to that. The last thing I want to suggest is that everyone can improve like that: I know only too well that that is not true. But it does seem to be true for some of us, and MeSci's posting seems to support that. Personally I rate myself at about 20% of my normally functioning level (that's a mathematical assessment of ability to carry out mental and physical tasks, rather than the artificial percentages generated by some of the grading questionnaires), but I would prefer to be 20% of me at my most fit. That matches my limits: I can work at an A-level maths paper, producing solutions for about 45 minutes before needing a break, and can amble for a little under an hour. But that doesn't mean I don't hit unexplained bad patches where my performance drops dramatically. And, believe me, I do appreciate how many of you would love to be able to reach my levels of activity, and how many of you are hit hard by post-exertional malaise.
Oh for Aladdin's magic lamp.
I'm not sure what you mean by me 'improving'. I have never improved as a result of exercise – quite the reverse. Was my own message confusing?
I have improved from my worst as a result of dietary changes plus supplements, and probably better pacing.
I would suffer short-term exacerbation from an exercise test, but I don't think I would suffer prolonged damage, based on my past experience. Others probably would.
Sorry MeSci: I need to be clearer. I was referring to the graphs showing two distinct subsets, which seems to suggest that there are at least two types of people with ME, one of which may be able to tolerate and recover from "exercise".
By "improving" I'm not suggesting that my ME is actually being addressed, just that I was in a pretty bad way after struggling on at work for 18 months prior to diagnosis, and I have, very slowly, managed to improve from that low.
I must admit that I find the written word quite tricky at times. This illness has a minefield of terms which can so easily be misunderstood. "Exercise", "improvement", "payback": these have very different meanings to us with ME than to those who are healthy, and I am very much using them from an ME perspective.
Valentjin..
Thanks for that comment. I just copied it as it was interesting. I wouldnt have got my head around what it could mean if it wasnt for your comment.
Simon, that was a great article, the best article Ive seen for ages, written so clearly. Thanks. Im really glad I just noticed it and read it.
@Valentijn & @taniaaust1,
thanks for the discusion, this is a subject I have often pondered.
The Lights also state and show in a table in their paper that the ADRA2A decrease group tended to have orthostatic intolerance (71% of them) and the ADRA2A increase group had much less orthostatic intolerance (24% of them). Their patient numbers are pretty tiny, 34 and the ADRA2A increased group and 14 in the ADRA2A decrease group, so there's no statistical significance, but…
What do you think of this? How easy/hard/expensive is it to get the norepinephrine blood test done?
You must log in to post a comment. Log in now.