Do chiropractors even know the difference between promotion and research?

Probably a rhetorical question.

Personally, I have seen them doing so much pseudo-research that I doubt they recognise the real thing, even if they fell over it.

Here is a recent example that stands for many, many more such ‘research’ projects (some of which have been discussed on this blog).

But first a few sentences on the background of this new ‘study’.

The UD chiropractic profession is currently on the ‘opioid over-use bandwagon’ hoping that this move might promote their trade. Most chiropractors have always been against using (any type of) pharmaceutical treatment and advise their patients accordingly. D D Palmer, the founder of chiropractic, was adamant that drugs are to be avoided; he stated for instance that Drugs are delusive; they do not adjust anything. And “as the Founder intended, chiropractic has existed as a drug-free healthcare profession for better than 120 years.” To this day, chiropractors are educated and trained to argue against non-drug treatments and regularly claim that chiropractic is a drug-free alternative to traditional medicine.

Considering this background, this new piece of (pseudo) research is baffling, in my view.

The objective of this investigation was to evaluate the association between utilization of chiropractic services and the use of prescription opioid medications. The authors used a retrospective cohort design to analyse health insurance claims data. The data source was the all payer claims database administered by the State of New Hampshire. The authors chose New Hampshire because health claims data were readily available for research, and in 2015, New Hampshire had the second-highest age-adjusted rate of drug overdose deaths in the United States.

The study population comprised New Hampshire residents aged 18-99 years, enrolled in a health plan, and with at least two clinical office visits within 90 days for a primary diagnosis of low-back pain. The authors excluded subjects with a diagnosis of cancer. They measured likelihood of opioid prescription fill among recipients of services delivered by chiropractors compared with a control group of patients not consulting a chiropractor. They also compared the cohorts with regard to rates of prescription fills for opioids and associated charges.

The adjusted likelihood of filling a prescription for an opioid analgesic was 55% lower among chiropractic compared to non-chiropractic patients. Average charges per person for opioid prescriptions were also significantly lower among the former group.

The authors concluded that among New Hampshire adults with office visits for noncancer low-back pain, the likelihood of filling a prescription for an opioid analgesic was significantly lower for recipients of services delivered by doctors of chiropractic compared with nonrecipients. The underlying cause of this correlation remains unknown, indicating the need for further investigation.

The underlying cause remains unknown???


Let me speculate, or even better, let me extrapolate by drawing an analogy:

Employees by a large Hamburger chain set out to study the association between utilization of Hamburger restaurant services and vegetarianism. The authors used a retrospective cohort design. The study population comprised New Hampshire residents aged 18-99 years, who had entered the premises of a Hamburger restaurant within 90 days for a primary purpose of eating. The authors excluded subjects with a diagnosis of cancer. They measured the likelihood of  vegetarianism among recipients of services delivered by Hamburger restaurants compared with a control group of individuals not using meat-dispensing facilities. They also compared the cohorts with regard to the money spent in Hamburger restaurants.

The adjusted likelihood of being a vegetarian was 55% lower among the experimental group compared to controls. The average money spent per person in Hamburger restaurants were also significantly lower among the Hamburger group.

The authors concluded that among New Hampshire adults visiting Hamburger restaurants, the likelihood of vegetarianism was significantly lower for consumers frequenting Hamburger restaurants compared with those who failed to frequent such places. The underlying cause of this correlation remains unknown, indicating the need for further investigation.



The question whether spinal manipulative therapy (SMT) has any specific therapeutic effects is still open. This fact must irritate ardent chiropractors, and they therefore try everything to dispel our doubts. One way would be to demonstrate a dose-effect relationship between SMT and the clinical outcome. But, for several reasons, this is not an easy task.

This RCT was aimed at identifying the dose-response relationship between visits for SMT and chronic cervicogenic headache (CGH) outcomes; to evaluate the efficacy of SMT by comparison with a light massage control.

The study included 256 adults with chronic CGH. The primary outcome was days with CGH in the prior 4 weeks evaluated at the 12- and 24-week primary endpoints. Secondary outcomes included CGH days at remaining endpoints, pain intensity, disability, perceived improvement, medication use, and patient satisfaction. Participants were randomized to 4 different dose levels of chiropractic SMT: 0, 6, 12, or 18 sessions. They were treated 3 times per week for 6 weeks and received a focused light-massage control at sessions when SMT was not assigned. Linear dose effects and comparisons to the no-manipulation control group were evaluated at 6, 12, 24, 39, and 52 weeks.

A linear dose-response was observed for all follow-ups, a reduction of approximately 1 CGH day/4 weeks per additional 6 SMT visits (p<.05); a maximal effective dose could not be determined. CGH days/4 weeks were reduced from about 16 to 8 for the highest and most effective dose of 18 SMT visits. Mean differences in CGH days/4 weeks between 18 SMT visits and control were -3.3 (p=.004) and -2.9 (p=.017) at the primary endpoints, and similar in magnitude at the remaining endpoints (p<.05). Differences between other SMT doses and control were smaller in magnitude (p > .05). CGH intensity showed no important improvement nor differed by dose. Other secondary outcomes were generally supportive of the primary.

The authors concluded that there was a linear dose-response relationship between SMT visits and days with CGH. For the highest and most effective dose of 18 SMT visits, CGH days were reduced by half, and about 3 more days per month than for the light-massage control.

This trial would make sense, if the effectiveness of SMT for CGH had been a well-documented fact, and if the study had rigorously controlled for placebo-effects.

But guess what?

Neither of these conditions were met.

A recent review concluded that there are few published randomized controlled trials analyzing the effectiveness of spinal manipulation and/or mobilization for TTH, CeH, and M in the last decade. In addition, the methodological quality of these papers is typically low. Clearly, there is a need for high-quality randomized controlled trials assessing the effectiveness of these interventions in these headache disorders. And this is by no means the only article making such statements; similar reviews arrive at similar conclusions. In turn, this means that the effects observed after SMT are not necessarily specific effects due to SMT but could easily be due to placebo or other non-specific effects. In order to avoid confusion, one would need a credible placebo – one that closely mimics SMT – and make sure that patients were ‘blinded’. But ‘light massage’ clearly does not mimic SMT, and patients obviously were aware of which interventions they received.

So, an alternative – and I think at least as plausible – conclusion of the data provided by this new RCT is this:

Chiropractic SMT is associated with a powerful placebo response which, of course, obeys a dose-effect relationship. Thus these findings are in keeping with the notion that SMT is a placebo.

And why would the researchers – who stress that they have no conflicts of interest – mislead us by making this alternative interpretation of their findings not abundantly clear?

I fear, the reason might be simple: they also seem to mislead us about their conflicts of interest: they are mostly chiropractors with a long track record of publishing promotional papers masquerading as research. What, I ask myself, could be a stronger conflict of interest?

(Pity that a high-impact journal like SPINE did not spot these [not so little] flaws)

Lock 10 bright people into a room and tell them they will not be let out until they come up with the silliest idea in healthcare. It is not unlikely, I think, that they might come up with the concept of visceral osteopathy.

In case you wonder what visceral osteopathy (or visceral manipulation) is, one ‘expert’ explains it neatly: Visceral Osteopathy is an expansion of the general principles of osteopathy which includes a special understanding of the organs, blood vessels and nerves of the body (the viscera). Visceral Osteopathy relieves imbalances and restrictions in the interconnections between the motions of all the organs and structures of the body. Jean-Piere Barral RPT, DO built on the principles of Andrew Taylor Still DO and William Garner Sutherland DO, to create this method of detailed assessment and highly specific manipulation. Those who wish to practice Visceral Osteopathy train intensively through a series of post-graduate studies.  The ability to address the specific visceral causes of somatic dysfunction allows the practitioner to address such conditions as gastroesophageal reflux disease (GERD), irritable bowel (IBS), and even infertility caused by mechanical restriction.

But, as I have pointed out many times before, the fact that a treatment is based on erroneous assumptions does not necessarily mean that it does not work. What we need to decide is evidence. And here we are lucky; a recent paper provides just that.

The purpose of this systematic review was to identify and critically appraise the scientific literature concerning the reliability of diagnosis and the clinical efficacy of techniques used in visceral osteopathy.

Only inter-rater reliability studies including at least two raters or the intra-rater reliability studies including at least two assessments by the same rater were included. For efficacy studies, only randomized-controlled-trials (RCT) or crossover studies on unhealthy subjects (any condition, duration and outcome) were included. Risk of bias was determined using a modified version of the quality appraisal tool for studies of diagnostic reliability (QAREL) in reliability studies. For the efficacy studies, the Cochrane risk of bias tool was used to assess their methodological design. Two authors performed data extraction and analysis.

Extensive searches located 8 reliability studies and 6 efficacy trials that could be included in this review. The analysis of reliability studies showed that the diagnostic techniques used in visceral osteopathy are unreliable. Regarding efficacy studies, the least biased study showed no significant difference for the main outcome. The main risks of bias found in the included studies were due to the absence of blinding of the examiners, an unsuitable statistical method or an absence of primary study outcome.

The authors (who by the way declared no conflicts of interest) concluded that the results of the systematic review lead us to conclude that well-conducted and sound evidence on the reliability and the efficacy of techniques in visceral osteopathy is absent.

It is hard not to appreciate the scientific rigor of this review or to agree with the conclusions drawn by the French authors.

But what consequences should we draw from all this?

The authors of this paper state that more and better research is needed. Somehow, I doubt this. Visceral osteopathy is not plausible and the best evidence available to date does not show it works. In my view, this means that we should declare it an obsolete aberration of medical history.

To this, the proponents of visceral osteopathy will probably say that they have tons of experience and have witnessed wonderful cures etc. This I do not doubt; however, the things they saw were not due to the effects of visceral osteopathy, they were due to chance, placebo, regression towards the mean, the natural history of the diseases treated etc., etc. And sometimes, experience is nothing more that the ability to repeat a mistake over and over again.

  • If it looks like a placebo,
  • if it behaves like a placebo,
  • if it tests like a placebo,


And what is wrong with a placebo, if it helps patients?



Sipjeondaebo-tang is an East Asian herbal supplement containing Angelica root (Angelicae Gigantis Radix), the rhizome of Cnidium officinale Makino (Cnidii Rhizoma), Radix Paeoniae, Rehmannia glutinosa root (Rehmanniae Radix Preparata), Ginseng root (Ginseng Radix Alba), Atractylodes lancea root (Atractylodis Rhizoma Alba), the dried sclerotia of Poria cocos (Poria cocos Sclerotium), Licorice root (Glycyrrhizae Radix), Astragalus root (Astragali Radix), and the dried bark of Cinnamomum verum (Cinnamomi Cortex).

But does this herbal mixture actually work? Korean researchers wanted to find out.

The purpose of their study was to examine the feasibility of Sipjeondaebo-tang (Juzen-taiho-to, Shi-Quan-Da-Bu-Tang) for cancer-related anorexia. A total of 32 participants with cancer anorexia were randomized to either Sipjeondaebo-tang group or placebo group. Participants were given 3 g of Sipjeondaebo-tang or placebo 3 times a day for 4 weeks. The primary outcome was a change in the Anorexia/Cachexia Subscale of Functional Assessment of Anorexia/Cachexia Therapy (FAACT). The secondary outcomes included Visual Analogue Scale (VAS) of anorexia, FAACT scale, and laboratory tests.

The results showed that anorexia and quality of life measured by FAACT and VAS were improved after 4 weeks of Sipjeondaebo-tang treatment. However, there was no significant difference between changes of Sipjeondaebo-tang group and placebo group.

From this, the authors of the study concluded that sipjeondaebo-tang appears to have potential benefit for anorexia management in patients with cancer. Further large-scale studies are needed to ensure the efficacy.

Well, isn’t this just great? Faced with a squarely negative result, one simply ignores it and draws a positive conclusion!

As we all know – and as trialists certainly must know – controlled trials are designed to compare the outcomes of two groups. Changes within one of the groups can be caused by several factors unrelated to the therapy and are therefore largely irrelevant. This means that “no significant difference between changes of Sipjeondaebo-tang group and placebo group” indicates that the herbal mixture had no effect. In turn this means that a conclusion stating that “sipjeondaebo-tang appears to have potential benefit for anorexia” is just fraudulent.

This level of scientific misconduct is remarkable, even for the notoriously poor 

I strongly suggest that:

  1. The journal is de-listed from Medline because similarly misleading nonsense has been coming out of this rag for some time.
  2. The paper is withdrawn because it can only mislead vulnerable patients.

Cranio-sacral therapy is firstly implausible, and secondly it lacks evidence of effectiveness (see for instance here, here, here and here). Yet, some researchers are nevertheless not deterred to test it in clinical trials. While this fact alone might be seen as embarrassing, the study below is a particular and personal embarrassment to me, in fact, I am shocked by it and write these lines with considerable regret.

Why? Bear with me, I will explain later.

The purpose of this trial was to evaluate the effectiveness of osteopathic manipulative treatment and osteopathy in the cranial field in temporomandibular disorders. Forty female subjects with temporomandibular disorders lasting at least three months were included. At enrollment, subjects were randomly assigned into two groups: (1) osteopathic manipulative treatment group (n=20) and (2) osteopathy in the cranial field [craniosacral therapy for you and me] group (n=20). Examinations were performed at baseline (E0) and at the end of the last treatment (E1), and consisted of subjective pain intensity with the Visual Analog Scale, Helkimo Index and SF-36 Health Survey. Subjects had five treatments, once a week. 36 subjects completed the study.

Patients in both groups showed significant reduction in Visual Analog Scale score (osteopathic manipulative treatment group: p = 0.001; osteopathy in the cranial field group: p< 0.001), Helkimo Index (osteopathic manipulative treatment group: p = 0.02; osteopathy in the cranial field group: p = 0.003) and a significant improvement in the SF-36 Health Survey – subscale “Bodily Pain” (osteopathic manipulative treatment group: p = 0.04; osteopathy in the cranial field group: p = 0.007) after five treatments (E1). All subjects (n = 36) also showed significant improvements in the above named parameters after five treatments (E1): Visual Analog Scale score (p< 0.001), Helkimo Index (p< 0.001), SF-36 Health Survey – subscale “Bodily Pain” (p = 0.001). The differences between the two groups were not statistically significant for any of the three endpoints.

The authors concluded that both therapeutic modalities had similar clinical results. The findings of this pilot trial support the use of osteopathic manipulative treatment and osteopathy in the cranial field as an effective treatment modality in patients with temporomandibular disorders. The positive results in both treatment groups should encourage further research on osteopathic manipulative treatment and osteopathy in the cranial field and support the importance of an interdisciplinary collaboration in patients with temporomandibular disorders. Implications for rehabilitation Temporomandibular disorders are the second most prevalent musculoskeletal condition with a negative impact on physical and psychological factors. There are a variety of options to treat temporomandibular disorders. This pilot study demonstrates the reduction of pain, the improvement of temporomandibular joint dysfunction and the positive impact on quality of life after osteopathic manipulative treatment and osteopathy in the cranial field. Our findings support the use of osteopathic manipulative treatment and osteopathy in the cranial field and should encourage further research on osteopathic manipulative treatment and osteopathy in the cranial field in patients with temporomandibular disorders. Rehabilitation experts should consider osteopathic manipulative treatment and osteopathy in the cranial field as a beneficial treatment option for temporomandibular disorders.

This study has so many flaws that I don’t know where to begin. Here are some of the more obvious ones:

  • There is, as already mentioned, no rationale for this study. I can see no reason why craniosacral therapy should work for the condition. Without such a rationale, the study should never even have been conceived.
  • Technically,  this RCTs an equivalence study comparing one therapy against another. As such it needs to be much larger to generate a meaningful result and it also would require a different statistical approach.
  • The authors mislabelled their trial a ‘pilot study’. However, a pilot study “is a preliminary small-scale study that researchers conduct in order to help them decide how best to conduct a large-scale research project. Using a pilot study, a researcher can identify or refine a research question, figure out what methods are best for pursuing it, and estimate how much time and resources will be necessary to complete the larger version, among other things.” It is not normally a study suited for evaluating the effectiveness of a therapy.
  • Any trial that compares one therapy of unknown effectiveness to another of unknown effectiveness is a complete and utter nonsense. Equivalent studies can only ever make sense, if one of the two treatments is of proven effectiveness – think of it as a mathematical equation: one equation with two unknowns is unsolvable.
  • Controlled studies such as RCTs are for comparing the outcomes of two or more groups, and only between-group differences are meaningful results of such trials.
  • The ‘positive results’ which the authors mention in their conclusions are meaningless because they are based on such within-group changes and nobody can know what caused them: the natural history of the condition, regression towards the mean, placebo-effects, or other non-specific effects – take your pick.
  • The conclusions are a bonanza of nonsensical platitudes and misleading claims which do not follow from the data.

As regular readers of this blog will doubtlessly have noticed, I have seen plenty of similarly flawed pseudo-research before – so, why does this paper upset me so much? The reason is personal, I am afraid: even though I do not know any of the authors in person, I know their institution more than well. The study comes from the Department of Physical Medicine and Rehabilitation, Medical University of Vienna, Austria. I was head of this department before I left in 1993 to take up the Exeter post. And I had hoped that, even after 25 years, a bit of the spirit, attitude, knowhow, critical thinking and scientific rigor – all of which I tried so hard to implant in my Viennese department at the time – would have survived.

Perhaps I was wrong.

Difficulties breastfeeding?

Some say that Chinese herbal medicine offers a solution.

This Chinese multi-centre RCT included 588 mothers considering breastfeeding. The intervention group received the Chinese herbal mixture Zengru Gao, while the control group received no therapy. The primary outcomes were the percentages of fully and partially breastfeeding mothers, and a secondary outcome was baby’s daily formula intake.

At day 3 and 7 after delivery, significant differences were found in favour of Zengru Gao group on the percentage of full/ partial breastfeeding. At day 7, the percentage of full/ partial breastfeeding of the active group increased to 71.48%/20.70% versus 58.67%/30.26% in the control group, the differences remained significant. No statistically significant differences were detected on primary measures at day. While intake of formula differed between groups at day 1 and 3, this difference did not achieve statistical significance, but this difference was apparent by day 7.

The authors concluded that the Chinese Herbal medicine Zengru Gao enhanced breastfeeding success during one week postpartum. The approach is acceptable to participants and merits further evaluation.

To the naïve observer, this study might look rigorous, but it is a seriously flawed RCT. Here are just some of its most obvious limitations:

  • All we get in the methods section is this explanation: Participants were randomly allocated to the blank control group or the intervention group: Zengru Gao, orally, 30 g a time and 3 times a day. This seems to indicate that the control group got no treatment at all which means there was no blinding nor placebo control. The authors even comment on this point in the discussion section of their paper stating that because we included new mothers who received no treatment as a control group, we were able to prove that the improvement in breastfeeding was not due to the placebo effect. However, this is a totally nonsensical argument.
  • The experimental treatment is not reproducible. The authors state: Zengru Gao, a Chinese herbal formula, which is composed of 8 herbs: Semen Vaccariae, Medulla Tetrapanacis, Radix Rehmanniae Praeparata, Radix Angelicae Sinensis, Radix Paeoniae Alba,Rhizoma Chuanxiong, Herba Leonuri, Radix Trichosanthis. This is not enough information to replicate the study outside China where the mixture is not commercially available.
  • The primary outcome was the percentage of fully, and partially breastfeeding mothers. Breastfeeding was defined as mother’s milk given by direct breast feeding. Full breastfeeding meant that no other types of milk or solids were given. Partially breastfeeding meant that sustained latch with deep rhythmic sucking through the length of the feed, with some pause, on either/ or both breasts. We are not being told how the endpoint was quantified. Presumably women kept diaries. We cannot guess how accurate this process was.
  • As far as I can see, there was no correction for multiple testing for statistical significance. This means that some or all of the significant results might be false-positive.
  • There is insufficient data to show that the herbal mixture is safe for the mothers and the babies. At the very minimum, the researchers should have measured essential safety parameters. This omission is a gross violation of research ethics.
  • Towards the end of the paper, we find the following statement: The authors would like to thank the Research and Development Department of Zhangzhou Pien Tze Huang Pharmaceutical co., Ltd. … The authors declare that they have no competing interests. And the 1st and 3rd authors are “affiliated with” Guangzhou Hipower Pharmaceutical Technology Co., Ltd, Guangzhou, China, i. e. work for the manufacturer of the mixture. This does clearly not make any sense whatsoever.

I have seen too many flawed studies of alternative medicine to be shocked or even surprised by this level of incompetence and nonsense. Yet, I still find it lamentable. But, in my view, the worst is that supposedly peer-reviewed journals such as ‘BMC Complement Altern Med’ publish such overt rubbish.

It would be easy to shrug one’s shoulder and bin the paper. But the effect of such fatally flawed research is too serious for that. In our recent book MORE HARM THAN GOOD? THE MORAL MAZE OF COMPLEMENTARY AND ALTERNATIVE MEDICINE, we discuss that such flawed science amounts to a violation of medical ethics:  CAM journals allocate peer review tasks to a narrow range of CAM enthusiasts who often have been chosen by the authors of the article in question. The raison d’être of CAM journals and CAM researchers is inextricably tied to a belief in CAM, resulting in a self-referential situation which is permissive to the acceptance of weak or flawed reports of clinical effectiveness… Defective research—whether at the design, execution, analysis, or reporting stage—corrupts the repository of reliable medical knowledge. Ultimately, this leads to suboptimal and erroneous treatment decisions…

The authors of this systematic review aimed to summarize the evidence of clinical trials on cupping for athletes. Randomized controlled trials on cupping therapy with no restriction regarding the technique, or co-interventions, were included, if they measured the effects of cupping compared with any other intervention on health and performance outcomes in professionals, semi-professionals, and leisure athletes. Data extraction and risk of bias assessment using the Cochrane Risk of Bias Tool were conducted independently by two pairs of reviewers.

Eleven trials with n = 498 participants from China, the United States, Greece, Iran, and the United Arab Emirates were included, reporting effects on different populations, including soccer, football, and handball players, swimmers, gymnasts, and track and field athletes of both amateur and professional nature. Cupping was applied between 1 and 20 times, in daily or weekly intervals, alone or in combination with, for example, acupuncture. Outcomes varied greatly from symptom intensity, recovery measures, functional measures, serum markers, and experimental outcomes. Cupping was reported as beneficial for perceptions of pain and disability, increased range of motion, and reductions in creatine kinase when compared to mostly untreated control groups. The majority of trials had an unclear or high risk of bias. None of the studies reported safety.

Risk of bias of included trials. “+” indicates low risk of bias, “−” indicates high risk of bias, and “?” indicates unclear risk of bias.

The authors concluded that no explicit recommendation for or against the use of cupping for athletes can be made. More studies are necessary for conclusive judgment on the efficacy and safety of cupping in athletes.

Considering the authors’ stated aim, this conclusion seems odd. Surely, they should have concluded that THERE IS NO CONVINCING EVIDENCE FOR THE USE OF CUPPING IN ATHLETES. But this sounds rather negative, and the JCAM does not seem to tolerate negative conclusions, as discussed repeatedly on this blog.

The discussion section of this paper is bar of any noticeable critical input (for those who don’t know: the aim of any systematic review must be to CRITICALLY EVALUATE THE PRIMARY DATA). The authors even go as far as stating that the trials reported in this systematic review found beneficial effects of cupping in athletes when compared to no intervention. I find this surprising and bordering on scientific misconduct. The RCTs were mostly not on cupping but on cupping in combination with some other treatments. More importantly, they were of such deplorable quality that they allow no conclusions about effectiveness. Lastly, they mostly failed to report on adverse effects which, as I have often stated, is a violation of research ethics.

In essence, all this paper proves is that, if you have rubbish trials, you can produce a rubbish review and publish it in a rubbish journal.

The authors of this review aimed to present an overview of the literature on physicochemical research performed on homeopathic preparations with respect to publication quality and methods used. They searched major scientific databases to find relevant publications from its origin to the end of 2015. Publications were assessed using a scoring scheme, the Manuscript Information Score (MIS). Information regarding country of origin of the research and experimental techniques used was extracted.

The authors identified 183 publications. The rate of publication in the field was 2 per year from the 1970s until 2000. Afterward, it increased to over 5.5 publications per year. The quality of publications was seen to increase sharply from 2000 onward: before 2000, only 12 (13%) publications were rated as ‘‘high quality’’ (MIS ‡7.5); 44 (48%) publications were rated as ‘‘high quality’’ after 2000.

Countries with most publications were Germany (n=42, 23%), France (n=29, 16%), India (n=27, 15%), and Italy (n=26, 14%). Techniques most frequently used were electrical impedance (26%), analytical methods (20%), spectroscopy (20%), and nuclear magnetic resonance (19%).

The authors concluded that physicochemical research into homeopathic preparations is increasing both in terms of quantity and quality of the publications.

They also announce that there will be a further paper on the subject: In part 2, we aim to identify the most interesting experimental techniques. With this, we aim to be in a position to generate meaningful hypotheses regarding a possible mode of action of homeopathic preparations.

It might be relevant to ask who the authors are and where they come from. They are Klein SD, Würtenberger S, Wolf U, Baumgartner S, and Tournier A. And their affiliations are:

  • Institute of Complementary Medicine, University of Bern, Switzerland.
  • Scientific & Regulatory Affairs, Hevert-Arzneimittel GmbH & Co. KG, Germany.
  • Society for Cancer Research, Arlesheim, Switzerland.
  • Institute of Integrative Medicine, University of Witten/Herdecke, Germany.
  • Homeopathy Research Institute (HRI), London, United Kingdom.

In other words, they are without exception proponents of homeopathy, some burdened with considerable conflicts of interest in the subject. Personally, I think it unlikely that anything meaningful will ever come of this research. But unsurprisingly, the enthusiasts beg to differ: on facebook, the HRI claimed that this new systematic review is a major step towards developing clear and testable hypotheses regarding the mode of action of homeopathy.

In a previous blog-post I have tried to explain my reservations in some detail; please allow me to repeat them here:

… homeopaths have been keen to find more rational support for their theories. Thus they have developed several ‘sciency’ concepts to explain the mode of action of their highly diluted homeopathic remedies. For instance they postulated that water can form secondary structures that hold some information of the original substance (stock), even if it has long been diluted out of the remedy. Alternatively, they claimed that the shaking of the remedy generates nano-particles or silicone-particles which, in turn, are the cause of the clinical effects.

Today, I want to assume for a minute, that one of these theories is correct – they cannot all be right, of course. Homeopaths regularly show us investigations that seem to support them, even though it only needs a real expert in the particular field of science to cast serious doubt on them. I will nevertheless assume that, after potentisation, the diluent retains information via nano-particles or some other phenomenon. For the purpose of this mind-experiment, I grant homeopaths that, in this respect, they are correct. In other words, let’s for a moment assume that the ‘memory of water’ theory is correct.

As I have been more than generous, I want homeopaths to return the favour and consider what this would really mean: information has been transferred from the stock to the diluent. Does that prove anything? Does it show that homeopathy is valid?

Could the homeopaths who make this assumption be equally generous and answer the following questions, please?

  1. How does a nano-particle of coffee, for instance, affect the sleep centre in the brain to make the patient sleep? Or how does a nano-particle of the Berlin Wall or a duck liver affect anything at all in the human body? The claim that information has been retained by the diluent is no where near to an explanation of a rational mode of action, isn’t it?
  2. Most homeopathic remedies are consumed not as liquids but as ‘globuli’, i. e.  tiny little pills made of lactose. They are prepared by dropping the liquid remedy on to them. The liquid subsequently evaporates. How is it that the information retained in the liquid does not evaporate with the diluent?
  3. The diluent usually is a water-alcohol mixture which inevitably contains impurities. In fact, a liquid C12 remedy most certainly contains dimensions more impurities than stock. These impurities have, of course, also been vigorously shaken, i. e. potentised. How can we explain that their ‘potency’ has not been beefed up at each dilution step? Would this not necessitate a process where only some molecules in the diluent are agitated, while all the rest remain absolutely still? How can we explain this fantastic concept?
  4. Some stock used in homeopathy is insoluble (for instance Berlin Wall). Such stock is not diluted but its concentration in the remedy is initially lowered by a process called ‘trituration’, a process which consists in grinding the source material in another solid material, usually lactose. I have granted you that potentisation works in the way you think. But how is information transferred from one solid material to another?
  5. Everything we drink is based on water containing molecules that have been inadvertently potentised in nature a million times and therefore should have hugely powerful effects on our bodies. How is it that we experience none of these effects each time we drink?

Now, homeopaths, let me propose a deal.

If you can answer these questions satisfactorily, I will no longer doubt your memory of water theory. If you cannot do this, I think you ought to admit that all your ‘sciency’ theories about the mode of action of highly diluted homeopathic remedies are really quite silly – more silly even than Hahnemann’s idea of a ‘spirit-like’ effect.





Traditional Chinese Medicine (TCM) is popular, not least because it is heavily marketed and thus often perceived as natural and safe. But is this assumption true?

This study analysed  liver tests before and following treatment with herbal Traditional Chinese Medicine (TCM) in order to evaluate the risk of liver injury. Patients with normal values of alanine aminotransferase (ALT) as a diagnostic marker for ruling out pre-existing liver disease were enrolled in a prospective study of a safety program carried out at the First German Hospital of TCM from 1994 to 2015. All patients received herbal products, and their ALT values were reassessed 1-3 d prior to discharge. To evaluate causality for suspected TCM herbs, the Roussel Uclaf Causality Assessment Method (RUCAM) was used.

The report presents data of 21470 patients. ALT ranged from 1 × to < 5 × upper limit normal (ULN) in 844 patients (3.93%) and suggested mild or moderate liver adaptive abnormalities. A total of 26 patients (0.12%) experienced higher ALT values of ≥ 5 × ULN (300.0 ± 172.9 U/L, mean ± SD). Causality for TCM herbs was estimated to be probable in 8/26 patients, possible in 16/26, and excluded in 2/26 cases.

Compared with the large TCM study cohort, patients in the liver injury study cohort were older and contained a higher percentage of women, whereas the duration of the hospital stay was similar in both cohorts. The TCM herbs were rarely applied mostly as mixtures consisting of several herbs adding up to 35 different drugs during the patients’ four-week stay. The daily dosage was 95 ± 30 g and thus slightly higher than in the TCM study cohort. Among the many herbal TCM used by the 26 patients in the liver injury cohort, Bupleuri radix and Scuterllariae radix were the two TCM herbs most frequently implicated in liver injury, with variable RUCAM-based causality gradings. Most of the patients received one to six TCM drugs that were associated with potential liver injury as evidenced from the scientific literature, e.g., one patient (case 8) received six potentially hepatotoxic herbal TCM drugs during their hospital stay.

The authors concluded that in 26 (0.12%) of 21470 patients treated with herbal TCM, liver injury with ALT values of ≥ 5 × ULN was found, which normalized shortly following treatment cessation, also substantiating causality.

In the discussion section of the paper, the authors comment that the use of TCM is widely considered less risky as compared with synthetic drugs, although data on direct comparisons are not available in support of this view. Populations using herbal TCM, drugs, either alone, or combined experience more drug-induced liver injury (DILI) than herb-induced liver injury (HILI), possibly due to a higher use of drugs. Valid data of incidence and prevalence of HILI caused by TCM herbs are lacking, and respective data cannot be derived from the present study.

This study is most valuable, in my view. Its strength is clearly the huge sample size. Top marks for the authors for publishing it!

Having said that, we need to take the incidence figures with a pinch of salt, I think. In reality they could be much higher because:

  • other settings will not be as tightly supervised as the unusual hospital setting;
  • in most other situations the quality of the Chinese herbs might be less controlled;
  • there could be adulteration;
  • there could be contamination.

The ‘elephant in the room’ obviously is the inevitable question about benefit. Like any other treatment, TCM cannot be judged on the basis of its risk but must be evaluated according to its risk/benefit balance. I realise that this was not the subject of the present study, but it is nevertheless crucial: do the benefits of TCM outweigh its risks?

I am not aware that this is the case (but more than willing to consider any sound evidence readers might supply). More importantly, I am not aware of good evidence to show that, for any condition, TCM would be superior in terms of risk/benefit balance than conventional options. This is not a trivial issue: clinicians have the ethical obligation to apply the best (the one with the most positive risk/benefit balance) treatment to their patients.

If I am right, then TCM should not be used in therapeutic routine in or outside hospitals.

If I am right, the ‘First German Hospital of TCM‘ should close asap; it would be violating fundamental ethical principles.

If I am right, the debate about the risks of TCM is almost irrelevant because we simply should not use it.

Or did I misunderstand something here?

What do you think?


The aim of this three-armed, parallel, randomized exploratory study was to determine, if two types of acupuncture (auricular acupuncture [AA] and traditional Chinese acupuncture [TCA]) were feasible and more effective than usual care (UC) alone for TBI–related headache. The subjects were previously deployed Service members (18–69 years old) with mild-to-moderate TBI and headaches. The interventions explored were UC alone or with the addition of AA or TCA. The primary outcome was the Headache Impact Test (HIT). Secondary outcomes were the Numerical Rating Scale (NRS), Pittsburgh Sleep Quality Index, Post-Traumatic Stress Checklist, Symptom Checklist-90-R, Medical Outcome Study Quality of Life (QoL), Beck Depression Inventory, State-Trait Anxiety Inventory, the Automated Neuropsychological Assessment Metrics, and expectancy of outcome and acupuncture efficacy.

Mean HIT scores decreased in the AA and TCA groups but increased slightly in the UC-only group from baseline to week 6 [AA, −10.2% (−6.4 points); TCA, −4.6% (−2.9 points); UC, +0.8% (+0.6 points)]. Both acupuncture groups had sizable decreases in NRS (Pain Best), compared to UC (TCA versus UC: P = 0.0008, d = 1.70; AA versus UC: P = 0.0127, d = 1.6). No statistically significant results were found for any other secondary outcome measures.

The authors concluded that both AA and TCA improved headache-related QoL more than UC did in Service members with TBI.

The stated aim of this study (to determine whether AA or TCA both with UC are more effective than UC alone) does not make sense and should therefore never have passed ethics review, in my view. The RCT followed a design which essentially is the much-lamented ‘A+B versus B’ protocol (except that a further groups ‘C+B’ was added). The nature of such designs is that there is no control for placebo effects, the extra time and attention, etc. Therefore, such studies cannot fail but generate positive results, even if the tested intervention is a placebo. In such trials, it is impossible to attribute any outcome to the experimental treatment. This means that the positive results are known before the first patient has been enrolled; hence they are an unethical waste of resources which can only serve one purpose: to mislead us. It also means that the conclusions drawn above are not correct.

An alternative and in my view more accurate conclusion would be this one: both AA and TCA had probably no effect; the improved headache-related QoL was due to the additional attention and expectation in the two experimental groups and is unrelated to the interventions tested in this study.

In our new book, MORE HARM THAN GOOD, we discuss that such trials are deceptive to the point of being unethical. Considering the prominence and experience of Wayne Jonas, the 1st author of this paper, such obvious transgression is more than a little disappointing – I would argue that is amounts to overt scientific misconduct.

Recent Comments

Note that comments can be edited for up to five minutes after they are first submitted.

Click here for a comprehensive list of recent comments.