Scolaris Content Display Scolaris Content Display

Acupuncture and acupressure for premenstrual syndrome

Collapse all Expand all

Abstract

Background

Acupuncture has a history of traditional use in China for women's health conditions including premenstrual syndrome (PMS), but its effectiveness for this condition remains unclear. This review examined the available evidence supporting the use of acupuncture or acupressure to treat PMS.

Objectives

To evaluate the effectiveness and safety of acupuncture or acupressure for women with PMS or premenstrual dysphoric disorder (PMDD).

Search methods

We searched the Cochrane Gynaecology and Fertility Specialised Register, Cochrane Central Register of Studies Online (CENTRAL CRSO), MEDLINE, Embase, AMED, PsycINFO, CINAHL (from inception to 21 September 2017), two clinical trial databases (from their inception to 21 September 2017), and four electronic databases in China (from their inception to 15 October 2017): Chinese Biomedical Literature database (CBM), China National Knowledge Infrastructure (CNKI), VIP information/ Chinese Scientific Journals database and WANFANG. Reference lists from included articles were handsearched.

Selection criteria

We included studies if they randomised women with PMS and associated disorders (PMDD and late luteal phase dysphoric disorder/LPDD) to receive acupuncture or acupressure versus sham, usual care/waiting‐list control or pharmaceutical interventions mentioned by the International Society for Premenstrual Disorders (ISPMD). If acupuncture or acupressure were combined with another therapy, these studies were also included where the additional therapy was the same in both groups. Cross‐over studies were eligible for inclusion, but only data from the first phase could be used.

Data collection and analysis

Two review authors independently selected the studies, assessed eligible studies for risk of bias, and extracted data from each study. Study authors were contacted for missing information. The quality of the evidence was assessed using GRADE. Our primary outcomes were overall premenstrual symptoms and adverse events. Secondary outcomes included specific PMS symptoms, response rate and quality of life.

Main results

Five trials (277 women) were included in this review. No trials compared acupuncture or acupressure versus other active treatments. The number of treatment sessions ranged from seven to 28. The quality of the evidence ranged from low to very low quality, the main limitations being imprecision due to small sample sizes and risk of bias related to detection bias and selective reporting.

Acupuncture versus sham acupuncture

Acupuncture may provide a greater reduction in mood‐related PMS symptoms (mean difference (MD) ‐9.03, 95% confidence interval (CI) ‐10.71 to ‐7.35, one randomised controlled trial (RCT), n = 67, low‐quality evidence) and in physical PMS symptoms (MD ‐9.11, 95% CI ‐10.82 to ‐7.40, one RCT, n = 67, low‐quality evidence) than sham acupuncture, as measured by the Daily Record of Severity of Problems scale (DRSP). The evidence suggests that if women have a mood score of 51.91 points with sham acupuncture, their score with acupuncture would be between 10.71 and 7.35 points lower and if women have a physical score of 46.11 points, their score with acupuncture would be between 10.82 and 7.4 points lower.There was insufficient evidence to determine whether there was any difference between the groups in the rate of adverse events (risk ratio (RR) 1.74, 95% CI 0.39 to 7.76, three RCTs, n = 167, I2 = 0%, very low‐quality evidence).

Specific PMS symptoms were not reported

There may be little or no difference between the groups in response rates. Use of a fixed‐effect model suggested a higher response rate in the acupuncture group than in the sham group (RR 2.59, 95% CI 1.71 to 3.92; participants = 100; studies = 2; I2 = 82%), but owing to the high heterogeneity we tested the effect of using a random‐effects model, which provided no clear evidence of benefit for acupuncture (RR 4.22, 95% CI 0.45 to 39.88, two RCTs, n = 100, I2 = 82%, very low‐quality evidence).

Acupuncture may improve quality of life (measured by the WHOQOL‐BREF) compared to sham (MD 2.85, 95% CI 1.47 to 4.23, one RCT, n = 67, low‐quality evidence).

Acupuncture versus no treatment

Due to the very low quality of the evidence, we are uncertain whether acupuncture reduces PMS symptoms compared to a no treatment control (MD ‐13.60, 95% CI ‐15.70 to ‐11.50, one RCT, n = 14).

No adverse events were reported in either group.

No data were available on specific PMS symptoms, response rate or quality of life outcomes.

Acupressure versus sham acupressure

We found low‐quality evidence that acupressure may reduce the number of women with moderate to severe PMS symptoms at the end of the trial compared to sham acupressure (RR 0.64 95% CI 0.52 to 0.79, one RCT, n = 90, low‐quality evidence). The evidence suggests that if 97 women out of 100 in the sham acupressure group had moderate to severe PMS symptoms, the number of women in the acupressure group with moderate to severe symptoms would be 50 to 76 women.

Acupressure may improve both physical (MD 24.3, 95% CI 17.18 to 31.42, one RCT, n = 90, low‐quality evidence) and mental (MD 17.17, 95% CI 13.08 to 21.26, one RCT, n = 90, low‐quality evidence) quality of life.

No data were available on adverse events, specific symptoms or response rates.

Authors' conclusions

The limited evidence available suggests that acupuncture and acupressure may improve both physical and psychological symptoms of PMS when compared to a sham control. There was insufficient evidence to determine whether there was a difference between the groups in rates of adverse events.There is no evidence comparing acupuncture or acupressure versus current ISPMD recommended treatments for PMS such as selective serotonin reuptake inhibitors (SSRIs). Further research is required, using validated outcome measures for PMS, adequate blinding and suitable comparator groups reflecting current best practice.

PICOs

Population
Intervention
Comparison
Outcome

The PICO model is widely used and taught in evidence-based health care as a strategy for formulating questions and search strategies and for characterizing clinical studies or meta-analyses. PICO stands for four different potential components of a clinical question: Patient, Population or Problem; Intervention; Comparison; Outcome.

See more on using PICO in the Cochrane Handbook.

Plain language summary

Acupuncture and acupressure for premenstrual syndrome (PMS)

Review question

Cochrane authors reviewed the evidence on the effectiveness and safety of acupuncture or acupressure in women with premenstrual syndrome (PMS) or premenstrual dysphoric disorder.

Background

We wanted to know whether using acupuncture or acupressure therapy was better than receiving sham acupuncture, no treatment or currently recommended pharmaceutical medications for PMS such as serotonin reuptake inhibitors (SSRIs ‐ a type of anti‐depressant).

Study characteristics

We found five randomised controlled trials including 277 women that examined the effect of acupuncture or acupressure in women with PMS. The evidence is current to September 2017. We also ran a smaller search in November 2017 to see if we had missed any recently published studies. Three trials compared acupuncture with sham acupuncture, one compared acupuncture with no treatment and one compared acupressure with sham acupressure.

Key results

Acupuncture may reduce overall mood and physical PMS symptoms when compared to sham. Acupressure may reduce the number of women having moderate to severe PMS symptoms when compared to sham acupressure. There was not enough evidence to determine the safety of acupuncture or acupressure.

Quality of the evidence

The quality of the evidence ranged from low to very low.The main limitations were imprecision due to small sample sizes (too few women in the study) and risk of bias related to blinding (where researchers or participants knew what treatment they were getting).

Authors' conclusions

Implications for practice

The limited evidence available suggests that acupuncture and acupressure may improve both physical and psychological symptoms of premenstrual syndrome (PMS) when compared to a sham control. There was insufficient evidence to determine whether there was a difference between the groups in rates of adverse events.There is no evidence comparing acupuncture or acupressure versus current International Society for Premenstrual Disorders (ISPMD) recommended treatments for PMS, such as selective serotonin reuptake inhibitors (SSRIs). Further research is required, using validated outcome measures for PMS, adequate blinding and suitable comparator groups reflecting current best practice.

Implications for research

Very few studies included validated outcome measures for PMS and none compared acupuncture or acupressure with a currently accepted pharmaceutical or psychological intervention for PMS. We suggest future research include three‐armed trials comparing acupuncture or acupressure with a suitable sham and with an active pharmaceutical or psychological intervention suggested by current clinical guidelines. Given the issues with blinding a three‐armed trial, having blinded clinician‐rated scales as well as subjective patient‐centred outcomes should be considered. There is a need to improve the quality and reporting in future trials, especially with regards to the choice of acupuncture point selection and the rationale behind the choice of point(s) should be clear. Future trials need to improve reporting on the qualifications and experience of the practitioner delivering the acupuncture or acupressure intervention, including years of training, qualification and current registration status.

Summary of findings

Open in table viewer
Summary of findings for the main comparison. Acupuncture compared to sham for premenstrual syndrome

Acupuncture compared to sham acupuncture for premenstrual syndrome

Patient or population: women with PMS
Setting: outpatient clinic or community
Intervention: acupuncture
Comparison: sham acupuncture

Outcomes

Anticipated absolute effects* (95% CI)

Relative effect
(95% CI)

№ of participants
(studies)

Certainty of the evidence
(GRADE)

Comments

Risk with sham acupuncture

Risk with Acupuncture

Overall premenstrual symptoms ‐ Mood at two month follow‐up. Assessed with: Daily Record of Severity of Problems (DRSP) scale. Scale range for DRSP (mood) 0‐60.

Lower scores indicate less severe premenstrual symptoms

The mean overall premenstrual symptoms‐ Mood was 51.91

MD 9.03 lower
(10.71 lower to 7.35 lower)

67
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupuncture may reduce the number of mood PMS related symptoms women have, compared to sham acupuncture.

Overall premenstrual symptoms ‐ Physical at two month follow‐up.Assessed with: DRSP.
Scale range for DRSP (physical) 0‐66.
Lower scores indicate less severe premenstrual symptoms

The mean overall premenstrual symptoms ‐ Physical was 46.11

MD 9.11 lower
(10.82 lower to 7.40 lower)

67
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupuncture may reduce the number of physical PMS related symptoms women have, compared to sham acupuncture.

Adverse events at up to 2 months' follow‐up

24 per 1000

42 per 1000
(10 to 189)

RR 1.74
(0.39 to 7.76)

167
(3 RCTs)

⊕⊝⊝⊝
VERY LOW 2 3

There was insufficient evidence to determine whether there was any difference between the groups in the rate of adverse events.

Specific PMS symptoms ‐ not measured

No studies reported on this outcome

Response rate at end of treatment

327 per 1000

1000 per 1000
(147 to 1000)

RR 4.22
(0.45 to 39.88)

100
(2 RCTs)

⊕⊝⊝⊝
VERY LOW 4 5 6

There was insufficient evidence to determine whether there was any difference in response rate between the groups.

Quality of Life
assessed with: WHOQOL‐BREF.

Scale range 0‐100.

Higher scores indicate greater quality of life.

The mean quality of life score was 73.8

MD 2.85 higher
(1.47 higher to 4.23 higher)

67
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupuncture may improve quality of life, compared to sham acupuncture.

*The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).

CI: Confidence interval; MD: mean difference; RCT: randomised controlled trial; RR: risk ratio

GRADE Working Group grades of evidence
High certainty: We are very confident that the true effect lies close to that of the estimate of the effect
Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different
Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect
Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect

1 Downgraded two levels due to very serious imprecision: small sample size (67 participants)

2 Downgraded one level due to serious risk of bias: unclear risk of blinding of both participants and assessors in one of the three trials. This may have affected adverse event reporting.

3 Downgraded two levels due to very serious imprecision: small sample size (167 participants) and rare events. 95% confidence interval covers both benefit and harm

4 Downgraded two levels due to very serious imprecision: small sample size (100 participants)

5 Downgraded one level due to serious risk of bias: unclear risk of blinding of both participants and assessors in one of the two included trials. This may be affected response rate.

6 Downgraded one level due to serious inconsistency: significant heterogeneity (I2 = 82%) due to different definitions of response rate.

Open in table viewer
Summary of findings 2. Acupuncture compared to no treatment for premenstrual syndrome

Acupuncture compared to no treatment for premenstrual syndrome

Patient or population: women with PMS
Setting: community
Intervention: acupuncture
Comparison: no treatment

Outcomes

Anticipated absolute effects* (95% CI)

Relative effect
(95% CI)

№ of participants
(studies)

Certainty of the evidence
(GRADE)

Comments

Risk with no treatment

Risk with Acupuncture

Overall premenstrual symptoms at end of treatment. Assessed with the Menstrual Symptom Severity List (MSSL).

Scale range 0‐33.

Lower scores indicate less severe symptoms.

The mean overall premenstrual symptom score was 17.54

MD 13.60 lower
(15.70 lower to 11.50 lower)

14
(1 RCT)

⊕⊝⊝⊝
VERY LOW 1 2

Due to the very low quality of the evidence, we are uncertain whether acupuncture reduces PMS symptoms compared to a no treatment control

Adverse events at end of treatment

0 per 1000

0 per 1000
(0 to 0)

not estimable

20
(1 RCT)

⊕⊝⊝⊝
VERY LOW 1 3

Neither group reported any adverse events

Specific PMS symptoms ‐ not measured

No studies reported on this outcome

Response rate ‐ not measured

No studies reported on this outcome

Quality of Life ‐ not measured

No studies reported on this outcome

*The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).

CI: Confidence interval; MD: mean difference

GRADE Working Group grades of evidence
High certainty: We are very confident that the true effect lies close to that of the estimate of the effect
Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different
Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect
Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect

1 Downgraded one level due to serious risk of bias: unclear risk of bias in randomisation, high risk of bias in blinding of participants which is likely to affect patient reported outcome. High level of missing data.

2 Downgraded two levels due to very serious imprecision: very small sample size (14 participants)

3 Downgraded two levels due to very serious imprecision: very small sample size (14 participants) for rare events.

Open in table viewer
Summary of findings 3. Acupressure compared to sham acupressure for premenstrual syndrome

Acupressure compared to sham acupressure for premenstrual syndrome

Patient or population: women with PMS
Setting: community
Intervention: acupressure
Comparison: sham acupressure

Outcomes

Anticipated absolute effects* (95% CI)

Relative effect
(95% CI)

№ of participants
(studies)

Certainty of the evidence
(GRADE)

Comments

Risk with sham acupressure

Risk with Acupressure

Overall premenstrual symptoms at end of treatment. Assessed with: number of women reporting moderate or severe PMS symptoms at the end of the intervention. Lower scores indicate more women with mild PMS or no PMS symptoms.

97 per 100

62 per 100
(50 to 76)

RR 0.64 (0.52 to 0.79)

90
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupressure may reduce the number of women with moderate to severe PMS symptoms at the end of the trial compared to sham acupressure

Adverse events ‐ not measured

No studies reported on this outcome

Specific PMS symptoms ‐ not measured

No studies reported on this outcome

Response rate ‐ not measured

No studies reported on this outcome

Quality of Life ‐ Physical at end of treatment. Assessed with: SF‐12

Scale range 0‐100

Higher scores indicate greater quality of life

The mean quality of Life ‐ Physical score was 67.2

MD 24.30 higher
(17.18 higher to 31.42 higher)

90
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupressure may improve physical quality of life compared to sham acupressure.

Quality of Life ‐ Mental at end of treatment. Assessed with: SF‐12

Scale range 0‐100

Higher scores indicate greater quality of life

The mean quality of Life ‐ Mental score was 52.34

MD 17.17 higher
(13.08 higher to 21.26 higher)

90
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupressure may improve mental quality of life compared to sham acupressure.

*The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).

CI: Confidence interval; RR: Risk ratio; MD: mean difference

GRADE Working Group grades of evidence
High certainty: We are very confident that the true effect lies close to that of the estimate of the effect
Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different
Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect
Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect

1 Downgraded two levels due to very serious imprecision: small sample size (90 participants)

Background

Description of the condition

Premenstrual syndrome (PMS) is a clinical syndrome resulting in symptoms that occur in relation to the menstrual cycle, and can lead to interference with women's lives. Symptoms usually begin five to 11 days before menstruation, and cease when menstruation starts, or soon after (Backstorm 1991; Dickerson 2004). PMS is common, with 95% of women reported to have one or more premenstrual symptoms during their reproductive life (Hylan 1999). Symptoms are often mild, but in some women can be severe enough to substantially affect daily activities. About 5% to 8% of women thus suffer from severe PMS; most of these women also meet criteria for the associated condition of premenstrual dysphoric disorder (PMDD) (Kimberly 2008). There is evidence that raised levels of inflammatory markers may play a role in PMS (Bertone‐Johnson 2014; Gold 2016).The aetiology of PMS is still unclear.

More than 200 premenstrual symptoms have been recorded and can be classified into three broad categories:

  1. behavioural symptoms including fatigue, insomnia, dizziness, changes in sexual interest, food cravings or overeating;

  2. psychological symptoms including irritability, anger, depressed mood, crying and tearfulness, anxiety, tension, mood swings, lack of concentration, confusion, forgetfulness, restlessness, loneliness, decreased self‐esteem, tension;

  3. physical symptoms including headaches, breast tenderness and swelling, back pain, abdominal pain and bloating, weight gain, swelling of extremities, water retention, nausea, muscle and joint pain (Dickerson 2004; Reid 1986).

PMDD, previously known as late luteal phase dysphoric disorder (LLPDD), is a more severe form of PMS (O'Brien 2011). PMDD is characterised by intense emotional symptoms occurring between ovulation and menstruation. Symptoms include severe depression, irritability and/or mood swings which interfere with relationships, social functioning, and work or school (Bancroft 1993; Medem 2004). According to the Diagnostic and Statistical Manual of Mental Disorders (DSM‐V) the diagnosis of PMDD requires (1) the presence of at least five luteal‐phase symptoms (panel), at least one of which must be a mood symptom (i.e. depressed mood, anxiety or tension, affect lability, or persistent anger and irritability); (2) the timing of symptoms confirmed by two cycles of daily charting; and (3) evidence of functional impairment. Finally, symptoms must not be the exacerbation of another psychiatric condition (APA 2013).The American College of Obstetrics and Gynecology (ACOG) criteria include the presence of at least one psychological or physical symptom that causes significant impairment and confirmed by means of prospective ratings (ACOG 2001).

Description of the intervention

Acupuncture is a Chinese medical treatment that has been used for more than 2000 years in China and Japan (White 2004). Acupuncture involves the insertion of fine metal needles into the skin and underlying tissues at precise points on the body. The needle can be left alone, or can be stimulated by twisting of the needle in various ways, or stimulated by electricity (electro‐acupuncture). Acupressure stimulates the same points as acupuncture using manual pressure, usually with the finger or thumb, on these points rather than the insertion of a needle (Beal 1999). Acupressure is able to be self‐delivered by the person themselves, while acupuncture is usually delivered by a trained practitioner. However, acupuncture is considered to give a greater 'dose' of treatment due to the insertion and manipulation of the needle (Armour 2016), rather than pressure alone. Acupuncture treatment is composed of needling aspects (choice of points and needling techniques), specific components relating to the style of diagnosis and treatment used, and generic needling components not specific to acupuncture such as belief, time, and attention given to the patient (Langevin 2011).

How the intervention might work

The mechanism of acupuncture remains unclear although potential mechanisms have been identified from research, and it is likely that a number of concurrent pathways may contribute to the overall therapeutic effect. Acupuncture can modulate endogenous opioids (Mayor 2013), and needling of specific acupuncture points such as SP6 may alter blood flow to the uterus and modulate prostaglandin levels (Armour 2016). Recent research has shown that acupuncture acts on a variety of inflammatory markers (McDonald 2013; McDonald 2015). Because women with PMS have elevated levels of inflammatory markers (Gold 2016), this may be a primary pathway through which acupuncture may reduce PMS symptoms.

Why it is important to do this review

Many interventions have been used to treat PMS, such as selective serotonin reuptake inhibitors (SSRIs) and hormonal interventions. Current guidelines for the treatment of PMS and PMDD from the International Society for Premenstrual Disorders (ISPMD) recommend SSRI medication as a first line of treatment (Ismaili 2016). SSRI treatments are effective for many women with PMS however the side effects are frequent and can lead to discontinuation (Marjoribanks 2013). Acupuncture may be an effective non‐pharmacological therapeutic option with a much lower rate of adverse events than SSRI medication (Witt 2009).

Objectives

To evaluate the effectiveness and safety of acupuncture or acupressure for women with premenstrual syndrome (PMS) or premenstrual dysphoric disorder (PMDD).

Methods

Criteria for considering studies for this review

Types of studies

We included randomised controlled trials without restriction on language and publication types. Cross‐over studies were eligible for inclusion, but we planned to include only data from the first phase. We excluded non‐randomised studies (for example studies with evidence of inadequate sequence generation such as alternate days, patient numbers).

Types of participants

Inclusion

We included studies of women of any age who met the diagnostic criteria for premenstrual syndrome (PMS) or premenstrual dysphoric disorder (PMDD). Diagnosis of PMS or PMDD had to be made prior to trial inclusion by a healthcare professional such as a general practitioner using an established diagnostic tool such as the DSM‐IV (Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition) and ICD‐10 (International Statistical Classification of Diseases and Related Health Problems 10th Revision), or a validated symptom screening tool.

Exclusion

We excluded studies where there was only a self‐diagnosis of PMS.

Types of interventions

The included intervention group methods were body acupuncture, electro‐acupuncture, scalp acupuncture, ear acupuncture, acupressure, electromagnetic acupuncture, pyonex, intradermal needling, or moxibustion. However, we excluded acupoint injection. If acupuncture or acupressure were combined with another valid therapy, we included these studies if the additional therapy was the same in both groups such as acupuncture plus selective serotonin reuptake inhibitors (SSRIs) versus SSRI alone.

Control group methods were sham acupuncture or sham acupressure, no treatment, or any conventional treatments recommended by the International Society for Premenstrual Disorders (ISPMD), including SSRIs, oral contraceptives, gonadotropin‐releasing hormone agonists, danazol or estradiol.

We excluded studies which compared two different forms of acupuncture, or compared acupuncture with a herbal medicine or a conventional treatment without proven efficacy.

Types of outcome measures

Primary outcomes

  1. Overall premenstrual symptoms, measured using a validated prospective screening tool (such as the Premenstrual Symptom Screening Tool (PSST) or Daily Record of Severity of Problems (DRST)), or by pre‐defined medical diagnostic criteria. These may be reported as an overall score or by separate sub scales such as mental and physical symptoms.

  2. Adverse events (all adverse events, specific adverse effects, withdrawals for adverse effects).

Secondary outcomes

  1. Specific symptoms of PMS: behavioural, psychological, physical.

  2. Response rate (according to how response to treatment was defined in individual studies).

  3. Quality of life, measured by a validated scale such as the Short form (SF) SF‐36 or SF‐12.

Search methods for identification of studies

We conducted a comprehensive and exhaustive search strategy to identify all relevant studies regardless of language or publication status (published, unpublished, in press, and in progress). The search was conducted in consultation with the Gynaecology and Fertility Group Information Specialist.

See Appendix 1, Appendix 2; Appendix 3; Appendix 4; Appendix 5; Appendix 6; Appendix 7 for search strategies.

Electronic searches

We searched the Cochrane Gynaecology and Fertility Specialised Register, the Cochrane Central Register of Studies (CENTRAL CRSO), MEDLINE, AMED, Embase, PsycINFO, CINAHL (from inception to 21 September 2017) and four electronic databases in China (from inception to 15 Oct 2017): Chinese Biomedical Literature database (CBM), China National Knowledge Infrastructure (CNKI), VIP information/Chinese Scientific Journals database and WANFANG database. An additional search on 2 November 2017 was undertaken using PubMed and Google Scholar to identify any recent trials not yet included in the previous databases.

Searching other resources

Other sources of trials included:

  1. Trial registers for ongoing and registered trials: http://www.clinicaltrials.gov and WHO International Clinical Trials Registry Platform (apps.who.int/trialsearch/).

  2. Reference lists from included articles and previous systematic reviews.

  3. Additional electronic resources were searched in November 2017:

    1. DARE – Database of Abstracts of Reviews of Effects (reference lists from non Cochrane reviews on similar topics);

    2. ProQuest Dissertations & Theses for unpublished dissertations and theses;

    3. Conference abstracts on the Web of Science;

    4. OpenGrey for unpublished literature from Europe ‐ http://www.opengrey.eu/;

    5. LILACS database for the Portuguese and Spanish speaking world ‐ http://regional.bvsalud.org/php/index.php?lang=en.

Additional electronic resources were searched in English.

Data collection and analysis

Selection of studies

Two review authors (MA and CE) screened the titles and abstracts of articles found in the search, and discarded trials that were clearly not eligible. One review author (JH) searched for and selected the trials from the Chinese databases.

One review author (JH) translated Chinese papers. MA, CE and JH independently assessed whether trials met the inclusion criteria, with disagreements resolved by discussion. If articles contained insufficient information to make a decision about eligibility, MA or CE attempted to contact authors of the original reports to obtain further details. If details of randomisation were unclear in the reporting, we contacted all trial authors to ascertain if the study was truly randomised. We made a first contact and then sent a reminder one month later.

The selection process was documented with a "PRISMA" flow chart (Figure 1).


Study flow diagram.

Study flow diagram.

Data extraction and management

Following an assessment for inclusion MA, CS, CE, TW and JH independently extracted data. We resolved discrepancies by discussion. For each included trial we extracted data regarding the location of the trial, the methods of the trial (as per assessment of risk of bias), the participants (age range, eligibility criteria), the nature of the interventions, and data relating to the outcomes specified above. We collected information on reported benefits and adverse effects. We extracted data and entered them onto a form sourced from the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011). MA checked and entered data into Review Manager 5.3 (RevMan 2014).

Assessment of risk of bias in included studies

Two review authors (from MA, CS, CE, TW and JH) independently assessed risks of bias for each trial, using the criteria described in the Cochrane Handbook (Higgins 2011). The tool consists of seven items, with three potential responses: 'yes', 'no', and 'unclear'. In all cases a judgement of ‘yes’ indicates a low risk of bias and a judgement of ‘no’ indicates a high risk of bias. If insufficient detail was reported our judgement was usually ‘unclear’. We also made a judgement of 'unclear' if we knew what happened in the study but the risk of bias was unknown to us, or if an entry was not relevant to the study at hand (particularly for assessing blinding and incomplete outcome data, or when the outcome being assessed by the entry had not been measured in the study).

We assessed the following characteristics: sequence generation, allocation concealment, blinding (or masking) of participants, blinding (or masking) of outcome assessors, incomplete data assessment, selective outcome reporting, and other sources of bias. We resolved disagreements that arose at any stage by discussion between the review authors or with a third party, when necessary. We generated a 'Risk of bias' assessment table for each study. We assessed other aspects of trial quality including the extent of blinding (if appropriate), whether groups were comparable at baseline, the extent of losses to follow‐up, non‐compliance and whether the outcome assessment was standardised according to the Cochrane Handbook (Higgins 2011). This information is presented in the Characteristics of included studies.

Measures of treatment effect

We performed statistical analysis in accordance with the Cochrane Handbook (Higgins 2011). We performed statistical analysis using Review Manager 5 software (RevMan 2014). For dichotomous data, we expressed results for each study as summary risk ratios (RRs) with corresponding 95% confidence intervals (CIs), using the Mantel‐Haenszel method. We expressed continuous data as mean differences (MDs) with 95% CIs, or as standardised mean differences (SMDs) if outcomes were conceptually the same but measured in different ways in the different trials. A standard rule of thumb for interpreting effect sizes is that 0.2 represents a small effect, 0.5 a moderate effect and 0.8 a large effect.

Unit of analysis issues

The primary analysis was per woman randomised. We included trials with multiple arms and describe them in the Characteristics of included studies, for example, two different types of acupressure compared with sham acupressure. If there were two acupuncture or acupressure groups, we combined data from both treatment arms into one group. For studies with a sham control and no treatment control group, we divided the shared intervention evenly between groups as described in the Cochrane Handbook (Higgins 2011). Where outcomes were repeated measures, we undertook analysis of outcomes at the end of the intervention.

Dealing with missing data

We analysed data on an intention‐to‐treat basis, as far as possible. We did not impute missing data but we did report the proportion lost to follow‐up and analysed only the available data. In future updates, if more eligible studies are included, the impact of including studies with high levels of missing data in the overall assessment of treatment effect will be explored by using sensitivity analysis.

Assessment of heterogeneity

We measured inconsistency across trials in the meta‐analysis using the I² statistic. This describes the percentage of total variation across studies that is due to heterogeneity rather than chance (Higgins 2003; Higgins 2011). The interpretation of the I² statistic is as follows:

  1. 10% to 40% might not be important;

  2. 30% to 60% may represent moderate heterogeneity;

  3. 50% to 90% may represent substantial heterogeneity;

  4. 75% to 100% considerable heterogeneity.

We regarded heterogeneity as substantial if an I² was greater than 50% and either the Tau² was greater than zero, or there was a low P value (less than 0.10) in the Chi² test for heterogeneity. If we identified substantial heterogeneity (above 50%), we used a random‐effects model in the meta‐analysis.

Assessment of reporting biases

We planned to investigate potential biases of publication using the funnel plot or other analytical methods (Egger 1997). If there were 10 or more studies in the same analysis we would have investigated reporting biases (such as publication bias) using funnel plots. If visual assessment had suggested asymmetry, we would have explored possible reasons.

Data synthesis

We carried out statistical analysis using the Review Manager software (RevMan 2014). We used fixed‐effect meta‐analysis for combining data where it was reasonable to assume that studies were estimating the same underlying treatment effect: i.e. where trials were examining the same intervention, and the trials’ populations and methods were judged sufficiently similar, or if the number of studies was small (less than three).

If there was clinical heterogeneity sufficient to expect that the underlying treatment effects differed between trials, or if substantial statistical heterogeneity was detected, we used random‐effects meta‐analysis to produce an overall summary, if an average treatment effect across trials was considered clinically meaningful. The random‐effects summary was treated as the average range of possible treatment effects and we discussed the clinical implications of treatment effects differing between trials. If the average treatment effect was not clinically meaningful, we did not combine trials. Where we used random‐effects analyses, the results were presented as the average treatment effect with 95% CIs, and the estimates of Tau² and I².

An increase in the odds of a particular outcome, which may be beneficial (e.g. reduction in symptoms) or detrimental (e.g. adverse effects), will be displayed graphically in the meta‐analyses to the right of the centre‐line and a decrease in the odds of an outcome to the left of the centre‐line.

Subgroup analysis and investigation of heterogeneity

Had we identified substantial heterogeneity, we planned to investigate it using sensitivity analyses and to consider whether an overall summary was meaningful, and if it was, to undertake a random‐effects analysis.

Sensitivity analysis

We planned to conduct sensitivity analyses for the primary outcomes to determine whether the conclusions were robust to arbitrary decisions made regarding the eligibility and analysis. These analyses included consideration of whether the review conclusions would have differed if eligibility had been restricted to studies at low risk of bias (defined as studies rated as being at low risk of bias with respect to sequence generation and allocation concealment, and not rated as at high risk of bias in any of the domains assessed).

We did not perform any sensitivity analyses because each subgroup analysis did not include sufficient studies.

Overall quality of the body of evidence: 'Summary of findings' tables

We used the GRADE approach as outlined in the GRADE handbook in order to assess the quality of the body of evidence. We prepared a 'Summary of findings' table using GRADEpro (GRADEpro GDT 2015) and Cochrane methods. This table outlines the overall quality of the body of evidence for the main review outcomes (Overall premenstrual symptoms, adverse events, specific PMS symptoms, response rates and quality of life) for the main review comparison (acupuncture versus sham). We also generated 'Summary of findings' tables for the comparisons 'acupuncture versus no treatment'; and 'acupressure versus sham acupressure' for these same outcomes.

The GRADE approach uses five considerations (study limitations, consistency of effect, imprecision, indirectness and publication bias) to assess the quality of the body of evidence for each outcome. The evidence can be downgraded from 'high quality' by one level for serious (or by two levels for very serious) limitations, depending on assessments for risk of bias, indirectness of evidence, serious inconsistency, imprecision of effect estimates or potential publication bias. Judgements were justified, documented, and incorporated into reporting of results for each outcome.

Assessment of the quality of the acupuncture intervention delivered

We assessed the quality of the acupuncture or acupressure intervention in published journal articles using the National Institute for Complementary Medicine Acupuncture Network (NICMAN) scale (Smith 2017). The NICMAN scale was developed to assess the quality of the acupuncture delivered in a clinical trial, and is designed to be used in combination with the STandards for Reporting Interventions in Clinical Trials of Acupuncture (STRICTA) reporting guidelines (MacPherson 2010). This scale assesses and scores the following: the rationale behind the point selection, the qualifications and experience of the practitioner delivering the intervention, the adequacy of the number of treatment sessions, and the reporting of vital components of the acupuncture practice itself such as needle depth, diameter and stimulation. Higher scores indicate improved quality of the acupuncture intervention delivered, but there is no cut‐off for a 'high' versus 'low' quality score on the NICMAN scale. Each trial was independently scored out of 23 by two review authors (from CS, JH, CE, TW, YY). Any discrepancies were resolved by discussion. If this was not reached a third review author (MA) decided on the final score.

Results

Description of studies

Results of the search

The search retrieved 379 articles. Twenty‐eight studies were potentially eligible and were retrieved in full text. Five studies met our inclusion criteria. Twenty studies were excluded. One study was placed in awaiting classification due to an inability to contact the study authors. Two studies are ongoing. See study tables: Characteristics of included studies; Characteristics of excluded studies; Characteristics of studies awaiting classification; Characteristics of ongoing studies

Included studies

Study design and setting

Two hundred and seventy‐seven women with premenstrual syndrome (PMS )were included in this review. One trial was from Croatia (Habek 2002), two from China (Yu 2006; Zhang 2017), one from Korea (Shin 2009) and one from Iraq (Bazarganipour 2017). One trial (Bazarganipour 2017) reported funding support.

Two trials were undertaken in the community (Bazarganipour 2017; Shin 2009), two trials in an outpatient hospital setting (Yu 2006; Zhang 2017), and in one trial the setting was unclear (Habek 2002).

All studies were randomised controlled trials (RCTs). Two studies had a parallel two‐arm design in which participants were randomised either to acupuncture or sham acupuncture group (Habek 2002; Yu 2006). Three studies had a three‐arm design (Bazarganipour 2017; Shin 2009; Zhang 2017). For two of the three armed trials, only the appropriate arms of verum acupuncture and sham or waiting‐list control were included in the analysis (Shin 2009; Zhang 2017). One trial included two different acupressure groups, one using the acupuncture point LI4 and one using LR3 (Bazarganipour 2017). These were combined into one active group as specified in Unit of analysis issues.

Participants

Women in two trials were aged from 18 to 45 years (Bazarganipour 2017;Yu 2006), one from 18 to 40 years (Zhang 2017),one from 20 to 35 years (Shin 2009); the other only reported the mean ages of the two groups were both about 30 years (Habek 2002). Sample sizes ranged from 20 (Shin 2009) to 90 (Bazarganipour 2017).

DSM‐IV was used as diagnostic criteria in two trials (Yu 2006; Zhang 2017), ICD‐10 was used in one trial (Shin 2009), PSST scores in one trial (Bazarganipour 2017) and a patient history form was used in the other trial (Habek 2002).

Women were screened for baseline PMS symptoms for three menstrual cycles in two trials (Shin 2009; Yu 2006), two cycles in one trial (Habek 2002), and one cycle in two trials (Bazarganipour 2017; Zhang 2017).

Interventions and comparators

Three trials reported the effect between acupuncture and sham acupuncture (Habek 2002; Yu 2006; Zhang 2017), and one between acupuncture and no treatment control (Shin 2009). One of these trials (Habek 2002), combined acupuncture with auriculo‐acupuncture, which is acupuncture administered to points on the pinna of the ear, while another administered Korean hand acupuncture (Shin 2009). One trial compared acupressure and sham acupressure (Bazarganipour 2017). The type of sham acupuncture used was superficial needling on 'non‐meridian' or 'non‐acupuncture' points in two trials (Habek 2002; Yu 2006) and a non‐penetrative sham in one trial (Zhang 2017). Sham acupressure consisted of acupressure being applied to an area unrelated to PMS symptoms (Bazarganipour 2017).

Treatment frequency and total number of sessions

Acupuncture 'dose' consists of neurophysiological and cumulative dose (Armour 2016). Neurophysiological components include needle retention time, number of acupuncture points used, location of acupuncture points and type of stimulation used (manual versus electro‐acupuncture). Cumulative dose components include the frequency and total number of treatments.

The treatment frequency, timing of treatment and total number of treatments used in the included studies were very heterogenous. The number of treatments ranged from seven (Habek 2002) to 28 (Bazarganipour 2017) in total. Treatments were given throughout the menstrual cycle in two studies (Shin 2009; Zhang 2017), in the luteal phase only in two studies (Bazarganipour 2017; Habek 2002), and in the last seven days of the cycle in another study (Yu 2006). Treatment frequency varied from daily (Bazarganipour 2017), to three times per week (Zhang 2017).

The number of acupuncture points varied between studies

Duration of treatment sessions

Session durations and needle retention ranged from 15 minutes (Shin 2009) to 30 minutes (Habek 2002; Yu 2006; Zhang 2017).

Outcomes

All trials except Zhang 2017 collected their outcomes at the end of the intervention. Zhang 2017 had a one‐ and two‐month follow‐up. Data from the two‐month follow‐up were used.

Overall premenstrual symptoms

Daily symptom rating charts were used in three studies (Shin 2009, Yu 2006; Zhang 2017). The Premenstrual Daily Symptom Diary (PTSD), a four‐point scale from 'no symptom' to 'severe symptoms' on 19 items for 31 days was used in one trial (Yu 2006). It is unclear if this was a validated outcome measure. The sum of the symptoms score change between pretreatment and the third treatment month was used, 'mean rank' but not 'mean and standard deviation' was reported as the outcome, so the data were not included in the meta‐analysis (Yu 2006). The Menstrual Symptom Severity List, a five‐point scale rating symptoms from 'not present' to 'extreme' on 33 items (Mitchell 1992) was used in one trial (Shin 2009). The treatment effects were defined as the differences between the first pretreatment luteal phase (five days prior to menses) mean score and the first post‐treatment luteal phase (five days prior to menses) mean score, but only end scores were reported, so we used end scores as the outcome in this review (Shin 2009). One study (Bazarganipour 2017) used the Premenstrual Symptom Screening Tool (PSST) scores as their outcome (Steiner 2003). PSST has 19 questions in two parts, 14 questions related to mood, body, and behaviour and five questions based on the potential impact of these symptoms on an individual’s life. These data were presented as the number of women reporting no or mild PMS symptoms compared to those reporting moderate to severe at the end of the intervention. The number of women reporting moderate to severe PMS at the end of the review was presented in the analysis. One study (Zhang 2017), used a Daily Record of Severity of Problems scale (DRSP), a validated measure of PMS symptoms (Endicott 2006), and provided separate scores for mood and physical symptom components. These were reported separately in the meta‐analysis.

Adverse events

Adverse events were reported as an outcome in all acupuncture trials but not in the acupressure trial (Bazarganipour 2017). Specific adverse events reported included haematomas, pain at the site of needling and itching after needling.

Specific symptoms of PMS: psychological, physical and functional symptoms, irritability.

No studies reported on this outcome.

Response rate

Two trials reported effectiveness rate. Treatments were considered to be successful if there were no further symptoms of PMS, if acupuncture and medication treatment of PMS became unnecessary or if PMS symptoms did not occur for at least a year after the acupuncture therapy (Habek 2002). Acupuncture was considered to be effective if the improvement rate of the scores on PTSD was greater than or equal to 30% (Yu 2006).

Quality of life

Two trials reported quality of life (QoL) outcomes. One study (Bazarganipour 2017) used the SF12 and one study (Zhang 2017) used the WHO Quality of Life‐BREF scale (WHOQOL‐BREF). The SF‐12 is a 12‐item questionnaire used to assess generic health outcomes from the patient’s perspective (Ware 1996). It is based on a subset of questions from the larger SF‐36 and covers the same eight domains of health outcomes, including physical functioning (PF), role‐physical (RP), bodily pain (BP), general health (GH), vitality (VT), social functioning (SF), role‐emotional (RE), and mental health (MH). The WHOQOL‐BREF instrument comprises 26 items, which measure the following broad domains: physical health, psychological health, social relationships, and environment (Whoqol 1998).

Excluded studies

Twenty studies were excluded, see Characteristics of excluded studies;

Three studies were excluded because they were not randomised controlled trials (RCTs) (Li 2004Li 2008; Wang 2003).

Four studies were excluded because they were quasi‐RCTs (Carvalho 2013; Hong 2002Sun 2004Jiang 2005).

Four studies were excluded because they used combined comprehensive treatment methods as control group (Xu 2006aXu 2006bXu 2011; Zhang 1994).

Two studies were excluded because they used two acupuncture methods as control group (Gu 2008; Zhao 2012).

Two studies were excluded because they used traditional Chinese medicine as control group (Hu 2010Li 2014).

Five studies were excluded because they compared acupuncture with western medicine but that medicine was not mentioned in Types of interventions (Cao 2013Chen 2011; Koleini 2017Sun 2008Zheng 2001).

Risk of bias in included studies

See Figure 2 and Figure 3. One study (Zhang 2017) was at a low risk of bias in all domains.


Risk of bias graph: review authors' judgments about each risk of bias item presented as percentages across all included studies.

Risk of bias graph: review authors' judgments about each risk of bias item presented as percentages across all included studies.


Risk of bias summary: review authors' judgments about each risk of bias item for each included study.

Risk of bias summary: review authors' judgments about each risk of bias item for each included study.

Allocation

Random sequence generation

The method of random sequence generation was associated with a low risk of bias in three trials (Bazarganipour 2017; Yu 2006; Zhang 2017). The two remaining trials were rated as unclear risk of bias as they did not report the methods of randomisation (Habek 2002Shin 2009).

Allocation concealment

Two trials were rated as at low risk of bias related to allocation concealment (Yu 2006; Zhang 2017), while the other three were rated as unclear due to lack of clear reporting (Bazarganipour 2017; Habek 2002; Shin 2009).

Blinding

Sham‐acupuncture was used in three trials  (Habek 2002; Yu 2006; Zhang 2017). Blinding of participants was not tested in one trial (Habek 2002) and was rated as unclear. Blinding remained intact for two trials (Yu 2006; Zhang 2017), which were rated as low risk of bias. Sham acupressure was used in one trial (Bazarganipour 2017), blinding was not tested but it is unlikely that due to the pressure used that blinding would have been unsuccessful and was rated as low risk of bias. One trial using a no treatment control did not attempt to blind (Shin 2009) and was rated as high risk of bias.

Detection bias was rated as low risk of bias in two trials (Yu 2006; Zhang 2017), where the clinicians assessing the outcomes were blinded to group allocation. Blinding was rated as unclear in the other three trials (Bazarganipour 2017; Habek 2002; Shin 2009), where the blinding status of the assessors and analysts was not reported.

Incomplete outcome data

Four trials had missing data for less than 20% of participants and were rated as at low risk of attrition bias (Bazarganipour 2017; Habek 2002; Yu 2006; Zhang 2017). One trial had data missing for over 20% of participants and was rated as at high risk of attrition bias (Shin 2009).

Selective reporting

Four trials were rated as at unclear risk of selective reporting as there was no published protocol (Bazarganipour 2017; Habek 2002; Shin 2009; Yu 2006). One trial (Zhang 2017) was rated as at low risk of bias: this was a PhD thesis and included a comprehensive protocol in the methods section.

Other potential sources of bias

No other potential source of bias was identified in any study, and all studies were rated as at low risk for this domain.

NICMAN scale scores

Three studies were analysed and NICMAN scale scores assessed (Bazarganipour 2017; Habek 2002; Shin 2009). It was not practical to perform this assessment on an entire Masters or PhD thesis so no score was generated for Yu 2006 or Zhang 2017. Scores (out of 23) ranged from 14 (Habek 2002), 16 (Bazarganipour 2017) to 19 (Shin 2009). Overall, the design of the study, including the population, intervention, comparison and outcomes was considered adequate in all three studies, however there was poor reporting on the rationale behind the acupuncture point selection and the qualifications of the therapist performing the acupuncture in both Bazarganipour 2017 and Shin 2009.

Effects of interventions

See: Summary of findings for the main comparison Acupuncture compared to sham for premenstrual syndrome; Summary of findings 2 Acupuncture compared to no treatment for premenstrual syndrome; Summary of findings 3 Acupressure compared to sham acupressure for premenstrual syndrome

1 Acupuncture versus sham

One trial used 'mean rank' but not 'mean and standard deviation' as the outcome for 'Overall premenstrual symptoms', so was not included in this meta‐analysis (Yu 2006).

Primary outcomes
1.1 Overall premenstrual symptoms ‐ Mood

(Analysis 1.1)

One trial (Zhang 2017) reported on this outcome.

Acupuncture may provide a greater reduction in mood‐related premenstrual syndrome (PMS) symptoms than sham acupuncture (mean difference (MD) ‐9.03, 95% confidence interval (CI) ‐10.71 to ‐7.35, one RCT, n = 67, low‐quality evidence), as measured by the Daily Record of Severity of Problems scale (DRSP). The evidence suggests that if women have a mood score of 51.91 points with sham acupuncture, their score with acupuncture would be between 10.71 and 7.35 points lower.

1.2 Overall premenstrual symptoms ‐ Physical

(Analysis 1.2)

One trial (Zhang 2017) reported on this outcome. Acupuncture may provide a greater reduction in physical PMS symptoms than sham acupuncture (MD ‐9.11, 95% CI ‐10.82 to ‐7.40, one RCT, n = 67, low‐quality evidence), as measured by the DRSP. The evidence suggests that if women have physical score of 46.11 points their score with acupuncture would be between 10.82 and 7.4 points lower.

1.3 Adverse events

(Analysis 1.3)

Three trials reported the safety of acupuncture in this comparison. Overall, there was insufficient evidence to determine if there was a difference in the adverse event rate between acupuncture and sham acupuncture (RR 1.74, 95% CI 0.39 to 7.76, three RCTs, n = 167, I2 = 0%, very low‐quality evidence)

Secondary outcomes
1.4 Response rate

(Analysis 1.4)

Two trials (Habek 2002; Yu 2006) reported response rate for acupuncture versus sham acupuncture

Individually, both studies found a benefit in the intervention group, and pooling of their data supported this finding (RR 2.59, 95% CI 1.71 to 3.92; participants = 100; studies = 2; I2 = 82%). However, due to the high heterogeneity we tested the effect of using a random‐effects model, which provided no clear evidence of benefit for acupuncture (RR 4.22, 95% CI 0.45 to 39.88, two RCTs, n = 100, I2 = 82%, very low‐quality evidence). The high heterogeneity is most likely due to very heterogenous definitions of what was defined as a 'response rate' between the two studies as well as clinical heterogeneity in the intervention delivered. We concluded that owing to the very low quality of the evidence, there may be little or no difference between the groups for this outcome.

1.5 Quality of life

(Analysis 1.5)

One trial reported quality of life outcomes using the WHOQOL‐BREF (Zhang 2017). Acupuncture may improve quality of life compared to sham acupuncture, measured at the end of the intervention (MD 2.85, 95% CI 1.47 to 4.23, one RCT, n = 67, low‐quality evidence).

1.6 Specific symptoms of PMS

No study reported this outcome.

2 Acupuncture versus no treatment

Primary outcomes
2.1. Overall premenstrual symptoms

(Analysis 2.1)

One trial reported overall premenstrual symptoms using the Menstrual Symptom Severity List (MSSL), and reported end scores for this outcome (Shin 2009). Due to the very low quality of the evidence, we are uncertain whether acupuncture reduces PMS symptoms compared to a no treatment control (MD ‐13.60, 95% CI ‐15.70 to ‐11.50, one RCT, n = 14, very low‐quality evidence)

2.2. Adverse events

(Analysis 2.2)

One trial reported the safety of acupuncture. There were no adverse events reported in either group (Shin 2009).

Secondary outcomes
2.3 Specific symptoms of PMS

No study reported this outcome.

2.4 Response rate

No study reported this outcome.

3 Acupressure versus sham acupressure

Primary outcomes
3.1 Overall premenstrual symptoms

(Analysis 3.1)

One trial reported overall PMS (Bazarganipour 2017). There was evidence that acupressure may reduce the number of women with moderate to severe PMS symptoms at the end of the trial compared to sham acupressure (RR 0.64 95% CI 0.52 to 0.79, one RCT, n = 90, low‐quality evidence). The evidence suggests that if 97 women out of 100 in the sham acupressure group had moderate to severe PMS symptoms, the number of women in the acupressure group with moderate to severe symptoms would be 50 to 76 women.

3.2. Adverse events

No study reported this outcome.

Secondary outcomes
3.3 Specific symptoms of PMS

No study reported this outcome.

3.4 Response rate

No study reported this outcome.

3.5 Quality of life

(Analysis 3.2)

One trial reported quality of life outcomes using the SF‐12 scale (Bazarganipour 2017). The mental and physical subscale components were reported separately. There was evidence that acupressure may improve both physical (MD 24.30, 95% CI 17.18 to 31.42, one RCT, n = 90, low‐quality evidence) and mental (MD 17.17, 95% CI 13.08 to 21.26, one RCT, n = 90, low‐quality evidence) quality of life, compared to sham acupressure.

Discussion

Summary of main results

Evidence from these five trials including 277 women found that both acupuncture and acupressure may reduce overall premenstrual syndrome (PMS) symptoms. Acupuncture may reduce the overall severity of PMS symptoms as well as the number of both mood‐related and physical PMS symptoms at the end of the intervention. There was insufficient evidence to determine whether there is a difference in rates of adverse events between acupuncture and sham acupuncture. For the single acupressure trial there was a reduction in the number of women suffering from moderate or severe PMS at the end of the intervention. There are a small number of trials included within each comparison, and this limits the power of the review to detect any meaningful differences between groups.Therefore, while there may be benefits for acupuncture and acupressure, these benefits should be interpreted with caution until larger studies are undertaken.

Overall completeness and applicability of evidence

There were a small number of trials included in this review, and the outcome measures for PMS symptoms were heterogenous, limiting the ability to make meaningful meta‐analysis. The completeness and applicability of the evidence is limited due to the small number of trials, with relatively small sample sizes. The reporting on specific safety outcomes was limited in some trials, either due to a lack of reporting or due to very small sample sizes making it hard to detect these relatively rare events. All trials compared acupuncture with either sham or with no‐treatment. No trial compared acupuncture or acupressure with the current International Society for Premenstrual Disorders (ISPMD) recommended treatments.

Trials varied significantly in the frequency and total number of treatments given as well as the acupuncture points chosen and the timing of the intervention relative to menses, which may reflect differences in 'best practice' between different countries. This difference in both 'dosage' and timing around menses can potentially affect the outcome in menstrual conditions (Armour 2016), however due to the small number of trials a subgroup analysis on this factor could not be undertaken.

Quality of the evidence

The 'Risk of bias' tables (Figure 2; Figure 3) show that acupuncture and acupressure studies on PMS have not generally been undertaken to a high methodological standard. Only one trial was at a low risk of bias across all domains. The majority of studies had low risk of bias in randomisation and unclear risk of bias in allocation concealment, blinding of outcome assessors and selective reporting. In cases where it was unclear, we contacted the authors to provide additional details, however there was no response.

None of the trials' acupuncture interventions were rated highly on the National Institute for Complementary Medicine Acupuncture Network (NICMAN) scale, with issues in reporting the rationale for point selection being common. Acupuncture point selection, based on either biomedical or traditional understanding, depending on the framework chosen, should be used to guide point selection for trials, and without a clear rationale for point choice it is difficult to determine if studies are using treatments that reflect clinical practice.

Blinding of participants was at low risk of bias in three studies, however none of these reported assessments of blinding success between groups. The lack of blinding of outcome assessors, or in the case of participant‐reported outcomes, of participants themselves, may influence outcomes, and therefore, were rated as high risk of bias. Due to the small number of studies in each comparison means that while individual studies have had positive results, there is currently insufficient evidence for a consistent treatment effect.

The GRADE quality of evidence was often downgraded due to imprecision, mostly related to small sample sizes, and risk of bias particularly related to participant blinding. The overall quality of evidence for GRADE was moderate to very low for PMS symptoms and very low for adverse events (summary of findings Table for the main comparison; summary of findings Table 2; summary of findings Table 3).

Potential biases in the review process

We attempted to minimise bias during the review process. Two review authors assessed the eligibility of studies, carried out data extraction and assessed the risk of bias. We are aware that some literature on acupuncture and acupressure may not be published in mainstream journals and therefore maybe excluded from the main databases. Our search was comprehensive and we included studies identified in languages other than English, however we did not systematically search all other language databases, for example Korean language only databases, therefore we cannot rule out the possibility that some studies may have been missed.

Agreements and disagreements with other studies or reviews

There has been only one systematic review on this topic performed in the past five years (Jang 2014). That review included eight acupuncture studies and included four trials that did not meet the inclusion criteria for this review due to the choice of control interventions (Gu 2008; Hong 2002; Xu 2006a; Xu 2006b). Overall, the authors found that physical symptoms such as breast tenderness improved while psychological symptoms did not. Our review included several more recent trials that were not included in Jang 2014 and found benefits for acupuncture for both physical and psychological symptoms.

Study flow diagram.
Figures and Tables -
Figure 1

Study flow diagram.

Risk of bias graph: review authors' judgments about each risk of bias item presented as percentages across all included studies.
Figures and Tables -
Figure 2

Risk of bias graph: review authors' judgments about each risk of bias item presented as percentages across all included studies.

Risk of bias summary: review authors' judgments about each risk of bias item for each included study.
Figures and Tables -
Figure 3

Risk of bias summary: review authors' judgments about each risk of bias item for each included study.

Comparison 1 Acupuncture vs sham acupuncture, Outcome 1 Overall premenstrual symptoms‐ Mood.
Figures and Tables -
Analysis 1.1

Comparison 1 Acupuncture vs sham acupuncture, Outcome 1 Overall premenstrual symptoms‐ Mood.

Comparison 1 Acupuncture vs sham acupuncture, Outcome 2 Overall premenstrual symptoms ‐ Physical.
Figures and Tables -
Analysis 1.2

Comparison 1 Acupuncture vs sham acupuncture, Outcome 2 Overall premenstrual symptoms ‐ Physical.

Comparison 1 Acupuncture vs sham acupuncture, Outcome 3 Adverse events.
Figures and Tables -
Analysis 1.3

Comparison 1 Acupuncture vs sham acupuncture, Outcome 3 Adverse events.

Comparison 1 Acupuncture vs sham acupuncture, Outcome 4 Response rate.
Figures and Tables -
Analysis 1.4

Comparison 1 Acupuncture vs sham acupuncture, Outcome 4 Response rate.

Comparison 1 Acupuncture vs sham acupuncture, Outcome 5 Quality of Life.
Figures and Tables -
Analysis 1.5

Comparison 1 Acupuncture vs sham acupuncture, Outcome 5 Quality of Life.

Comparison 2 Acupuncture vs no treatment, Outcome 1 Overall premenstrual symptoms.
Figures and Tables -
Analysis 2.1

Comparison 2 Acupuncture vs no treatment, Outcome 1 Overall premenstrual symptoms.

Comparison 2 Acupuncture vs no treatment, Outcome 2 Adverse events.
Figures and Tables -
Analysis 2.2

Comparison 2 Acupuncture vs no treatment, Outcome 2 Adverse events.

Comparison 3 Acupressure vs sham acupressure, Outcome 1 Overall premenstrual symptoms.
Figures and Tables -
Analysis 3.1

Comparison 3 Acupressure vs sham acupressure, Outcome 1 Overall premenstrual symptoms.

Comparison 3 Acupressure vs sham acupressure, Outcome 2 Quality of Life.
Figures and Tables -
Analysis 3.2

Comparison 3 Acupressure vs sham acupressure, Outcome 2 Quality of Life.

Summary of findings for the main comparison. Acupuncture compared to sham for premenstrual syndrome

Acupuncture compared to sham acupuncture for premenstrual syndrome

Patient or population: women with PMS
Setting: outpatient clinic or community
Intervention: acupuncture
Comparison: sham acupuncture

Outcomes

Anticipated absolute effects* (95% CI)

Relative effect
(95% CI)

№ of participants
(studies)

Certainty of the evidence
(GRADE)

Comments

Risk with sham acupuncture

Risk with Acupuncture

Overall premenstrual symptoms ‐ Mood at two month follow‐up. Assessed with: Daily Record of Severity of Problems (DRSP) scale. Scale range for DRSP (mood) 0‐60.

Lower scores indicate less severe premenstrual symptoms

The mean overall premenstrual symptoms‐ Mood was 51.91

MD 9.03 lower
(10.71 lower to 7.35 lower)

67
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupuncture may reduce the number of mood PMS related symptoms women have, compared to sham acupuncture.

Overall premenstrual symptoms ‐ Physical at two month follow‐up.Assessed with: DRSP.
Scale range for DRSP (physical) 0‐66.
Lower scores indicate less severe premenstrual symptoms

The mean overall premenstrual symptoms ‐ Physical was 46.11

MD 9.11 lower
(10.82 lower to 7.40 lower)

67
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupuncture may reduce the number of physical PMS related symptoms women have, compared to sham acupuncture.

Adverse events at up to 2 months' follow‐up

24 per 1000

42 per 1000
(10 to 189)

RR 1.74
(0.39 to 7.76)

167
(3 RCTs)

⊕⊝⊝⊝
VERY LOW 2 3

There was insufficient evidence to determine whether there was any difference between the groups in the rate of adverse events.

Specific PMS symptoms ‐ not measured

No studies reported on this outcome

Response rate at end of treatment

327 per 1000

1000 per 1000
(147 to 1000)

RR 4.22
(0.45 to 39.88)

100
(2 RCTs)

⊕⊝⊝⊝
VERY LOW 4 5 6

There was insufficient evidence to determine whether there was any difference in response rate between the groups.

Quality of Life
assessed with: WHOQOL‐BREF.

Scale range 0‐100.

Higher scores indicate greater quality of life.

The mean quality of life score was 73.8

MD 2.85 higher
(1.47 higher to 4.23 higher)

67
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupuncture may improve quality of life, compared to sham acupuncture.

*The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).

CI: Confidence interval; MD: mean difference; RCT: randomised controlled trial; RR: risk ratio

GRADE Working Group grades of evidence
High certainty: We are very confident that the true effect lies close to that of the estimate of the effect
Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different
Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect
Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect

1 Downgraded two levels due to very serious imprecision: small sample size (67 participants)

2 Downgraded one level due to serious risk of bias: unclear risk of blinding of both participants and assessors in one of the three trials. This may have affected adverse event reporting.

3 Downgraded two levels due to very serious imprecision: small sample size (167 participants) and rare events. 95% confidence interval covers both benefit and harm

4 Downgraded two levels due to very serious imprecision: small sample size (100 participants)

5 Downgraded one level due to serious risk of bias: unclear risk of blinding of both participants and assessors in one of the two included trials. This may be affected response rate.

6 Downgraded one level due to serious inconsistency: significant heterogeneity (I2 = 82%) due to different definitions of response rate.

Figures and Tables -
Summary of findings for the main comparison. Acupuncture compared to sham for premenstrual syndrome
Summary of findings 2. Acupuncture compared to no treatment for premenstrual syndrome

Acupuncture compared to no treatment for premenstrual syndrome

Patient or population: women with PMS
Setting: community
Intervention: acupuncture
Comparison: no treatment

Outcomes

Anticipated absolute effects* (95% CI)

Relative effect
(95% CI)

№ of participants
(studies)

Certainty of the evidence
(GRADE)

Comments

Risk with no treatment

Risk with Acupuncture

Overall premenstrual symptoms at end of treatment. Assessed with the Menstrual Symptom Severity List (MSSL).

Scale range 0‐33.

Lower scores indicate less severe symptoms.

The mean overall premenstrual symptom score was 17.54

MD 13.60 lower
(15.70 lower to 11.50 lower)

14
(1 RCT)

⊕⊝⊝⊝
VERY LOW 1 2

Due to the very low quality of the evidence, we are uncertain whether acupuncture reduces PMS symptoms compared to a no treatment control

Adverse events at end of treatment

0 per 1000

0 per 1000
(0 to 0)

not estimable

20
(1 RCT)

⊕⊝⊝⊝
VERY LOW 1 3

Neither group reported any adverse events

Specific PMS symptoms ‐ not measured

No studies reported on this outcome

Response rate ‐ not measured

No studies reported on this outcome

Quality of Life ‐ not measured

No studies reported on this outcome

*The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).

CI: Confidence interval; MD: mean difference

GRADE Working Group grades of evidence
High certainty: We are very confident that the true effect lies close to that of the estimate of the effect
Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different
Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect
Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect

1 Downgraded one level due to serious risk of bias: unclear risk of bias in randomisation, high risk of bias in blinding of participants which is likely to affect patient reported outcome. High level of missing data.

2 Downgraded two levels due to very serious imprecision: very small sample size (14 participants)

3 Downgraded two levels due to very serious imprecision: very small sample size (14 participants) for rare events.

Figures and Tables -
Summary of findings 2. Acupuncture compared to no treatment for premenstrual syndrome
Summary of findings 3. Acupressure compared to sham acupressure for premenstrual syndrome

Acupressure compared to sham acupressure for premenstrual syndrome

Patient or population: women with PMS
Setting: community
Intervention: acupressure
Comparison: sham acupressure

Outcomes

Anticipated absolute effects* (95% CI)

Relative effect
(95% CI)

№ of participants
(studies)

Certainty of the evidence
(GRADE)

Comments

Risk with sham acupressure

Risk with Acupressure

Overall premenstrual symptoms at end of treatment. Assessed with: number of women reporting moderate or severe PMS symptoms at the end of the intervention. Lower scores indicate more women with mild PMS or no PMS symptoms.

97 per 100

62 per 100
(50 to 76)

RR 0.64 (0.52 to 0.79)

90
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupressure may reduce the number of women with moderate to severe PMS symptoms at the end of the trial compared to sham acupressure

Adverse events ‐ not measured

No studies reported on this outcome

Specific PMS symptoms ‐ not measured

No studies reported on this outcome

Response rate ‐ not measured

No studies reported on this outcome

Quality of Life ‐ Physical at end of treatment. Assessed with: SF‐12

Scale range 0‐100

Higher scores indicate greater quality of life

The mean quality of Life ‐ Physical score was 67.2

MD 24.30 higher
(17.18 higher to 31.42 higher)

90
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupressure may improve physical quality of life compared to sham acupressure.

Quality of Life ‐ Mental at end of treatment. Assessed with: SF‐12

Scale range 0‐100

Higher scores indicate greater quality of life

The mean quality of Life ‐ Mental score was 52.34

MD 17.17 higher
(13.08 higher to 21.26 higher)

90
(1 RCT)

⊕⊕⊝⊝
LOW 1

Acupressure may improve mental quality of life compared to sham acupressure.

*The risk in the intervention group (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).

CI: Confidence interval; RR: Risk ratio; MD: mean difference

GRADE Working Group grades of evidence
High certainty: We are very confident that the true effect lies close to that of the estimate of the effect
Moderate certainty: We are moderately confident in the effect estimate: The true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different
Low certainty: Our confidence in the effect estimate is limited: The true effect may be substantially different from the estimate of the effect
Very low certainty: We have very little confidence in the effect estimate: The true effect is likely to be substantially different from the estimate of effect

1 Downgraded two levels due to very serious imprecision: small sample size (90 participants)

Figures and Tables -
Summary of findings 3. Acupressure compared to sham acupressure for premenstrual syndrome
Comparison 1. Acupuncture vs sham acupuncture

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Overall premenstrual symptoms‐ Mood Show forest plot

1

67

Mean Difference (IV, Fixed, 95% CI)

‐9.03 [‐10.71, ‐7.35]

2 Overall premenstrual symptoms ‐ Physical Show forest plot

1

67

Mean Difference (IV, Fixed, 95% CI)

‐9.11 [‐10.82, ‐7.40]

3 Adverse events Show forest plot

3

167

Risk Ratio (M‐H, Fixed, 95% CI)

1.74 [0.39, 7.76]

4 Response rate Show forest plot

2

100

Risk Ratio (M‐H, Random, 95% CI)

4.22 [0.45, 39.88]

5 Quality of Life Show forest plot

1

67

Mean Difference (IV, Fixed, 95% CI)

2.85 [1.47, 4.23]

Figures and Tables -
Comparison 1. Acupuncture vs sham acupuncture
Comparison 2. Acupuncture vs no treatment

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Overall premenstrual symptoms Show forest plot

1

14

Mean Difference (IV, Fixed, 95% CI)

‐13.6 [‐15.70, ‐11.50]

2 Adverse events Show forest plot

1

20

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

Figures and Tables -
Comparison 2. Acupuncture vs no treatment
Comparison 3. Acupressure vs sham acupressure

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Overall premenstrual symptoms Show forest plot

1

90

Risk Ratio (M‐H, Fixed, 95% CI)

0.64 [0.52, 0.79]

2 Quality of Life Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Subtotals only

Figures and Tables -
Comparison 3. Acupressure vs sham acupressure