Copy deep-research skill from local Qoder installation to config repo for version control
354 lines
19 KiB
Markdown
354 lines
19 KiB
Markdown
# Systematic Review Toolkit — Reference Guide
|
|
|
|
## Purpose
|
|
|
|
Comprehensive reference for conducting systematic reviews and meta-analyses. Covers Cochrane methodology, PRISMA 2020 reporting, risk of bias instruments, heterogeneity interpretation, GRADE certainty framework, and protocol registration. Used by `risk_of_bias_agent`, `meta_analysis_agent`, `bibliography_agent`, and `report_compiler_agent`.
|
|
|
|
---
|
|
|
|
## 1. Cochrane Handbook v6.4 — Key Principles
|
|
|
|
The Cochrane Handbook for Systematic Reviews of Interventions (v6.4, 2023) is the gold standard reference for systematic review methodology.
|
|
|
|
### Core Methodology Stages
|
|
|
|
| Stage | Cochrane Chapter | Key Requirements |
|
|
|-------|-----------------|-----------------|
|
|
| Planning | Ch 1-3 | Protocol registration, clear objectives, PICOS |
|
|
| Searching | Ch 4 | Comprehensive search (≥ 2 databases), documented strategy |
|
|
| Selecting | Ch 4 | Independent dual screening, predefined criteria |
|
|
| Data extraction | Ch 5 | Standardized forms, pilot testing, dual extraction |
|
|
| Risk of bias | Ch 8 (RoB 2), Ch 25 (ROBINS-I) | Domain-based assessment, signaling questions |
|
|
| Synthesis | Ch 10-12 | Appropriate statistical methods, heterogeneity assessment |
|
|
| GRADE | Ch 14 | Certainty of evidence for each outcome |
|
|
| Reporting | Ch 15 | PRISMA 2020 compliance |
|
|
|
|
### Fundamental Principles
|
|
|
|
1. **A priori protocol**: Register the protocol before conducting the review (PROSPERO, OSF)
|
|
2. **Comprehensive searching**: Search multiple databases; do not rely on a single source
|
|
3. **Dual independent processes**: Two reviewers for screening, extraction, and risk of bias (at minimum for a subset)
|
|
4. **Pre-specified methods**: Analysis plan defined before seeing results
|
|
5. **Transparent reporting**: Document everything; another team should be able to replicate the review
|
|
|
|
---
|
|
|
|
## 2. PRISMA 2020 — Full 27-Item Checklist
|
|
|
|
**Full Name**: Preferred Reporting Items for Systematic Reviews and Meta-Analyses
|
|
**Reference**: Page et al. (2021). BMJ, 372, n71. https://doi.org/10.1136/bmj.n71
|
|
|
|
### Title and Abstract
|
|
|
|
| # | Item | Guidance |
|
|
|---|------|---------|
|
|
| 1 | **Title** | Identify the report as a systematic review, meta-analysis, or both |
|
|
| 2 | **Abstract** | Structured summary: background, objectives, data sources, study eligibility criteria, participants, interventions, study appraisal/synthesis methods, results, limitations, conclusions, registration number |
|
|
|
|
### Introduction
|
|
|
|
| # | Item | Guidance |
|
|
|---|------|---------|
|
|
| 3 | **Rationale** | Describe the rationale for the review in the context of existing knowledge |
|
|
| 4 | **Objectives** | Provide an explicit statement of the questions being addressed with reference to PICOS |
|
|
|
|
### Methods
|
|
|
|
| # | Item | Guidance |
|
|
|---|------|---------|
|
|
| 5 | **Eligibility criteria** | Specify inclusion and exclusion criteria (PICOS components, date range, language, publication status) |
|
|
| 6 | **Information sources** | Describe all information sources searched (databases, registers, websites, organizations, reference lists) with dates |
|
|
| 7 | **Search strategy** | Present the complete search strategy for at least one database, including any filters and limits |
|
|
| 8 | **Selection process** | State methods for deciding which studies met eligibility criteria (number of reviewers, consensus process) |
|
|
| 9 | **Data collection process** | Describe methods for extracting data (number of reviewers, whether independently, any processes for obtaining/confirming data from investigators) |
|
|
| 10 | **Data items** | List and define all outcome variables and other variables extracted |
|
|
| 11 | **Study risk of bias assessment** | Describe methods for assessing risk of bias in included studies, including tools used and how results were used in synthesis |
|
|
| 12 | **Effect measures** | Specify for each outcome the effect measure(s) used (e.g., RR, MD, SMD) |
|
|
| 13a | **Synthesis methods** | Describe the processes used to decide which studies were eligible for each synthesis |
|
|
| 13b | | Describe any methods required to prepare the data for synthesis (e.g., handling multi-arm studies) |
|
|
| 13c | | Describe any methods used to tabulate or visually display results of individual studies and syntheses |
|
|
| 13d | | Describe any methods used to synthesize results and rationale (meta-analysis: model, software; narrative: SWiM) |
|
|
| 13e | | Describe any methods used to explore possible causes of heterogeneity (subgroup, meta-regression) |
|
|
| 13f | | Describe any sensitivity analyses conducted |
|
|
| 14 | **Reporting bias assessment** | Describe any methods used to assess risk of bias due to missing results (publication bias) |
|
|
| 15 | **Certainty assessment** | Describe any methods used to assess certainty in the body of evidence (e.g., GRADE) |
|
|
|
|
### Results
|
|
|
|
| # | Item | Guidance |
|
|
|---|------|---------|
|
|
| 16a | **Study selection** | Describe results of the search and selection process, ideally using a PRISMA flow diagram |
|
|
| 16b | | Cite studies that appeared to meet inclusion criteria but were excluded, and explain why |
|
|
| 17 | **Study characteristics** | For each included study cite it and present its characteristics |
|
|
| 18 | **Risk of bias in studies** | Present assessments of risk of bias for each included study |
|
|
| 19 | **Results of individual studies** | For all outcomes, present for each study: summary data, effect estimates and CIs, results of syntheses |
|
|
| 20a | **Results of syntheses** | For each synthesis, briefly summarize the characteristics and risk of bias among contributing studies |
|
|
| 20b | | Present results of all statistical syntheses conducted, including CIs and measures of heterogeneity |
|
|
| 20c | | Present results of all investigations of possible causes of heterogeneity |
|
|
| 20d | | Present results of all sensitivity analyses |
|
|
| 21 | **Reporting biases** | Present assessments of risk of bias due to missing results |
|
|
| 22 | **Certainty of evidence** | Present assessments of certainty of evidence for each outcome assessed |
|
|
|
|
### Discussion
|
|
|
|
| # | Item | Guidance |
|
|
|---|------|---------|
|
|
| 23 | **Discussion** | Provide a general interpretation of results in the context of other evidence, discuss limitations of the evidence and of the review process, implications |
|
|
| 24 | **Registration and protocol** | Provide registration information including register name and registration number, and a link to the protocol |
|
|
| 25 | **Support** | Describe sources of financial or non-financial support and the role of funders |
|
|
| 26 | **Competing interests** | Declare any competing interests of review authors |
|
|
| 27 | **Availability of data, code, and other materials** | Report which of the following are publicly available: template data collection forms, data extracted from included studies, analysis code, any other materials |
|
|
|
|
### PRISMA 2020 Flow Diagram
|
|
|
|
```
|
|
┌─────────────────────────────────────────────────────┐
|
|
│ IDENTIFICATION │
|
|
├─────────────────────────────────────────────────────┤
|
|
│ Records identified from databases (n = ) │
|
|
│ Records identified from other sources (n = ) │
|
|
└──────────────────────┬──────────────────────────────┘
|
|
│
|
|
┌──────────────────────▼──────────────────────────────┐
|
|
│ Records removed before screening: │
|
|
│ Duplicate records (n = ) │
|
|
│ Records marked as ineligible by automation (n = ) │
|
|
│ Records removed for other reasons (n = ) │
|
|
└──────────────────────┬──────────────────────────────┘
|
|
│
|
|
┌──────────────────────▼──────────────────────────────┐
|
|
│ SCREENING │
|
|
├─────────────────────────────────────────────────────┤
|
|
│ Records screened (n = ) │
|
|
│ Records excluded (n = ) │
|
|
└──────────────────────┬──────────────────────────────┘
|
|
│
|
|
┌──────────────────────▼──────────────────────────────┐
|
|
│ Reports sought for retrieval (n = ) │
|
|
│ Reports not retrieved (n = ) │
|
|
└──────────────────────┬──────────────────────────────┘
|
|
│
|
|
┌──────────────────────▼──────────────────────────────┐
|
|
│ Reports assessed for eligibility (n = ) │
|
|
│ Reports excluded, with reasons (n = ) │
|
|
│ Reason 1 (n = ) │
|
|
│ Reason 2 (n = ) │
|
|
│ Reason 3 (n = ) │
|
|
└──────────────────────┬──────────────────────────────┘
|
|
│
|
|
┌──────────────────────▼──────────────────────────────┐
|
|
│ INCLUDED │
|
|
├─────────────────────────────────────────────────────┤
|
|
│ Studies included in review (n = ) │
|
|
│ Reports of included studies (n = ) │
|
|
│ │
|
|
│ Studies included in quantitative synthesis (n = ) │
|
|
└─────────────────────────────────────────────────────┘
|
|
```
|
|
|
|
---
|
|
|
|
## 3. RoB 2 Instrument Summary
|
|
|
|
**Full Name**: Risk of Bias tool for randomized trials (version 2)
|
|
**Reference**: Sterne et al. (2019). BMJ, 366, l4898. https://doi.org/10.1136/bmj.l4898
|
|
|
|
### Domains
|
|
|
|
| Domain | Abbreviation | Focus |
|
|
|--------|-------------|-------|
|
|
| Bias arising from the randomization process | D1 | Sequence generation, allocation concealment, baseline balance |
|
|
| Bias due to deviations from intended interventions | D2 | Blinding, protocol adherence, ITT analysis |
|
|
| Bias due to missing outcome data | D3 | Completeness, differential dropout, handling of missing data |
|
|
| Bias in measurement of the outcome | D4 | Outcome assessment method, blinding of assessors |
|
|
| Bias in selection of the reported result | D5 | Pre-registration, selective reporting |
|
|
|
|
### Judgment Scale
|
|
|
|
- **Low risk of bias**: The study is judged to be at low risk of bias for this domain
|
|
- **Some concerns**: The study raises some concerns about bias for this domain
|
|
- **High risk of bias**: The study is judged to be at high risk of bias for this domain
|
|
|
|
### Overall Judgment Algorithm
|
|
|
|
- All domains Low → Overall **Low**
|
|
- Some Concerns in ≥ 1 domain, no High → Overall **Some Concerns**
|
|
- High in ≥ 1 domain → Overall **High**
|
|
|
|
---
|
|
|
|
## 4. ROBINS-I Summary
|
|
|
|
**Full Name**: Risk Of Bias In Non-randomized Studies of Interventions
|
|
**Reference**: Sterne et al. (2016). BMJ, 355, i4919. https://doi.org/10.1136/bmj.i4919
|
|
|
|
### Domains (7 domains spanning 3 time points)
|
|
|
|
**Pre-intervention**:
|
|
- D1: Bias due to confounding
|
|
- D2: Bias in selection of participants into the study
|
|
|
|
**At intervention**:
|
|
- D3: Bias in classification of interventions
|
|
|
|
**Post-intervention**:
|
|
- D4: Bias due to deviations from intended interventions
|
|
- D5: Bias due to missing data
|
|
- D6: Bias in measurement of outcomes
|
|
- D7: Bias in selection of the reported result
|
|
|
|
### Judgment Scale
|
|
|
|
- **Low risk**: Comparable to a well-performed RCT
|
|
- **Moderate risk**: Sound for a non-randomized study but cannot be considered comparable to a well-performed RCT
|
|
- **Serious risk**: Some important problems
|
|
- **Critical risk**: Study is too problematic to provide useful evidence
|
|
- **No information**: Insufficient reporting
|
|
|
|
---
|
|
|
|
## 5. I² Interpretation Guide
|
|
|
|
| I² Range | Label | What It Means | Action |
|
|
|----------|-------|---------------|--------|
|
|
| 0-40% | Low | Heterogeneity might not be important | Proceed with pooling; report I² |
|
|
| 30-60% | Moderate | May represent moderate heterogeneity | Proceed with pooling; investigate sources |
|
|
| 50-90% | Substantial | Substantial heterogeneity | Investigate sources; consider subgroup analyses; report prediction interval |
|
|
| 75-100% | Considerable | Considerable heterogeneity | Question whether pooling is meaningful; consider narrative synthesis |
|
|
|
|
**Important caveats**:
|
|
- Ranges overlap intentionally (per Cochrane Handbook 10.10.2)
|
|
- I² significance depends on: magnitude of effects, p-value from Q-test, and visual inspection of forest plot
|
|
- A high I² with all effects in the same direction is less concerning than moderate I² with effects crossing zero
|
|
- I² is influenced by precision of studies — many precise studies can yield high I² even with small absolute differences
|
|
- Always report the 95% CI for I² (which can be very wide with few studies)
|
|
|
|
---
|
|
|
|
## 6. GRADE Certainty of Evidence Framework
|
|
|
|
**Full Name**: Grading of Recommendations, Assessment, Development and Evaluations
|
|
**Reference**: Guyatt et al. (2008). BMJ, 336, 924-926.
|
|
|
|
### Starting Points
|
|
|
|
| Study Design | Starting Certainty |
|
|
|-------------|-------------------|
|
|
| Randomized trials | HIGH (⊕⊕⊕⊕) |
|
|
| Non-randomized studies | LOW (⊕⊕◯◯) |
|
|
|
|
### Factors That Lower Certainty (Rate Down)
|
|
|
|
| Factor | Rate Down | When to Apply |
|
|
|--------|-----------|---------------|
|
|
| Risk of bias | -1 or -2 | Serious or very serious limitations in study design/execution |
|
|
| Inconsistency | -1 or -2 | Unexplained heterogeneity (I² > 50%, different directions of effect) |
|
|
| Indirectness | -1 or -2 | Evidence does not directly address the PICOS of the review question |
|
|
| Imprecision | -1 or -2 | Wide CIs, small sample sizes, CIs cross clinical decision threshold |
|
|
| Publication bias | -1 | Funnel plot asymmetry, small study effects, known unpublished trials |
|
|
|
|
### Factors That Raise Certainty (Rate Up — Observational Studies Only)
|
|
|
|
| Factor | Rate Up | When to Apply |
|
|
|--------|---------|---------------|
|
|
| Large effect | +1 or +2 | RR > 2 or < 0.5 (large), RR > 5 or < 0.2 (very large), without confounders |
|
|
| Dose-response gradient | +1 | Clear dose-response relationship observed |
|
|
| Plausible confounding | +1 | All plausible confounders would reduce the observed effect |
|
|
|
|
### Certainty Levels
|
|
|
|
| Level | Symbol | Meaning |
|
|
|-------|--------|---------|
|
|
| High | ⊕⊕⊕⊕ | Very confident the true effect lies close to the estimate |
|
|
| Moderate | ⊕⊕⊕◯ | Moderately confident; the true effect is likely close but may be substantially different |
|
|
| Low | ⊕⊕◯◯ | Limited confidence; the true effect may be substantially different |
|
|
| Very Low | ⊕◯◯◯ | Very little confidence; the true effect is likely substantially different |
|
|
|
|
---
|
|
|
|
## 7. Protocol Registration Guidance
|
|
|
|
### When to Register
|
|
|
|
- **Always** for systematic reviews intended for publication
|
|
- **Before** starting the literature search
|
|
- Registration prevents outcome reporting bias and demonstrates a priori planning
|
|
|
|
### Where to Register
|
|
|
|
| Platform | Focus | Cost | URL |
|
|
|----------|-------|------|-----|
|
|
| **PROSPERO** | Health-related systematic reviews | Free | crd.york.ac.uk/prospero |
|
|
| **OSF Registries** | Any discipline | Free | osf.io/registries |
|
|
| **INPLASY** | Any discipline | ~$40 | inplasy.com |
|
|
| **Research Registry** | Any discipline | Free for systematic reviews | researchregistry.com |
|
|
|
|
### Protocol Content (PRISMA-P 2015)
|
|
|
|
See `templates/prisma_protocol_template.md` for the complete protocol template.
|
|
|
|
Key sections:
|
|
1. Title, registration, authors, amendments
|
|
2. Rationale, objectives, PICOS eligibility criteria
|
|
3. Information sources, search strategy, study records management
|
|
4. Data extraction, risk of bias assessment, data synthesis plan
|
|
5. Meta-bias assessment, confidence in cumulative evidence
|
|
|
|
---
|
|
|
|
## 8. Software and Tools
|
|
|
|
### Statistical Software for Meta-Analysis
|
|
|
|
| Tool | Language | Best For | Key References |
|
|
|------|----------|----------|---------------|
|
|
| **metafor** (R) | R | Comprehensive meta-analysis (all models, diagnostics) | Viechtbauer (2010) |
|
|
| **meta** (R) | R | User-friendly standard meta-analyses | Balduzzi et al. (2019) |
|
|
| **dmetar** (R) | R | Companion to "Doing Meta-Analysis in R" textbook | Harrer et al. (2021) |
|
|
| **RevMan** | Standalone | Cochrane reviews (required for Cochrane) | Cochrane Collaboration |
|
|
| **robvis** (R) | R | Risk of bias visualization (traffic-light plots) | McGuinness & Higgins (2020) |
|
|
| **GRADE pro GDT** | Web-based | GRADE Summary of Findings tables | McMaster University |
|
|
|
|
### Screening and Management Tools
|
|
|
|
| Tool | Purpose | Cost |
|
|
|------|---------|------|
|
|
| **Covidence** | Study screening, data extraction, RoB | Paid (free Cochrane license) |
|
|
| **Rayyan** | Abstract screening (AI-assisted) | Free |
|
|
| **EPPI-Reviewer** | Full review management | Paid |
|
|
| **ASReview** | AI-assisted screening | Free (open source) |
|
|
| **Zotero/Mendeley** | Reference management | Free |
|
|
|
|
---
|
|
|
|
## Quick Decision Guide
|
|
|
|
```
|
|
Starting a systematic review?
|
|
│
|
|
├── 1. Register your protocol
|
|
│ └── PROSPERO (health) or OSF (any field)
|
|
│
|
|
├── 2. Write your protocol
|
|
│ └── Use PRISMA-P template → templates/prisma_protocol_template.md
|
|
│
|
|
├── 3. Search systematically
|
|
│ └── ≥ 2 databases, document everything, PRISMA flow
|
|
│
|
|
├── 4. Screen and select
|
|
│ └── Dual screening, predefined criteria
|
|
│
|
|
├── 5. Assess risk of bias
|
|
│ └── RCTs → RoB 2 | Non-randomized → ROBINS-I
|
|
│
|
|
├── 6. Synthesize evidence
|
|
│ ├── Quantitative data + comparable studies → Meta-analysis
|
|
│ └── Otherwise → Narrative synthesis (SWiM)
|
|
│
|
|
├── 7. Assess certainty
|
|
│ └── GRADE for each outcome
|
|
│
|
|
└── 8. Report
|
|
└── PRISMA 2020 checklist → templates/prisma_report_template.md
|
|
```
|