Files
qoder-config/skills/deep-research/references/systematic_review_toolkit.md
aszerW f571b20598 feat(skills): add deep-research skill
Copy deep-research skill from local Qoder installation to config repo for version control
2026-06-06 13:22:55 +08:00

19 KiB

Systematic Review Toolkit — Reference Guide

Purpose

Comprehensive reference for conducting systematic reviews and meta-analyses. Covers Cochrane methodology, PRISMA 2020 reporting, risk of bias instruments, heterogeneity interpretation, GRADE certainty framework, and protocol registration. Used by risk_of_bias_agent, meta_analysis_agent, bibliography_agent, and report_compiler_agent.


1. Cochrane Handbook v6.4 — Key Principles

The Cochrane Handbook for Systematic Reviews of Interventions (v6.4, 2023) is the gold standard reference for systematic review methodology.

Core Methodology Stages

Stage Cochrane Chapter Key Requirements
Planning Ch 1-3 Protocol registration, clear objectives, PICOS
Searching Ch 4 Comprehensive search (≥ 2 databases), documented strategy
Selecting Ch 4 Independent dual screening, predefined criteria
Data extraction Ch 5 Standardized forms, pilot testing, dual extraction
Risk of bias Ch 8 (RoB 2), Ch 25 (ROBINS-I) Domain-based assessment, signaling questions
Synthesis Ch 10-12 Appropriate statistical methods, heterogeneity assessment
GRADE Ch 14 Certainty of evidence for each outcome
Reporting Ch 15 PRISMA 2020 compliance

Fundamental Principles

  1. A priori protocol: Register the protocol before conducting the review (PROSPERO, OSF)
  2. Comprehensive searching: Search multiple databases; do not rely on a single source
  3. Dual independent processes: Two reviewers for screening, extraction, and risk of bias (at minimum for a subset)
  4. Pre-specified methods: Analysis plan defined before seeing results
  5. Transparent reporting: Document everything; another team should be able to replicate the review

2. PRISMA 2020 — Full 27-Item Checklist

Full Name: Preferred Reporting Items for Systematic Reviews and Meta-Analyses Reference: Page et al. (2021). BMJ, 372, n71. https://doi.org/10.1136/bmj.n71

Title and Abstract

# Item Guidance
1 Title Identify the report as a systematic review, meta-analysis, or both
2 Abstract Structured summary: background, objectives, data sources, study eligibility criteria, participants, interventions, study appraisal/synthesis methods, results, limitations, conclusions, registration number

Introduction

# Item Guidance
3 Rationale Describe the rationale for the review in the context of existing knowledge
4 Objectives Provide an explicit statement of the questions being addressed with reference to PICOS

Methods

# Item Guidance
5 Eligibility criteria Specify inclusion and exclusion criteria (PICOS components, date range, language, publication status)
6 Information sources Describe all information sources searched (databases, registers, websites, organizations, reference lists) with dates
7 Search strategy Present the complete search strategy for at least one database, including any filters and limits
8 Selection process State methods for deciding which studies met eligibility criteria (number of reviewers, consensus process)
9 Data collection process Describe methods for extracting data (number of reviewers, whether independently, any processes for obtaining/confirming data from investigators)
10 Data items List and define all outcome variables and other variables extracted
11 Study risk of bias assessment Describe methods for assessing risk of bias in included studies, including tools used and how results were used in synthesis
12 Effect measures Specify for each outcome the effect measure(s) used (e.g., RR, MD, SMD)
13a Synthesis methods Describe the processes used to decide which studies were eligible for each synthesis
13b Describe any methods required to prepare the data for synthesis (e.g., handling multi-arm studies)
13c Describe any methods used to tabulate or visually display results of individual studies and syntheses
13d Describe any methods used to synthesize results and rationale (meta-analysis: model, software; narrative: SWiM)
13e Describe any methods used to explore possible causes of heterogeneity (subgroup, meta-regression)
13f Describe any sensitivity analyses conducted
14 Reporting bias assessment Describe any methods used to assess risk of bias due to missing results (publication bias)
15 Certainty assessment Describe any methods used to assess certainty in the body of evidence (e.g., GRADE)

Results

# Item Guidance
16a Study selection Describe results of the search and selection process, ideally using a PRISMA flow diagram
16b Cite studies that appeared to meet inclusion criteria but were excluded, and explain why
17 Study characteristics For each included study cite it and present its characteristics
18 Risk of bias in studies Present assessments of risk of bias for each included study
19 Results of individual studies For all outcomes, present for each study: summary data, effect estimates and CIs, results of syntheses
20a Results of syntheses For each synthesis, briefly summarize the characteristics and risk of bias among contributing studies
20b Present results of all statistical syntheses conducted, including CIs and measures of heterogeneity
20c Present results of all investigations of possible causes of heterogeneity
20d Present results of all sensitivity analyses
21 Reporting biases Present assessments of risk of bias due to missing results
22 Certainty of evidence Present assessments of certainty of evidence for each outcome assessed

Discussion

# Item Guidance
23 Discussion Provide a general interpretation of results in the context of other evidence, discuss limitations of the evidence and of the review process, implications
24 Registration and protocol Provide registration information including register name and registration number, and a link to the protocol
25 Support Describe sources of financial or non-financial support and the role of funders
26 Competing interests Declare any competing interests of review authors
27 Availability of data, code, and other materials Report which of the following are publicly available: template data collection forms, data extracted from included studies, analysis code, any other materials

PRISMA 2020 Flow Diagram

 ┌─────────────────────────────────────────────────────┐
 │                 IDENTIFICATION                      │
 ├─────────────────────────────────────────────────────┤
 │ Records identified from databases (n = )            │
 │ Records identified from other sources (n = )        │
 └──────────────────────┬──────────────────────────────┘
                        │
 ┌──────────────────────▼──────────────────────────────┐
 │ Records removed before screening:                   │
 │   Duplicate records (n = )                          │
 │   Records marked as ineligible by automation (n = ) │
 │   Records removed for other reasons (n = )          │
 └──────────────────────┬──────────────────────────────┘
                        │
 ┌──────────────────────▼──────────────────────────────┐
 │                  SCREENING                          │
 ├─────────────────────────────────────────────────────┤
 │ Records screened (n = )                             │
 │ Records excluded (n = )                             │
 └──────────────────────┬──────────────────────────────┘
                        │
 ┌──────────────────────▼──────────────────────────────┐
 │ Reports sought for retrieval (n = )                 │
 │ Reports not retrieved (n = )                        │
 └──────────────────────┬──────────────────────────────┘
                        │
 ┌──────────────────────▼──────────────────────────────┐
 │ Reports assessed for eligibility (n = )             │
 │ Reports excluded, with reasons (n = )               │
 │   Reason 1 (n = )                                   │
 │   Reason 2 (n = )                                   │
 │   Reason 3 (n = )                                   │
 └──────────────────────┬──────────────────────────────┘
                        │
 ┌──────────────────────▼──────────────────────────────┐
 │                  INCLUDED                           │
 ├─────────────────────────────────────────────────────┤
 │ Studies included in review (n = )                   │
 │ Reports of included studies (n = )                  │
 │                                                     │
 │ Studies included in quantitative synthesis (n = )   │
 └─────────────────────────────────────────────────────┘

3. RoB 2 Instrument Summary

Full Name: Risk of Bias tool for randomized trials (version 2) Reference: Sterne et al. (2019). BMJ, 366, l4898. https://doi.org/10.1136/bmj.l4898

Domains

Domain Abbreviation Focus
Bias arising from the randomization process D1 Sequence generation, allocation concealment, baseline balance
Bias due to deviations from intended interventions D2 Blinding, protocol adherence, ITT analysis
Bias due to missing outcome data D3 Completeness, differential dropout, handling of missing data
Bias in measurement of the outcome D4 Outcome assessment method, blinding of assessors
Bias in selection of the reported result D5 Pre-registration, selective reporting

Judgment Scale

  • Low risk of bias: The study is judged to be at low risk of bias for this domain
  • Some concerns: The study raises some concerns about bias for this domain
  • High risk of bias: The study is judged to be at high risk of bias for this domain

Overall Judgment Algorithm

  • All domains Low → Overall Low
  • Some Concerns in ≥ 1 domain, no High → Overall Some Concerns
  • High in ≥ 1 domain → Overall High

4. ROBINS-I Summary

Full Name: Risk Of Bias In Non-randomized Studies of Interventions Reference: Sterne et al. (2016). BMJ, 355, i4919. https://doi.org/10.1136/bmj.i4919

Domains (7 domains spanning 3 time points)

Pre-intervention:

  • D1: Bias due to confounding
  • D2: Bias in selection of participants into the study

At intervention:

  • D3: Bias in classification of interventions

Post-intervention:

  • D4: Bias due to deviations from intended interventions
  • D5: Bias due to missing data
  • D6: Bias in measurement of outcomes
  • D7: Bias in selection of the reported result

Judgment Scale

  • Low risk: Comparable to a well-performed RCT
  • Moderate risk: Sound for a non-randomized study but cannot be considered comparable to a well-performed RCT
  • Serious risk: Some important problems
  • Critical risk: Study is too problematic to provide useful evidence
  • No information: Insufficient reporting

5. I² Interpretation Guide

I² Range Label What It Means Action
0-40% Low Heterogeneity might not be important Proceed with pooling; report I²
30-60% Moderate May represent moderate heterogeneity Proceed with pooling; investigate sources
50-90% Substantial Substantial heterogeneity Investigate sources; consider subgroup analyses; report prediction interval
75-100% Considerable Considerable heterogeneity Question whether pooling is meaningful; consider narrative synthesis

Important caveats:

  • Ranges overlap intentionally (per Cochrane Handbook 10.10.2)
  • I² significance depends on: magnitude of effects, p-value from Q-test, and visual inspection of forest plot
  • A high I² with all effects in the same direction is less concerning than moderate I² with effects crossing zero
  • I² is influenced by precision of studies — many precise studies can yield high I² even with small absolute differences
  • Always report the 95% CI for I² (which can be very wide with few studies)

6. GRADE Certainty of Evidence Framework

Full Name: Grading of Recommendations, Assessment, Development and Evaluations Reference: Guyatt et al. (2008). BMJ, 336, 924-926.

Starting Points

Study Design Starting Certainty
Randomized trials HIGH (⊕⊕⊕⊕)
Non-randomized studies LOW (⊕⊕◯◯)

Factors That Lower Certainty (Rate Down)

Factor Rate Down When to Apply
Risk of bias -1 or -2 Serious or very serious limitations in study design/execution
Inconsistency -1 or -2 Unexplained heterogeneity (I² > 50%, different directions of effect)
Indirectness -1 or -2 Evidence does not directly address the PICOS of the review question
Imprecision -1 or -2 Wide CIs, small sample sizes, CIs cross clinical decision threshold
Publication bias -1 Funnel plot asymmetry, small study effects, known unpublished trials

Factors That Raise Certainty (Rate Up — Observational Studies Only)

Factor Rate Up When to Apply
Large effect +1 or +2 RR > 2 or < 0.5 (large), RR > 5 or < 0.2 (very large), without confounders
Dose-response gradient +1 Clear dose-response relationship observed
Plausible confounding +1 All plausible confounders would reduce the observed effect

Certainty Levels

Level Symbol Meaning
High ⊕⊕⊕⊕ Very confident the true effect lies close to the estimate
Moderate ⊕⊕⊕◯ Moderately confident; the true effect is likely close but may be substantially different
Low ⊕⊕◯◯ Limited confidence; the true effect may be substantially different
Very Low ⊕◯◯◯ Very little confidence; the true effect is likely substantially different

7. Protocol Registration Guidance

When to Register

  • Always for systematic reviews intended for publication
  • Before starting the literature search
  • Registration prevents outcome reporting bias and demonstrates a priori planning

Where to Register

Platform Focus Cost URL
PROSPERO Health-related systematic reviews Free crd.york.ac.uk/prospero
OSF Registries Any discipline Free osf.io/registries
INPLASY Any discipline ~$40 inplasy.com
Research Registry Any discipline Free for systematic reviews researchregistry.com

Protocol Content (PRISMA-P 2015)

See templates/prisma_protocol_template.md for the complete protocol template.

Key sections:

  1. Title, registration, authors, amendments
  2. Rationale, objectives, PICOS eligibility criteria
  3. Information sources, search strategy, study records management
  4. Data extraction, risk of bias assessment, data synthesis plan
  5. Meta-bias assessment, confidence in cumulative evidence

8. Software and Tools

Statistical Software for Meta-Analysis

Tool Language Best For Key References
metafor (R) R Comprehensive meta-analysis (all models, diagnostics) Viechtbauer (2010)
meta (R) R User-friendly standard meta-analyses Balduzzi et al. (2019)
dmetar (R) R Companion to "Doing Meta-Analysis in R" textbook Harrer et al. (2021)
RevMan Standalone Cochrane reviews (required for Cochrane) Cochrane Collaboration
robvis (R) R Risk of bias visualization (traffic-light plots) McGuinness & Higgins (2020)
GRADE pro GDT Web-based GRADE Summary of Findings tables McMaster University

Screening and Management Tools

Tool Purpose Cost
Covidence Study screening, data extraction, RoB Paid (free Cochrane license)
Rayyan Abstract screening (AI-assisted) Free
EPPI-Reviewer Full review management Paid
ASReview AI-assisted screening Free (open source)
Zotero/Mendeley Reference management Free

Quick Decision Guide

Starting a systematic review?
│
├── 1. Register your protocol
│   └── PROSPERO (health) or OSF (any field)
│
├── 2. Write your protocol
│   └── Use PRISMA-P template → templates/prisma_protocol_template.md
│
├── 3. Search systematically
│   └── ≥ 2 databases, document everything, PRISMA flow
│
├── 4. Screen and select
│   └── Dual screening, predefined criteria
│
├── 5. Assess risk of bias
│   └── RCTs → RoB 2 | Non-randomized → ROBINS-I
│
├── 6. Synthesize evidence
│   ├── Quantitative data + comparable studies → Meta-analysis
│   └── Otherwise → Narrative synthesis (SWiM)
│
├── 7. Assess certainty
│   └── GRADE for each outcome
│
└── 8. Report
    └── PRISMA 2020 checklist → templates/prisma_report_template.md