Guides: AI and Text Mining for Searching and Screening the Literature: Tools for screening (study selection)

Tools for screening (study selection)

Given the high sensitivity/recall of most knowledge synthesis search strategies and the often associated low precision/specificity of the results (e.g., high number of false positives), researchers are investigating the feasibility of using text mining and machine learning in the record screening phase, to reduce the burden on reviewers while still capturing relevant studies from the search set. I recommend that librarians understand what software is available for this, to allow them to advise users on their options.

The approaches that have been explored can be generally categorized into the following:

Improving workflow through screening prioritisation - for process parallelisation, allowing reviewers to perform tasks in parallel (e.g., prioritising relevant records in screening phase so that full texts, data extraction, and synthesis can begin earlier)
Using software as a second reviewer
Speeding up the screening process

Hamel et al. (2021) provide guidance on using artificial intelligence (including text mining) for title and abstract screening in systematic reviews and other knowledge syntheses.

Screening: AI and text mining tools for non-programmers

Digital Evidence Synthesis Tool (DEST) Evaluations - Examines automation tools in evidence synthesis, including tools available for the screening stage

Use filters in the left column to limit to, e.g., Evidence synthesis stage: Screening, then click on "List records" button at the top
Focuses on tools for health and climate change syntheses, but tools are often agnostic or applicable to fields more generally speaking, like health sciences

Abstrackr
Semi-automated screening program. Free with sign up. To add records from EndNote: Try selecting all deduplicated records in EndNote and go to File > Export... > Type:*.txt (NOT .xml), Output Style: RefMan (RIS) Export
ASReview
"Active learning for systematic reviews"

Requires installing Python and running commands, but instructions are easy to follow
Uploaded references should already be deduplicated
See: van de Schoot, R., de Bruin, J., Schram, R., Zahedi, P., de Boer, J., Weijdema, F., Kramer, B., Huijts, M., Hoogerwerf, M., Ferdinands, G., Harkema, A., Willemsen, J., Ma, Y., Fang, Q., Hindriks, S., Tummers, L., & Oberski, D. L. (2021). An open source machine learning framework for efficient and transparent systematic reviews. Nature Machine Intelligence, 3(2), 125-133. https://doi.org/10.1038/s42256-020-00287-7

Colandr
Computer assistance for screening and metadata extraction. Uses machine learning, natural language processing, and text mining. Open source. Free with sign up.
Covidence at McGill
Web-based tool available to members of the McGill community that will help you through the process of importing records, screening references, data extraction, and keeping track of your work
more...less...
Sorting records by "most relevant" uses machine learning for screening prioritization. Learn more: https://support.covidence.org/help/sorting-the-title-abstract-screening-list
DistillerSR
Software uses natural language processing to automate the screening process by sorting and ranking results, acting as a second screener, and checking for errors. Paid

See: Gartlehner, G., Wagner, G., Lux, L., Affengruber, L., Dobrescu, A., Kaminski-Hartenthaler, A., & Viswanathan, M. (2019). Assessing the accuracy of machine-assisted abstract screening with DistillerAI: A user study. Systematic Reviews, 8(1), 277. https://doi.org/10.1186/s13643-019-1221-3

EPPI-Reviewer
Knowledge synthesis software which includes text mining, data clustering, classification and term extraction, to automate or semi-automate systematic reviews throughout the process; includes option for priority screening. Paid (free for Cochrane collaborators).
PICO Portal
Assists with study identification, deduplication, screening, full-text review, and mapping
Rayyan
Easy-to-use tool for screening records, uses machine learning (support vector machine classifier) to learn, as the reviewer progresses, the features of studies chosen for inclusion and exclusion, then predicts and suggests what non-screened studies to include; potentially useful for screening prioritisation. Paid but trial version available

See: Olofsson, H., Brolund, A., Hellberg, C., Silverstein, R., Stenström, K., Österberg, M., & Dagerhamn, J. (2017). Can abstract screening workload be reduced using text mining? User experiences of the tool Rayyan. Research synthesis methods, 8(3), 275-280. https://doi.org/10.1002/jrsm.1237

RobotAnalyst
Based on queried results, uses search engine functionality, machine learning, and text mining to reduce the record screening burden. Contact Prof. Ananiadou for an account.
SWIFT-Active Screener
Uses statistical text mining and machine learning to rank order documents in the screening process

See: Howard, B.E., Phillips, J., Miller, K. et al. (2016). SWIFT-Review: A text-mining workbench for systematic review. Systematic Reviews, 5, 87. https://doi.org/10.1186/s13643-016-0263-z

SyRF -
Use for screening, annotating, and extracting information from in vivo studies. Free

References on AI and text mining in study selection

In addition to the references below, you can use the following search strategy in Google Scholar to identify more literature on these and other tools (this is not a comprehensive search):

Useful repository of articles

Bond, M., Finnerty, A., O'Mara-Eves, A., O'Driscoll, P., Thomas, J., Minx, J., Callaghan, M., & Scheelbeek, P. (2024). Digital Evidence Synthesis Tool Evaluations. EPPI Visualiser database.
- In left column, under "Evidence Synthesis Stage": Select "Screening" > Click "List Records"