Automating Research Synthesis with Domain-Specific Large Language Model Fine-Tuning

Teo Susnjak*, Peter Hwang, Napoleon Reyes, Andre L.C. Barczak, Timothy Mcintosh, Surangika Ranathunga

*Corresponding author for this work

Research output: Contribution to journalArticleResearchpeer-review

3 Citations (Scopus)

Abstract

This research pioneers the use of fine-tuned Large Language Models (LLMs) to automate Systematic Literature Reviews (SLRs), presenting a significant and novel contribution in integrating AI to enhance academic research methodologies. Our study employed advanced fine-tuning methodologies on open sourced LLMs, applying textual data mining techniques to automate the knowledge discovery and synthesis phases of an SLR process, thus demonstrating a practical and efficient approach for extracting and analyzing high-quality information from large academic datasets. The results maintained high fidelity in factual accuracy in LLM responses, and were validated through the replication of an existing PRISMA-conforming SLR. Our research proposed solutions for mitigating LLM hallucination and proposed mechanisms for tracking LLM responses to their sources of information, thus demonstrating how this approach can meet the rigorous demands of scholarly research. The findings ultimately confirmed the potential of fine-tuned LLMs in streamlining various labor-intensive processes of conducting literature reviews. As a scalable proof-of-concept, this study highlights the broad applicability of our approach across multiple research domains. The potential demonstrated here advocates for updates to PRISMA reporting guidelines, incorporating AI-driven processes to ensure methodological transparency and reliability in future SLRs. This study broadens the appeal of AI-enhanced tools across various academic and research fields, demonstrating how to conduct comprehensive and accurate literature reviews with more efficiency in the face of ever-increasing volumes of academic studies while maintaining high standards.

Original languageEnglish
Article number68
Pages (from-to)1-39
Number of pages39
JournalACM Transactions on Knowledge Discovery from Data
Volume19
Issue number3
DOIs
Publication statusPublished - 11 Mar 2025

Cite this