Optimizing theranostics chatbots with context-augmented large language models

dc.contributor.authorKoller, Pia
dc.contributor.authorClement, Christoph
dc.contributor.authorVan Eijk, Albert
dc.contributor.authorSeifert, Robert
dc.contributor.authorZhang, Jingjing
dc.contributor.authorPrenosil, George
dc.contributor.authorSathekge, Mike Machaba
dc.contributor.authorHerrmann, Ken
dc.contributor.authorBaum, Richard
dc.contributor.authorWeber, Wolfgang A.
dc.contributor.authorRominger, Axel
dc.contributor.authorShi, Kuangyu
dc.date.accessioned2025-07-11T07:20:30Z
dc.date.available2025-07-11T07:20:30Z
dc.date.issued2025-04
dc.description.abstractIINTRODUCTION : Nuclear medicine theranostics is rapidly emerging, as an interdisciplinary therapy option with multi-dimensional considerations. Healthcare Professionals do not have the time to do in depth research on every therapy option. Personalized Chatbots might help to educate them. Chatbots using Large Language Models (LLMs), such as ChatGPT, are gaining interest addressing these challenges. However, chatbot performances often fall short in specific domains, which is critical in healthcare applications. METHODS : This study develops a framework to examine the use of contextual augmentation to improve the performance of medical theranostic chatbots to create the first theranostic chatbot. Contextual augmentation involves providing additional relevant information to LLMs to improve their responses. We evaluate five state-of-the-art LLMs on questions translated into English and German. We compare answers generated with and without contextual augmentation, where the LLMs access pre-selected research papers via Retrieval Augmented Generation (RAG). We are using two RAG techniques: Naïve RAG and Advanced RAG. RESULTS : A user study and LLM-based evaluation assess answer quality across different metrics. Results show that Advanced RAG techniques considerably enhance LLM performance. Among the models, the best-performing variants are CLAUDE 3 OPUS and GPT-4O. These models consistently achieve the highest scores, indicating robust integration and utilization of contextual information. The most notable improvements between Naive RAG and Advanced RAG are observed in the GEMINI 1.5 and COMMAND R+ variants. CONCLUSION : This study demonstrates that contextual augmentation addresses the complexities inherent in theranostics. Despite promising results, key limitations include the biased selection of questions focusing primarily on PRRT, the need for comprehensive context documents. Future research should include a broader range of theranostics questions, explore additional RAG methods and aim to compare human and LLM evaluations more directly to enhance LLM performance further.
dc.description.departmentNuclear Medicine
dc.description.librarianhj2025
dc.description.sdgSDG-03: Good health and well-being
dc.description.sdgSDG-09: Industry, innovation and infrastructure
dc.description.sponsorshipITM Radiopharma grant and by the Swiss National Science Foundation (SNSF).
dc.description.urihttps://www.thno.org/
dc.identifier.citationKoller, P., Clement, C., Van Eijk, A. et al. 2025, 'Optimizing theranostics chatbots with context-augmented large language models', Theranostics, vol. 15, no. 12, pp. 5693-5704, doi : 10.7150/thno.107757.
dc.identifier.issn1838-7640 (online)
dc.identifier.other10.7150/thno.107757
dc.identifier.urihttp://hdl.handle.net/2263/103308
dc.language.isoen
dc.publisherIvyspring International Publisher
dc.rights© The author(s). This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/).
dc.subjectLarge language model (LLM)
dc.subjectContextual augmentation
dc.subjectRetrieval augmented generation (RAG)
dc.subjectNuclear medicine
dc.subjectTheranostics
dc.subjectArtificial intelligence (AI)
dc.subjectHealth care professional (HCP)
dc.titleOptimizing theranostics chatbots with context-augmented large language models
dc.typeArticle

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Koller_Optimizing_2025.pdf
Size:
3.89 MB
Format:
Adobe Portable Document Format
Description:
Article
Loading...
Thumbnail Image
Name:
Koller_OptimizingAppen_2025.pdf
Size:
96.79 KB
Format:
Adobe Portable Document Format
Description:
Appendix

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: