DOAJ Open Access 2025

Large language models for closed-library multi-document query, test generation, and evaluation

Claire Randolph Adam Michaleas Darrell O. Ricke

Abstrak

IntroductionLearning complex, detailed, and evolving knowledge is a challenge in multiple technical professions. Relevant source knowledge is contained within many large documents and information sources with frequent updates to these documents. Knowledge tests need to be generated on new material and existing tests revised, tracking knowledge base updates. Large Language Models (LLMs) provide a framework for artificial intelligence-assisted knowledge acquisition and continued learning. Retrieval-Augmented Generation (RAG) provides a framework to leverage available, trained LLMs combined with technical area-specific knowledge bases.MethodsHerein, two methods are introduced (DaaDy: document as a dictionary and SQAD: structured question answer dictionary), which together enable effective implementation of LLM-RAG question-answering on large documents. Additionally, the AI for knowledge intensive tasks (AIKIT) solution is presented for working with numerous documents for training and continuing education. AIKIT is provided as a containerized open source solution that deploys on standalone, high performance, and cloud systems. AIKIT includes LLM, RAG, vector stores, relational database, and a Ruby on Rails web interface.ResultsCoverage of source documents by LLM-RAG generated questions decreases as the length of documents increase. Segmenting source documents improve coverage of generated questions. The AIKIT solution enabled easy use of multiple LLM models with multimodal RAG source documents; AIKIT retains LLM-RAG responses for queries against one or multiple LLM models.DiscussionAIKIT provides an easy-to-use set of tools to enable users to work with complex information using LLM-RAG capabilities. AIKIT enables easy use of multiple LLM models with retention of LLM-RAG responses.

Topik & Kata Kunci

Electronic computers. Computer science

Penulis (3)

Claire Randolph

Adam Michaleas

Darrell O. Ricke

Format Sitasi

APA MLA BibTeX

Randolph, C., Michaleas, A., Ricke, D.O. (2025). Large language models for closed-library multi-document query, test generation, and evaluation. https://doi.org/10.3389/frai.2025.1592013

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →

Lihat di Sumber doi.org/10.3389/frai.2025.1592013

Informasi Jurnal

Tahun Terbit: 2025
Sumber Database: DOAJ
DOI: 10.3389/frai.2025.1592013
Akses: Open Access ✓