Independent Researcher, Columbus, Ohio.
International Journal of Science and Research Archive, 2025, 16(01), 1452-1462
Article DOI: 10.30574/ijsra.2025.16.1.2156
Received on 11 June 2025; revised on 16 July 2025; accepted on 19 July 2025
We propose a novel Retrieval Augmented Generation (RAG) framework for dynamic document tagging and query driven retrieval. Our system integrates a large language model (LLM) with an explicit memory of documents to generate semantic tags for each document and uses these tags to improve retrieval accuracy (Zhou et al., 2024; Li et al., 2024). A query tag feedback loop is then formalized to iteratively refine document annotations based on user queries and present a modular architecture that separates document preprocessing tag generation, storage and retrieval (Sharma et al., 2024). To evaluate such systems, we introduce a synthetic multi domain benchmark containing documents from scientific (ArXiv), governmental, and legal sources, along with ground truth tags and query pools (Kim et al., 2024; Lin et al., 2024). We also define a new Query to Tag Matching Score (Q2T), measuring the semantic alignment between queries and generated tags. Experiments on our benchmark and real world corpora show that dynamic tagging significantly improves recall and annotation quality over static baselines. We include ablation studies isolating the effects of each component and evaluate across multiple domains (e.g. PDFs, filings, rulings). Finally, we discuss ethical implications such as annotation bias and hallucinations, and outline mitigation strategies (Tokunaga et al., 2024). This work provides a rigorous foundation and evaluation framework for adaptive RAG systems in document understanding.
RAG; Retrieval Augmented Generation; Dynamic document tagging; Query driven retrieval
Preview Article PDF
Surya Kuchibhotla and Sri Kuchibhotla. Retrieval augmented generation system for dynamic document tagging and query-driven retrieval. International Journal of Science and Research Archive, 2025, 16(01), 1452-1462. Article DOI: https://doi.org/10.30574/ijsra.2025.16.1.2156.
Copyright © 2025 Author(s) retain the copyright of this article. This article is published under the terms of the Creative Commons Attribution Liscense 4.0







