Egor Kratkov
2025
RAGulator: Effective RAG for Regulatory Question Answering
Islam Aushev
|
Egor Kratkov
|
Evgenii Nikoalev
|
Andrei Vladimirovich Glinskii
|
Vasilii Krikunov
|
Alexander Panchenko
|
Vasily Konovalov
|
Julia Belikova
Proceedings of the 1st Regulatory NLP Workshop (RegNLP 2025)
Regulatory Natural Language Processing (RegNLP) is a multidisciplinary domain focused on facilitating access to and comprehension of regulatory regulations and requirements. This paper outlines our strategy for creating a system to address the Regulatory Information Retrieval and Answer Generation (RIRAG) challenge, which was conducted during the RegNLP 2025 Workshop. The objective of this competition is to design a system capable of efficiently extracting pertinent passages from regulatory texts (ObliQA) and subsequently generating accurate, cohesive responses to inquiries related to compliance and obligations. Our proposed method employs a lightweight BM25 pre-filtering in retrieving relevant passages. This technique efficiently shortlisting candidates for subsequent processing with Transformer-based embeddings, thereby optimizing the use of resources.