Enhancing HR Support in a Thai Organization with LLM-Based Question Answering
Loading...
Date
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Large language models (LLMs) enhanced with retrieval-augmented generation (RAG) and multimodal inputs are increasingly used as interfaces to organizational knowledge. However, their effectiveness in specialized, non-English enterprise settings-such as Thai HR support-remains largely unclear. In many Thai organizations, employees frequently ask detailed HR-related questions, but the relevant information is scattered across internal webpages, PDF manuals, announcements, and images, making it difficult for generic LLMs to provide accurate, policy-consistent responses. To address this issue, we develop a multimodal RAG pipeline that combines hybrid dense-sparse retrieval over a vector database and evaluate six LLM models on a private Thai Visual Question Answering (VQA) HR dataset consisting of 226 questions and reference images across five HR topics. The results show that recent multimodal models, especially Qwen2.5-VL, achieve the best performance, with the highest averages in correctness (0.54), relevance (0.75), and helpfulness (0.64), clearly outperforming older vision-language systems and a text-only reasoning model. For large-scale answer evaluation, we apply an LLM-as-a-judge approach using GPT-4.1 and Gemini 2.5 Flash. We found that it serves as a generally reliable, though imperfect, substitute for human evaluation.