TY - GEN
T1 - RAG for Question-Answering for Vocal Training Based on Domain Knowledge Base
AU - Leung, Chun Hung Jonas
AU - Yi, Yicheng
AU - Kuai, Le
AU - Li, Zongxi
AU - Yeung, Siu Kei Au
AU - Lee, Kwok Wah John
AU - Ho, Ka Him Kelvin
AU - Hung, Kevin
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Although Large language models (LLMs) are well-known due to their superior capacity for text generation and logical inference, they are found to be inaccurate in domain-specific question-answering tasks. The powerful generator still tends to generate content even when the LLM does not have sufficient knowledge at all, which is known as the hallucination problem. We find there is a research void in applying LLMs in the vocal training industry, which requires intensive expert knowledge in any chatbot or intelligent tutor services. This paper details employing Retrieval-Augmented Generation (RAG) technology to develop a domain-specific language model, addressing inherent challenges such as hallucination, where large models generate plausible but inaccurate content, and lack of domain specificity. By segmenting the knowledge base and establishing semantic similarities between user queries and knowledge data, the project lays a solid foundation for integrating RAG, significantly improving response accuracy and contextual relevance. The report highlights the successful implementation of RAG, enhancing system intelligence and personalization for user-specific needs, discusses challenges and solutions during the implementation process, and outlines future directions to expand RAG capabilities and improve user experiences.
AB - Although Large language models (LLMs) are well-known due to their superior capacity for text generation and logical inference, they are found to be inaccurate in domain-specific question-answering tasks. The powerful generator still tends to generate content even when the LLM does not have sufficient knowledge at all, which is known as the hallucination problem. We find there is a research void in applying LLMs in the vocal training industry, which requires intensive expert knowledge in any chatbot or intelligent tutor services. This paper details employing Retrieval-Augmented Generation (RAG) technology to develop a domain-specific language model, addressing inherent challenges such as hallucination, where large models generate plausible but inaccurate content, and lack of domain specificity. By segmenting the knowledge base and establishing semantic similarities between user queries and knowledge data, the project lays a solid foundation for integrating RAG, significantly improving response accuracy and contextual relevance. The report highlights the successful implementation of RAG, enhancing system intelligence and personalization for user-specific needs, discusses challenges and solutions during the implementation process, and outlines future directions to expand RAG capabilities and improve user experiences.
UR - http://www.scopus.com/inward/record.url?scp=85215531476&partnerID=8YFLogxK
U2 - 10.1109/BESC64747.2024.10780718
DO - 10.1109/BESC64747.2024.10780718
M3 - Conference contribution
AN - SCOPUS:85215531476
T3 - Proceedings of the 2024 11th IEEE International Conference on Behavioural and Social Computing, BESC 2024
BT - Proceedings of the 2024 11th IEEE International Conference on Behavioural and Social Computing, BESC 2024
T2 - 11th IEEE International Conference on Behavioural and Social Computing, BESC 2024
Y2 - 16 August 2024 through 18 August 2024
ER -