Developing a BERT based triple classification model using knowledge graph embedding for question answering system

Thạc sĩPhan Hồ Viết TrườngDo Phuc

Khoa Công Nghệ Thông Tin

Thể loại: Bài báo

Sơ lược nội dung

The current BERT-based question answering systems use a question and a contextual text to find the answer. This causes the systems to return wrong answers or nothing if the text contains irrelevant contents with the input question. Besides, the systems haven’t answered yes-no and aggregate questions yet. Besides that, the systems only concentrate on the contents of text regardless of the relationship between entities in the corpus. This systems cannot validate the answer. In this paper, we presented a solution to solve these issues by using the BERT model and the knowledge graph to enhance a question answering system. We combined content-based and linked-based information for knowledge graph representation learning and classified triples into one of three classes such as base class, derived class, or non-existent class. We then used the BERT model to build two classifiers: BERT-based text classification for content information and BERT-based triple classification for link information. The former was able to make a contextual embedding vector for representing triples that were used to classify into the three above classes. The latter generated all path instances from all meta paths of a large heterogeneous information network by running the Motif Search method of Apache Spark on a distributed environment. After creating the path instances, we produced triples from these path instances. We made content-based information by converting triples into natural language text with labels and considered them as a text classification problem. Our proposed solution outperformed other embedding methods with an average accuracy of 92.34% on benchmark datasets and the Motif Finding algorithm with an average executive time improvement of 37% on the distributed environment.

Thông tin chung
Thể loại
Bài báo
Năm xuất bản
08 Thg5 2021
Ngôn ngữ gốc
Tiếng Anh
Tạp chí công bố
Applied Intelligence
Ấn phẩm số
Vol. 178
Loại tạp chí
Danh mục ISI/Scopus
Mã ISSN
0924-669X / 1573-7497
Chất lượng
Q2

Tài liệu tham khảo

Để đọc toàn văn của bài báo này, bạn có thể yêu cầu một bản sao đầy đủ trực tiếp từ các tác giả.