Sinarwati, Mohamad Suhaili and Naomie, Salim and Mohamad Nazim, Jambli (2022) A Comparative Analysis of Generative Neural Attention-based Service Chatbot. (IJACSA) International Journal of Advanced Computer Science and Applications, 13 (8). pp. 742-751. ISSN 2156-5570
Abstract
Companies constantly rely on customer support to deliver pre- and post-sale services to their clients through websites, mobile devices, or social media platforms such as Twitter. In assisting customers, companies employ virtual service agents (chatbots) to provide support via communication devices. The primary focus is to automate the generation of conversational chat between a computer and a human by constructing virtual service agents that can predict appropriate, automatic responses to customers' queries. This paper presents and implements a seq2seq learning model based on encoder-decoder architectures by training generative chatbots on a customer support Twitter dataset. The model is built on deep Recurrent Neural Network (RNN) structures, with uni-directional and bi-directional encoders of the Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) types. The RNNs are augmented with an attention layer to focus on important information between input and output sequences. Word-level embeddings such as Word2Vec, GloVe, and FastText are employed as input to the model. On this base architecture, a comparative analysis is performed in which baseline models are compared with and without attention, as well as with different types of input embedding in each experiment. Bilingual Evaluation Understudy (BLEU) was employed to evaluate model performance. Results revealed that while biLSTM performs better with GloVe, biGRU performs better with FastText. The findings thus indicate that the attention-based, bi-directional RNN (LSTM or GRU) models significantly outperform the baseline approaches in BLEU score, making them promising for future work.
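As a minimal illustration of the evaluation metric named in the abstract, sentence-level BLEU against a single reference can be sketched in plain Python. This is a simplified, unsmoothed variant for illustration only, not the authors' implementation (which likely uses a standard toolkit such as NLTK over multiple references):

```python
import math
from collections import Counter

def ngrams(tokens, n):
    """All contiguous n-grams of a token list, as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bleu(candidate, reference, max_n=4):
    """Sentence-level BLEU with uniform weights and a brevity penalty.

    Uses clipped n-gram precision against a single reference. No
    smoothing is applied, so a zero precision at any order yields a
    score of 0, as in the original BLEU definition.
    """
    precisions = []
    for n in range(1, max_n + 1):
        cand_counts = Counter(ngrams(candidate, n))
        ref_counts = Counter(ngrams(reference, n))
        # Clip each candidate n-gram count by its count in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        total = max(sum(cand_counts.values()), 1)
        precisions.append(overlap / total)
    if min(precisions) == 0:
        return 0.0
    # Geometric mean of the n-gram precisions.
    log_avg = sum(math.log(p) for p in precisions) / max_n
    # Brevity penalty: penalise candidates shorter than the reference.
    bp = (1.0 if len(candidate) >= len(reference)
          else math.exp(1 - len(reference) / len(candidate)))
    return bp * math.exp(log_avg)
```

A candidate identical to the reference scores 1.0, while a candidate sharing no unigrams with the reference scores 0.0; generated chatbot responses typically fall in between.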
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Sequence-to-sequence; encoder-decoder; service chatbot; attention-based encoder-decoder; Recurrent Neural Network (RNN); Long Short-Term Memory (LSTM); Gated Recurrent Unit (GRU); word embedding |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology |
Depositing User: | Gani |
Date Deposited: | 09 Nov 2022 02:23 |
Last Modified: | 09 Nov 2022 02:23 |
URI: | http://ir.unimas.my/id/eprint/40379 |