Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet

Lee, Jun Choi and Cheah, Yu-N (2016) Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet. In: International Conference on Advanced Informatics: Concepts, Theory and Applications, 16-17 August 2016, Parkroyal Penang Resort.

Official URL: DOI: 10.1109/ICAICTA.2016.7803127

Abstract

This study presents sentence-to-sentence semantic relatedness measures for paraphrase detection. The proposed measures use the shortest path between synsets in WordNet as the core of the relatedness computation between two sentences. Synsets in WordNet are interlinked according to the conceptual-semantic relations between them, so the distance between two synsets can serve as a measure of their semantic relatedness. This study derives a sentence-level semantic relatedness from this feature to detect paraphrasing among sentences. The proposed measures are evaluated by their accuracy and F-measure in identifying paraphrases in the Microsoft Research Paraphrase Corpus. The proposed method achieved an accuracy of 71.1% and an F-measure of 81.8%. Its performance is compared with 6 paraphrase detection methods, which include Salient Semantic Analysis and Second-order Co-occurrence Pointwise Mutual Information. In this comparison, the proposed method achieved the fourth highest accuracy and the second highest F-measure. This is a reasonable performance for the proposed semantic relatedness in paraphrase detection.

Keywords: paraphrase detection, semantic relatedness, WordNet, synset shortest path.
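The shortest-path idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact formula, so the toy graph `TOY_EDGES` (a hypothetical stand-in for WordNet's synset links), the `1 / (1 + distance)` scoring, and the best-match averaging in `sentence_relatedness` are all assumptions made for illustration.

```python
from collections import deque

# Hypothetical toy graph standing in for WordNet synset links (not real WordNet data).
TOY_EDGES = [
    ("car", "motor_vehicle"), ("truck", "motor_vehicle"),
    ("motor_vehicle", "vehicle"), ("bicycle", "vehicle"),
    ("vehicle", "entity"), ("animal", "entity"),
    ("dog", "animal"), ("cat", "animal"),
]

def build_graph(edges):
    """Build an undirected adjacency map from a list of node pairs."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph

def shortest_path_length(graph, start, goal):
    """Breadth-first search; returns the edge count, or None if unreachable."""
    if start not in graph or goal not in graph:
        return None
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        for nxt in graph[node]:
            if nxt == goal:
                return dist + 1
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None

def word_relatedness(graph, a, b):
    """Assumed scoring: identical words score 1.0, otherwise 1 / (1 + distance)."""
    if a == b:
        return 1.0
    dist = shortest_path_length(graph, a, b)
    return 0.0 if dist is None else 1.0 / (1.0 + dist)

def directed_score(graph, source, target):
    """Average, over the source words, of each word's best match in the target."""
    if not source or not target:
        return 0.0
    best = [max(word_relatedness(graph, w, t) for t in target) for w in source]
    return sum(best) / len(best)

def sentence_relatedness(graph, s1, s2):
    """Assumed aggregation: mean of the two directed best-match scores."""
    return 0.5 * (directed_score(graph, s1, s2) + directed_score(graph, s2, s1))

GRAPH = build_graph(TOY_EDGES)
```

A sentence pair would then be labelled a paraphrase when `sentence_relatedness(GRAPH, tokens_a, tokens_b)` exceeds some threshold. The threshold value is not stated in the abstract and would have to be tuned on training data.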

Item Type: Proceeding (Paper)
Uncontrolled Keywords: paraphrase detection, semantic relatedness, WordNet, synset shortest path, research, Universiti Malaysia Sarawak, unimas, university, universiti, Borneo, Malaysia, Sarawak, Kuching, Samarahan, ipta, education
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology
Faculties, Institutes, Centres > Faculty of Computer Science and Information Technology
Depositing User: Mr. Jun Choi Lee
Date Deposited: 29 Aug 2016 20:07
Last Modified: 17 Feb 2017 08:18
URI: http://ir.unimas.my/id/eprint/13239
