Measuring Text Similarity based on Structure and Word Embedding

Research Abstract

The problem of ﬁnding similarity between natural language sentences is crucial for many applications in Natural Language Processing (NLP). Moreover, accurate calculation of similarity between sentences is highly needed. Many approaches depends on word-to-word similarity to measure sentences similarity. This paper proposes a new approach to improve accuracy of sentences similarity calculation. The proposed approach combines diﬀerent similarity measures in calculation of sentences similarity. In addition to traditional word-to-word similarity measure the proposed approach exploits sentences semantic structure. Discourse representation structure (DRS) which is a semantic representation for natural sentences is generated and used to calculated structure similarity. Furthermore, word order similarity is measured to consider order of words in sentences. Experiments show that exploiting structural information achieves good results. Moreover, the proposed method outperforms the current approaches on Pilot standard benchmark dataset achieving 0.8813 peasron correlation with human similarity.

Research Authors

Mamdouh Farouk

Research Date

Thu, 15 Oct 2020 - 12:00

Research Department

Computer Science Department

Research File

8-Measuring Text Similarity Based on Structure and Word Embedding (1).pdf (328.01 KB)

Research Member

Mamdouh Farouk Mohamed Farghaly

Research Pages

1-10

Research Publisher

elsevier

Research Vol

63

Research Year

2020

Faculty of Computers and Information

Measuring Text Similarity based on Structure and Word Embedding

آخر الأبحاث

Assiut
University

Important Links

Our Address

Measuring Text Similarity based on Structure and Word Embedding

آخر الأبحاث

Assiut University

Important Links

Our Address

Assiut
University