Keyword Extraction for Medium-Sized Documents Using Corpus-Based Contextual Semantic Smoothing

Complexity 2022:1-8 (2022)

Abstract

Keyword extraction is the process of selecting the most significant, relevant, and descriptive terms present in a single document as its keywords. It has major applications in information retrieval, including document analysis, summarization, indexing, and search. In this paper, we present a novel supervised technique for extracting keywords from medium-sized documents, namely Corpus-based Contextual Semantic Smoothing (CCSS). CCSS extends Contextual Semantic Smoothing (CSS), which considers term usage patterns in similar texts to improve term relevance information. As our novel contributions, we introduce four features beyond those of CSS. We systematically compare CCSS with other techniques on the INSPEC dataset, where CCSS outperforms all state-of-the-art keyphrase extraction techniques presented in the literature.

Similar books and articles

Extraction of important lecture terms using students' search information. 入部 百合絵 & 篠原 修二 - 2007 - Transactions of the Japanese Society for Artificial Intelligence 22 (6):604-611.
A comparative study of keyword extraction algorithms for English texts. Jinye Li - 2021 - Journal of Intelligent Systems 30 (1):808-815.

Analytics

Added to PP: 2022-10-01
Downloads: 22 (#1,082,780)
Downloads, past 6 months: 1 (#1,602,128)

Author's Profile

Muhammad Siddiqui
York University
