M.Tech / M.E / PhD Thesis | Information Technology | Burma | Volume 8 Issue 1, January 2019 | Popularity: 6.8 / 10
Comparison of Keyword-based and Semantic-based Web Page Clustering Systems
Ei Ei Moe, Hnin Hnin Htun
Abstract: Web page clustering is useful for many applications, such as categorization, cleaning, schema detection, and automatic extraction. Clustering approaches can be classified along several dimensions: hierarchical versus flat, online versus offline, soft versus hard, and document-based versus keyword-based. Keyword-based web page clustering uses the single words or compound words occurring in the web page set as the features for clustering. These words cannot precisely represent the content of a web page, however, because synonymy and polysemy introduce ambiguity. Semantic analysis helps resolve this ambiguity. This work therefore implements both a keyword-based and a semantic-based web page clustering system and compares their performance. In the semantic analysis, the words in each web page are first mapped to word senses using a supervised word sense disambiguation method. The semantic-based system then uses both keywords and semantic features for clustering. After running each clustering process, the comparison indicates that the semantic-based web page clustering system is more precise and effective than the keyword-based one.
Keywords: Semantic, Word Sense Disambiguation, Clustering
Edition: Volume 8 Issue 1, January 2019
Pages: 1511 - 1516
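
The contrast the abstract draws between keyword features and sense-level features can be illustrated with a minimal sketch. This is not the authors' implementation: the two toy pages, the stopword list, and the hand-written `SENSE_CUES` lookup are all hypothetical, and the lookup merely stands in for the supervised word sense disambiguation method the abstract mentions. The sketch only shows why a polysemous word such as "bank" can make two unrelated pages look similar under keyword features, and how replacing it with a sense label (while keeping the other keywords) removes that false overlap.

```python
import math
from collections import Counter

# Hypothetical toy pages; not data from the paper.
PAGES = {
    "finance": "the bank approved the loan and the bank raised interest",
    "river": "the river bank flooded and the bank eroded after rain",
}

STOPWORDS = {"the", "and", "after"}

# Hypothetical context cues per sense. A real system would use a trained
# (supervised) WSD model instead of this hand-written lookup.
SENSE_CUES = {
    "bank": {
        "bank#finance": {"loan", "interest", "deposit"},
        "bank#river": {"river", "flooded", "eroded", "rain"},
    }
}


def tokenize(text):
    """Lowercase, split on whitespace, and drop stopwords."""
    return [t for t in text.lower().split() if t not in STOPWORDS]


def keyword_features(text):
    """Bag-of-words counts over surface keywords."""
    return Counter(tokenize(text))


def disambiguate(word, context):
    """Return the sense label whose cue words overlap the context most."""
    senses = SENSE_CUES.get(word)
    if senses is None:
        return word  # unambiguous words stay as plain keywords
    return max(sorted(senses), key=lambda s: len(senses[s] & context))


def semantic_features(text):
    """Keyword counts with ambiguous words replaced by sense labels."""
    tokens = tokenize(text)
    context = set(tokens)
    return Counter(disambiguate(t, context) for t in tokens)


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


kw_sim = cosine(keyword_features(PAGES["finance"]),
                keyword_features(PAGES["river"]))
sem_sim = cosine(semantic_features(PAGES["finance"]),
                 semantic_features(PAGES["river"]))

print(round(kw_sim, 2))   # 0.5: the shared surface word "bank" inflates similarity
print(round(sem_sim, 2))  # 0.0: the two senses of "bank" no longer match
```

Any similarity-based clustering algorithm fed these vectors would be less likely to group the two pages together under the sense-level features; the sketch says nothing about the clustering algorithm or evaluation measures used in the paper itself.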