Downloads: 108
Review Papers | Computer Science & Engineering | India | Volume 4 Issue 11, November 2015
Specific Personal Alias Withdrawal from Web and Clustering of Similar Web Documents
Snehal S. Shinde | Prakash R. Devale
Abstract: There are many names available for a person, place or an entity on the web. If accurate alias of a particular individual is identified it becomes very useful in numerous web related tasks like information extraction, relation extraction, biomedical fields, sentiment analysis, personal name disambiguation, etc. Here, one method is projected based on referential ambiguity to find the correct alias for a given name. After accepting real name as input lexical patterns are achieved from the web. Candidate aliases are extracted with the help of these patterns. The candidate aliases are ranked using various ranking scores like co occurrence frequency, web dice, hub discounting, and degree distribution. This method improves the recall and attains a statistically considerable mean reciprocal rank. Using candidate aliases and data files, related web documents are bunched or grouped. Grouping achieves high accuracy and reduces the complexity.
Keywords: Web mining, ranking, clustering, web text analysis, co-occurrence frequency
Edition: Volume 4 Issue 11, November 2015,
Pages: 2503 - 2506
Similar Articles with Keyword 'Web mining'
Downloads: 104
Research Paper, Computer Science & Engineering, India, Volume 4 Issue 8, August 2015
Pages: 1640 - 1647Privacy Preservation Protection for Personalized Web User by k-Anonymity with Profile Construction for Web Search Engines
Uma Maheswari.T | Dr.V. Kavitha
Downloads: 104
Survey Paper, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015
Pages: 1812 - 1815A Survey on Domain Name Categorization Using Artificial Neural Networks
Akshay S. Dhomble | Disha Deotale