International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 2 | Views: 253 | Weekly Hits: ⮙2 | Monthly Hits: ⮙2

Research Paper | Computer Science | India | Volume 12 Issue 4, April 2023 | Popularity: 4.8 / 10


     

AI - Based Solution for Web Crawling

Prashanth Kumar HM, Dr. Subramanya Bhat S


Abstract: Web crawling, also known as web scraping or spidering, is the process of automatically gathering data from the internet. It involves using automated software tools using AI to visit websites, download data like web pages, pdf, videos, metadata, or images. Then store it in a structured format for later use. Web crawlers, also called spiders or bots, follow links from one webpage to another with AI validation. The information gathered by web crawlers can be used for a variety of purposes, including data mining, content aggregation, search engine indexing, market research or Plagiarism detection. Here our crawling is only for plagiarism detection, and our new AI based algorithms help us to do the fastest and most accurate data downloading.


Keywords: Web Crawling, Structured Data, Link Validation, URL, Uniform Resource Locators, Artificial Intelligence


Edition: Volume 12 Issue 4, April 2023


Pages: 179 - 183


DOI: https://www.doi.org/10.21275/SR23331154330



Make Sure to Disable the Pop-Up Blocker of Web Browser




Text copied to Clipboard!
Prashanth Kumar HM, Dr. Subramanya Bhat S, "AI - Based Solution for Web Crawling", International Journal of Science and Research (IJSR), Volume 12 Issue 4, April 2023, pp. 179-183, https://www.ijsr.net/getabstract.php?paperid=SR23331154330, DOI: https://www.doi.org/10.21275/SR23331154330



Similar Articles

Downloads: 0

Research Paper, Computer Science, India, Volume 11 Issue 12, December 2022

Pages: 1060 - 1063

An Effectual Cardiovascular Disease Classification Using Ensemble Classifier with Oversampling Approach

R. Saranya, Dr. D. Kalaivani

Share this Article

Downloads: 2 | Weekly Hits: ⮙1 | Monthly Hits: ⮙2

Research Paper, Computer Science, Zimbabwe, Volume 12 Issue 6, June 2023

Pages: 297 - 306

The Development of an AI-Based Network Security Algorithm for an IoT Healthcare Platform

Keith Lungile Ncube, Mainford Mutandavari

Share this Article

Downloads: 7 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Research Paper, Computer Science, India, Volume 10 Issue 6, June 2021

Pages: 613 - 637

A Literary Review on Big Data & Hadoop

Anudeepa Gon

Share this Article
Top