Hybrid Algorithm for Enhancing Focused Web Crawling Using Block Segmentation - Niti Saxena

English
2021-01-28

Description

A search engine is software that helps locate information stored on the World Wide Web; when we speak of one, we usually mean the actual search performed over databases of HTML documents. The purpose of partitioning a web page into blocks is to extract only those URLs that belong to relevant blocks and to skip those that do not. A problem faced by focused crawlers is that they measure relevancy and calculate the URL score for the page as a whole, while a web page usually contains both relevant and irrelevant topics. Page segmentation transforms a multi-topic web page into many single-topic context blocks and thereby improves the crawler's performance. Blocks such as navigation panels, copyright and privacy notices, unnecessary images, and advertisements distract a user from the actual content and reduce performance. In this thesis, we present a method to divide web pages into content blocks. The method uses an algorithm that partitions a web page into content blocks with a hierarchical structure, based on the page's pre-defined structure, i.e. its HTML tags. Partitioning on the basis of headings has an advantage over conventional block partitioning: each block contains a complete topic. The heading, content, images, links, tables, and sub-tables of a particular topic are included in one complete block.
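
The idea of heading-based partitioning can be illustrated with a minimal Python sketch. It is not the algorithm from the thesis, which is not reproduced in this description. It assumes that every `h1` to `h6` tag opens a new block, it keeps the blocks flat rather than hierarchical, and it uses a simple keyword match as a stand-in for the relevancy score. The names `HeadingBlockParser`, `segment_by_headings`, and `relevant_links` are hypothetical.

```python
from html.parser import HTMLParser

HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class HeadingBlockParser(HTMLParser):
    """Split a page into flat blocks, opening a new block at every heading tag."""

    def __init__(self):
        super().__init__()
        self.blocks = []  # each block: {"heading": str, "text": str, "links": [str]}
        self._current = self._new_block()
        self._in_heading = False

    @staticmethod
    def _new_block():
        return {"heading": "", "text": "", "links": []}

    def handle_starttag(self, tag, attrs):
        if tag in HEADING_TAGS:
            # A heading closes the previous block and opens a new one.
            self._flush()
            self._in_heading = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._current["links"].append(href)

    def handle_endtag(self, tag):
        if tag in HEADING_TAGS:
            self._in_heading = False

    def handle_data(self, data):
        key = "heading" if self._in_heading else "text"
        self._current[key] += data

    def _flush(self):
        # Store the current block unless it is completely empty.
        block = self._current
        if block["heading"].strip() or block["text"].strip() or block["links"]:
            self.blocks.append(block)
        self._current = self._new_block()

    def close(self):
        super().close()
        self._flush()


def segment_by_headings(html):
    parser = HeadingBlockParser()
    parser.feed(html)
    parser.close()
    return parser.blocks


def relevant_links(html, topic_terms):
    """Return URLs only from blocks whose heading or text mentions a topic term."""
    terms = [t.lower() for t in topic_terms]
    urls = []
    for block in segment_by_headings(html):
        content = (block["heading"] + " " + block["text"]).lower()
        if any(term in content for term in terms):  # assumed relevancy test
            urls.extend(block["links"])
    return urls
```

Calling `relevant_links(page_html, ["crawl"])` on a page with a navigation bar, a section headed "Web crawling", and an advertisement section would return only the links found under the crawling heading. A real focused crawler would replace the keyword test with its own relevancy measure.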

More Information

Author Niti Saxena
Publisher Amazon Digital Services LLC - KDP Print US
Release year 2021
Cover type Softcover
EAN 9798590387021