978-613-7-32848-4

“Language Independent Content Extraction From Web Pages”

Regular price
€39,90
Sale price
€39,90
Regular price
Sold out
Unit price
per 
Shipping calculated at checkout.

Summary:

The rapid development of the internet and web publishing techniques create numerous information sources published as HTML pages on World Wide Web. However, there is lot of redundant and irrelevant information also on web pages. Navigation panels, Table of content (TOC), advertisements, copyright statements, service catalogs, privacy policies etc. on web pages are considered as relevant and irrelevant content. Such information makes various web mining tasks such as web page crawling, web page classification, link based ranking, topic distillation complex.

Author:

R. Chandramma

Biographie:

R Chandramma is working as Associate professor in VKIT BangaloreRavindranath R C is working as Assistant professor in VKIT Bangalore

Author:

Ravindranath R.C RaviTeja

Biographie:

Number of Pages:

52

Book language:

English

Published On:

2019-01-06

ISBN:

978-613-7-32848-4

Publishing House:

LAP LAMBERT Academic Publishing

Keywords:

COMPUTER SCIENCE, Information Technology

Product category:

BUSINESS & ECONOMICS / Careers