Web Mining

snufflevoicelessInternet and Web Development

Oct 22, 2013 (3 years and 9 months ago)

81 views

Web Mining

Reference:
Web Mining: Research And Practice

Kolari, P.; Joshi, A.;


Computing in Science & Engineering

[see also
IEEE

Computational Science and Engineering]


Volume 06,


Issue 4,


July
-
Aug. 2004 Page(s):49


53

IEEE

JNL

2005/05/24

Three Broad Categories of Web
Mining


Web content mining


Application of data
-
mining techniques


Web structure mining


Operates on the Web

s hyperlink structure


Web usage mining


Analyzes user interaction with Web server


Include logs, database transaction,



Privacy concern

Web Context and Structure Mining


Web as a Database


Document Classification


Hubs and Authorities


Clever: Ranking by Content


Identifying Web Communities

Web as a Database


Placing a layer of abstraction
containing some semantic information
on top of semistructured Web


Query the Web as a database


Topic, author, creation date, and so on


WebLog and WebSQL


Recent work: Semantic Web


Document Classification


Roots


Machine learning


Pattern Recognition


Text Analysis


Topic Aggregation


Google News


http://news.google.com

Hubs and Authorities


Authorities


Highly ranked pages for a given topic


Hubs


Pages with links to authorities


Google search engine

Web Usage Mining


e
-
business


Main areas


Web log data preprocessing


Identifying of useful patterns


Notable research


Adaptive Web Sites


Personalization, recommended links


Robust Fuzzy Clustering


Association Rules

Web Usage Mining (Cont.)


Recommender Systems


Converting browsers to buyers



Customers who bought A also bought B.



Association Rules


Web Site Evaluation


Determine needed modifications


primarily to the site

s design of page content and
link structure


Hamlet: To Buy or Not to Buy


Airline ticket purchasing


Recommendations the best time to buy tickets


Privacy Issue


Semantic Web Mining


Semantic Web


Next generation Web


Semantically rich language


Web Ontology Language


More Complex than Web
-
as
-
database


Fit Web mining


More and more benefits

Distributed Web Mining


Questions


The only question is:


Can Web mining catch up to the WWW

s growth?


Researchers


Web Content Mining


Fabrizio Sebastini


Soumen Chakrabarti


Web Structure Mining


Johannes Furnkranz


Sepandar Kamwar


Chakrabarti


Ravi Kumar


Web Usage Mining


Webviz