Time+Place: Thursday 02/04/2009 14:30 Room 337-8 Taub Bld.
Title: Web Mining or The Wisdom of the Crowds
Speaker: Ricardo Baeza-Yates http://www.dcc.uchile.cl/~rbaeza/
Affiliation: Yahoo! Research
Host: Eli Ben-Sasson

Abstract:

The Web continues to grow and evolve very quickly, changing our 
daily lives.

This growth reflects the collaborative work of the millions of 
institutions and people who contribute content to the Web, as well as
the one billion people who use it. In this ocean of hyperlinked
data there is explicit and implicit information and knowledge. 
Web Mining is the task of analyzing this data and extracting information 
and knowledge for many different purposes. The data comes in three main 
flavors: content (text, images, etc.), structure (hyperlinks) and usage 
(navigation, queries, etc.), which call for different techniques such as text, 
graph or log mining. Each case reflects the wisdom of some group of people, 
which can be used to make the Web better. One example is user-generated 
tags on Web 2.0 sites.
In this talk we walk through this process and give specific examples.