Workshop on Web Mining 2006 (WebMine)

September 18th, 2006, Berlin, Germany

The World Wide Web is a rich source of information about human behavior. It contains large amount of data organized via interconnected Web pages, traces of information search, user feedback on items of interest, etc. One of the important characteristics of the Web in addition to large data volumes is its dynamics, where content, structure and usage is changing over time. This shows up in the rise of related research areas like communities of practice, knowledge management, web communities, and peer to peer. In particular the notion of collaborative work and thus the need of its systematic analysis become more and more important. For instance, to develop effective web applications, it is essential to analyze patterns hidden in the usage of web resources, their contents and their interconnections. Machine Learning and Data mining methods have been used extensively to find patterns in usage of the network by exploiting both contents and link structures.

The topics of the workshop include:

  • analysis of weblogs and blogs
  • semantic association identification by link analysis
  • discovering social structures and communities
  • predicting trends and user behavior
  • analysis of dynamic networks
  • using prior knowledge and the semantic web for user modelling
  • using content of the web for modelling
  • applications in e-commerce, education
  • discovering misuse and fraud
  • combining the web with data from other sources
  • network or link analysis of social resources sharing systems
  • analysis of folksonomies or web 2.0 application
  • deriving profiles from usage
  • personalized delivery of news and journals
  • semantic web personalization
  • semantic web technologies for recommender systems

Submission

Important Dates:
-----------------
Submissions: June 28
Notifications: August 2
Camera ready: August 9
Workshop day: September 18

We invite two types of submissions for this workshop:

  • Technical papers (maximum 12 pages)
  • Short position papers (maximum 6 pages)

Submitted papers will be peer-reviewed and selected on the basis of these reviews. Accepted papers will be presented at the workshop. All submissions should be made electronically if possible, by email attachment and preferably in Postscript or PDF format.

All submissions must be sent to . Although not required for the initial submission, we recommend to follow the format guidelines of ECML/PKDD (Springer LNCS -- LaTeX Style File), as this will be the required format for accepted papers.

The workshop proceedings will be distributed during the workshop. We plan to issue a post workshop publication of selected papers by Springer Lecture Notes.


Accepted Papers

  • XPath-Wrapper Induction by generalizing tree traversal patterns (Tobias Anton, Technical University of Darmstadt, Germany)
  • An Analysis of Bloggers and Topics for a Blog Recommender System (Conor Hayes, Paolo Avesani, Sriharsha Veeramachaneni, ITC-IRST, Trento, Italy)
  • Web Usage Mining and XML Mining: a real case study (Federico Michele Facca, Politecnico di Milano, Italy)
  • Semi-Supervised Learning to Extract Attribute-Value Pairs from Product Descriptions on the Web (Katharina Probst, Rayid Ghani, Marko Krema, Andy Fano, Yan Liu, Accenture Technology Labs, Chicago, USA/Carnegie Mellon University, Pittsburgh, PA, USA)
  • Mining Associations from Web Query Logs (Benjamin Rey, Pradhuman Jhala, Yahoo! Research, Burbank, USA)
  • Collaborative Filtering: Fallacies and Insights in Measuring Similarity (Panagiotis Symeonidis, Alexandros Nanopoulos, Apostolos N. Papadopoulos, and Yannis Manolopoulos, Aristotle University of Thessaloniki, Greece)
  • Discovering User Profiles from Papers by Using Word Sense Disambiguation (G. Semeraro, P. Basile, M. Degemmis, and P. Lops, Universita di Bari, Italy)

Workshop Program

09:30-09:45

Welcome and opening

Session 1

09:45-10:30

Invited Talk  
Web Usage Mining & Personalization in Noisy, Dynamic, and Ambiguous Environments
Olfa Nasraoui, University of Louisville

10:30-11:00

Coffee Break

Session 2

11:00-11:30

Collaborative Filtering: Fallacies and Insights in Measuring Similarity
(Panagiotis Symeonidis, Alexandros Nanopoulos, Apostolos N. Papadopoulos, and Yannis Manolopoulos, Aristotle University of Thessaloniki, Greece)

11:30-12:00

An Analysis of Bloggers and Topics for a Blog Recommender System
(Conor Hayes, Paolo Avesani, Sriharsha Veeramachaneni, ITC-IRST, Trento, Italy)

12:00-12:30

Semi-Supervised Learning to Extract Attribute-Value Pairs from Product Descriptions on the Web
(Katharina Probst, Rayid Ghani, Marko Krema, Andy Fano, Yan Liu, Accenture Technology Labs, Chicago, USA/Carnegie Mellon University, Pittsburgh, PA, USA)

12:30-14:00

Lunch Break

Session 3

14:00-14:30

Web Usage Mining and XML Mining: a real case study
(Federico Michele Facca, Politecnico di Milano, Italy)

14:30-15:00

Mining Associations from Web Query Logs
(Benjamin Rey, Pradhuman Jhala, Yahoo!
Research, Burbank, USA)

15:00-15:30

XPath-Wrapper Induction by generalizing tree traversal patterns
(Tobias Anton, Technical University of Darmstadt, Germany)

15:30-16:00

Coffee Break

Session 4

16:00-16:30

Discovering User Profiles from Papers by Using Word Sense Disambiguation
(G. Semeraro, P. Basile, M. Degemmis, and P. Lops, University of Bari, Italy)

16:30-17:00

Discussion, Closing

Proceedings

The electronic workshop proceedings are available as 

Organization

Workshop Chairs:

  • Bettina Berendt, Institute of Information Systems of Humboldt University Berlin, Germany
  • Andreas Hotho, Group of Knowledge and Data Engineering, University of Kassel, Germany
  • Dunja Mladenic, J. Stefan Institute, Ljubljana, Slovenia
  • Giovanni Semeraro, Department of Informatics, University of Bari, Italy

Program Committee (to be extended)

Related Events

Kontakt: