Knowledge and Data Engineering
Uni Kassel

Igor Novakovic - Mastering Unstructured Information with SMILA - the SeMantic Information Logistics Architecture

The amount and diversity of information are growing exponentially, mainly in the area of unstructured data such as emails, text files, blogs and images. Poor data accessibility, missing integration of user rights and the lack of semantic metadata are constraining factors for building next-generation enterprise search and other document-centric applications. Missing standards result in proprietary solutions with high short- and long-term costs. Overcoming these problems is key to gaining agility in an organization.

SMILA is an extensible framework for processing unstructured information in the enterprise. Besides providing essential infrastructure components and services, SMILA also delivers ready-to-use add-on components, such as connectors to the most relevant data sources. Building on the framework lets developers concentrate on creating higher-value solutions, such as semantic search and information extraction applications.

SMILA is an open source project under the umbrella of the Eclipse Foundation. It is also part of the German research programme THESEUS. Further information can be found at and

This half-day SMILA tutorial introduces the concepts and approach behind the framework and shows how to use it to build an application and how to integrate new components into it. The following topics will be addressed:
  • SMILA in a nutshell
  • Installation, crawler and service configuration, and building a search application with SMILA and Lucene
  • Creating a simple native SMILA component
  • Using Web Services as SMILA components (Open Calais as exercise)
  • Presentation of some Demo Applications based on SMILA

Participants should have a basic understanding of Java and programming. For the practical exercises, a laptop running Windows or Linux is required. Participants will receive a CD-ROM containing the most recent SMILA release, Eclipse, Protege and a Java SDK.

The tutorial is presented by Igor Novakovic, Attensity Europe GmbH, Germany.

It will be held on Wednesday from 2:30 pm to 6:00 pm.