RSCTC 2010, June 28-30, Warsaw, Poland

Combined Learning Methods and Mining Complex Data

RSCTC 2010 special session

Motivation and Goals:

Data mining and machine learning have shown tremendous progress in the last decades. Numerous methods have been introduced to discover different representations of knowledge from data and the number of their applications in various fields is growing. Nevertheless, many of approaches are based on using a single learning algorithm to static and standard forms of data (mainly tabular forms). However, the rapid growth of information technology gives access to more complex, larger and, generally speaking, “more difficult” data sets that pose new challenges for researchers and ask for a variety of dedicated approaches. Hence, in this session we would like to focus on data characteristics related to the following modern challenges:

  1. Processing data streams and learning in changing and distributed environments. Many sources generate continuous and time changing data, where the data distributions and target concepts change over time. Mining them and adapting to concept drifts is of great interest.
  2.  Semi-supervised learning. In many domains only a limited number of labelled examples is available, therefore learning approaches should take as much advantages of unlabelled data as possible to produce the efficient solution. In case of classification, it leads to development of such approaches as co-training or active learning. On the other hand, supervised information is also taken into account in new clustering algorithms.
  3. Learning from imbalanced data, where one class contains much smaller number of examples than the remaining classes. The imbalanced distribution of classes constitutes a difficulty for standard learning algorithms and calls for specialized approaches.

The other aim of this session is to promote complex learning methods such as combining several strategies, with classifier ensembles being of particular interest. Although the multiple classifiers are still “young” they have already proved to be accurate, flexible and sometime more efficient than single classifiers. The scope of this session also covers other combined methods such as regression ensembles, hierarchical classifiers or meta-learning, and their integration with feature processing.

Both methodological and application oriented papers fit the session. In particular, we encourage researchers to study the use of combined learning methods for above mentioned challenges. Other applications are also encouraged, including relationships to generalisations of rough sets for knowledge discovery.

The main aim of this session is to gather researchers interested in these issues and to demonstrate results in these and related areas of mining difficult data.

Topics of interest:  

  • Supervised ensemble learning problems and multiple classifier systems
  • Theoretical aspects of constructing combined learning systems
  • Combined learning methods for regression and ordinal data
  • Classification, clustering and frequent patterns from data streams
  • Ensemble learning in changing environments
  • Detecting changes and concept drift in evolving data
  • Knowledge discovery from ubiquitous environments
  • Incremental online learning algorithms
  • Active learning and co-training
  • Semi-supervised approaches (including clustering)
  • Sampling techniques for imbalanced data
  • Handling class imbalance by modifying inductive bias and post-processing of learned models
  • Pre-processing, structuring and organizing complex data
  • Applications, especially in data mining, medicine, text processing, web mining, image or multimedia analysis, bioinformatics.

Organisers, contact:

Jerzy Stefanowski - Jerzy.Stefanowski@cs.put.poznan.pl
Poznań University of Technology, Institute of Computing Science
ul. Piotrowo 2, 60-965 Poznań, Poland 
phone: +48 61 6652933
www.cs.put.poznan.pl/jstefanowski

Program Committee of the Session:

Włodzisław Duch
Nicolaus Copernicus University, Toruń, Poland
Krzysztof Krawiec
Poznań University of Technology, Poland
Joao Gama
Univeristy of Porto, Portugal
Ludmila I. Kuncheva
University of Wales, Bangor, UK
Jerzy Grzymała-Busse
University of Kansas, Lawrence, USA
Stan Matwin
University of Ottawa, Canada
Nathalie Japkowicz
University of Ottawa, Canada
Ernestina Menasalvas
Technical University of Madrid, Spain
Jacek Koronacki
IPI PAN, Warsaw, Poland
Zbigniew Raś
University of North Carolina – Charlotte, USA
Wojciech Kotłowski
Centrum Wiskunde & Informatica, Amsterdam, Netherlands
Alexey Tsymbal
Corporate Technology Division, Siemens AG, Erlangen, Germany