Table of Contents of the Wokshop proceedings on SDA

Knowledge discovery from symbolic data and the SODAS software

E. Diday

 

Symbolic Analysis of Financial Data

F. Goupil, M. Touati, E. Diday and H. Van Der Veen

 

Generalization of the Principal Components Analysis to Histogram Data

O. Rodriguez, E. Diday and S. Winsberg

 

Symbolic Representation of Long Time-Series

G. Hebrail and B. Hugueney

 

Pyramidal Clustering Algorithms in ISO-3D Project

O. Rodriguez and E. Diday

 

Clustering Large Datasets and Visualizations of Large Hierarchies and Pyramids: Symbolic Data Analysis Approach

V. Batagelj, E. Pavleti\v{c}, M. Zaver\v{s}nik and S. Korenjak-\v{C}erne

 

Temporal Symbolic Descriptions Graphics in ISO-3D

M. Noirhomme, A. Nahimana and C. Mazel

 

Marking and Generalization by Symbolic Descriptions in the Symbolic Official Data Analysis Software

M. Gettler-Summa

 

Preface

 

This volume contains a selection of papers presented at the workshop on Symbolic Data

Analysis of the PKDD'2000 conference which was held in Lyon the 12 September 2000. This book contains original research contributions, innovative applications and overview papers in various aspects of Symbolic Data Analysis.

When observations in large data sets are aggregated into smaller more manageable data sizes, the resulting descriptions of the new units invariably involve ``symbolic data". By symbolic data, we mean that rather than a specific categorical or numerical value, an observed value can be a set of categories or numbers, an interval or a probability distribution or any kind or more complex information than the usual one. In addition there may be rules or taxonomies. Hence, Symbolic Data Analysis generalises classical methods of exploratory, statistical and graphical data analysis to more complex data issued from huge Relational Data Bases. Now the domain is enhanced by two prototype software called SODAS and ISO3'D issued from two European projects involving 21 European industrial or academic teams from ten countries. Several papers of this book illustrate these software.

The volume presents first an overview and the state of the art in Symbolic Data Analysis. Areas which received more attention in this book are applications to financial data and long time series, generalising standard methods of non supervising Data Analysis with the example of Principal Component Analysis. An example of supervised method for extracting symbolic descriptions from categorical data is also given. Finally, the last papers are devoted to the important question of visualising symbolic descriptions of concepts obtained from queries to huge Relational Data Bases or from clustering large data sets by Hierarchies or special overlapping clusters called "Pyramids".

Acknowledgements

First of all we wish to express our gratitude towards the authors of the papers in the present volume, not only for their contribution but also for their diligence and their timely production of their papers. We also thank the organizer of PKDD'2000 for their invitation to organise this Workshop and specially D. Zighed for his help in this publication.

Paris, August 2000

Edwin Diday,

Oldemar Rodriguez

 

 

 

Program 12 September

9h30 Knowledge discovery from symbolic data and the SODAS software

E. Diday

 

10h Symbolic Analysis of Financial Data

F. Goupil, M. Touati, E. Diday and H. Van Der Veen

 

10h30 Coffee break

 

11h00 Symbolic Representation of Long Time-Series

G. Hebrail and B. Hugueney

 

11h30 Generalization of the Principal Components Analysis to Histogram Data

O. Rodriguez, E. Diday and S. Winsberg

 

12h00 Marking and Generalization by Symbolic Descriptions in the Symbolic Official Data Analysis Software

M. Gettler-Summa

 

12h 30 LUNCH

 

14h 30 The SODAS and ISO3'D Software: General presentation

E. Diday, C. Mazel

 

15h From Data Base To Symbolic Objects in SODAS

F. Vautrain (Dauphine University)

 

15h 30 Symbolic Data Analysis Methods in SODAS by Examples

M. Touati, M. Summa, F. Vautrain

 

16h Coffe Break

 

16h30 Pyramidal Clustering Algorithms in ISO-3D Project

O. Rodriguez and E. Diday

 

16h50 Clustering Large Datas sets and Visualizations of Large Hierarchies and Pyramids: Symbolic Data Analysis Approach

V. Batagelj, E. Pavleti\v{c}, M. Zaver\v{s}nik and S. Korenjak-\v{C}erne

 

17h30 Temporal Symbolic Descriptions Graphics in ISO-3D

M. Noirhomme, A. Nahimana and C. Mazel

 

17h30 General discussion