Research: P2P Networks, Semantic Data Warehouse Search, Parallel XQuery on Hadoop, Delta Indexing, Solid State Disks, Data Integration, Personal Information Management
IT: Web hosting, Web applications, Linux, Open-Source
Semantic Data Warehouse Search
Novel search strategies that enable easy-to-use keyword search over a terabyte-scale enterprise data warehouse
Dormant Projects
interFace
Web hosting without web hosting
5s - Delta Indexing
What can you do with 5 seconds?
This lecture teaches the basics of modeling and implementing databases and information systems. It focuses on relational databases, but also covers extended models, e.g. for scientific applications or models used on the Internet. More precisely, the lecture covers the following topics: E/R and UML modeling, the relational data model, object-relational models, semi-structured data models and XML, relational data modeling theory (normal forms), SQL, referential integrity, security, transactions and data warehousing.
The goal of this course is to teach students how to evaluate the performance of complex computer and software systems. Accordingly, the course studies the methodology for carrying out experiments and measurements and explains how to model systems using queueing networks.
This course extends the basic concepts of relational data management introduced in an earlier course to examine the internals of the architecture, implementation, and optimization of a relational database system. These include storage, indexing, query processing and optimization, transaction management, and performance tuning and benchmarking. The course will additionally introduce the basic concepts behind distributed databases, data integration, and information retrieval.
The course covers the fundamental concepts of computer programming and gives a short overview of object-oriented program design. Specific topics include: computers, programming languages and compilers, data types, type conversion, expressions, language constructs, arrays, pointers, references, functions, scope, modular programming, files, classes, inheritance, polymorphism.
Topics: storage hierarchies, database buffer management, physical database design, query processing, transaction processing (synchronization and recovery), tuning, architecture and variants, applications. The goal of this course is to understand the internals of the implementation and optimization of a relational database system. Index structures, transaction processing, and query processing are discussed.
This lecture presents the basic tools required for the development of database applications. The focus is on relational database systems and applications that can be built using relational database technology. The following topics will be covered: E/R and UML modelling, the relational data model, relational design theory (normal forms), SQL, database integrity, security, and a brief introduction to transactions and data warehousing (OLAP).
2007
Architecture and Implementation of Database Systems
This course reviews fundamental ideas and algorithms that are used in the construction of database systems. Topics to be covered include: query processing, query optimization, one-dimensional indexing, multi-dimensional indexing, performance aspects of database systems, database tuning, data layout, synchronisation, recovery. An additional purpose of this course is to give students the necessary background and tools to be able to write an MSc thesis in the area of information systems.
Workshops
MatChes - Ein Informatikprojekt an einem Tag (A Computer Science Project in One Day), held 16 times across Switzerland, 2008-2011
Theses Supervision
Mike Klausmann. User Feedback Integration - Incremental Improvement. Master thesis, 2011
(e-collection)
Georg Polzer. Scalable Log Analysis using Xadoop. Bachelor thesis, 2010
Florian Keusch. Indexing on Solid State Drives based on Flash Memory. Master thesis, 2009
(e-collection)
Stefan Hildenbrand. Performance Tradeoffs in Write-Optimized Databases. Master thesis, 2008
(e-collection)
Sandro Blum. A Generic Merge-Based Dynamic Indexing Framework for iMeMex. Master thesis, 2008
(e-collection)
André Schmidt. Private and Public Search in iMeMex. Semester thesis, 2008
Publications
2012
Lukas Blunschi, Claudio Jossen, Donald Kossmann, Magdalini Mori, Kurt Stockinger. SODA: Generating SQL for Business Users
VLDB 2012 (to appear)
Claudio Jossen, Lukas Blunschi, Magdalini Mori, Donald Kossmann, Kurt Stockinger. The Credit Suisse Meta-data Warehouse
ICDE 2012, industrial track
2011
Lukas Blunschi, Claudio Jossen, Donald Kossmann, Magdalini Mori, Kurt Stockinger. Data-Thirsty Business Analysts need SODA - Search Over DAta Warehouse
CIKM 2011, demo
Lukas Blunschi, Claudio Jossen, Donald Kossmann, Magdalini Mori, Kurt Stockinger. SODA - Data-Thirsty Business Analysts need SODA - Search Over DAta Warehouse
MICS Workshop 2011, poster
Jens Dittrich, Lukas Blunschi, Marcos Antonio Vaz Salles. MOVIES: indexing moving objects by shooting index images
Geoinformatica 2011, journal paper
2010
Lukas Blunschi, Maria Grineva, Maxim Grinev, Georg Polzer, Donald Kossmann. Xadoop - High-Performance Log Analysis on Hadoop
MICS Workshop 2010, poster
Marcos Antonio Vaz Salles, Jens Dittrich, Lukas Blunschi. Intensional Associations in Dataspaces
ICDE 2010, short paper
2009
Jens Dittrich, Lukas Blunschi, Marcos Antonio Vaz Salles. Indexing Moving Objects Using Short-Lived Throwaway Indexes
SSTD 2009
Jens Dittrich, Marcos Antonio Vaz Salles, Lukas Blunschi. iMeMex: From Search to Information Integration and Back
IEEE Data Eng. Bull. 32(2): 28-35 (2009)
2008
Marcos Antonio Vaz Salles, Jens Dittrich, Lukas Blunschi. Adding structure to web search with iTrails [position paper]
ICDE Workshops 2008
Jens Dittrich, Lukas Blunschi, Marcos Antonio Vaz Salles. Dwarfs in the rearview mirror: how big are they really?
PVLDB 1(2): 1586-1597 (2008)
Marcos Antonio Vaz Salles, Jens Dittrich, Lukas Blunschi. Managing Personal Information using iTrails
SIGCHI PIM Workshop 2008, Florence, Italy
2007
Marcos Antonio Vaz Salles, Jens-Peter Dittrich, Shant Kirakos Karakashian, Olivier René Girard, Lukas Blunschi. iTrails: Pay-as-you-go Information Integration in Dataspaces
VLDB 2007, Vienna, Austria
Jens-Peter Dittrich, Lukas Blunschi, Markus Färber, Olivier René Girard, Shant Kirakos Karakashian, Marcos Antonio Vaz Salles. From Personal Desktops to Personal Dataspaces: A Report on Building the iMeMex Personal Dataspace Management System
BTW 2007, Aachen, Germany
Lukas Blunschi, Jens-Peter Dittrich, Olivier René Girard, Shant Kirakos Karakashian, Marcos Antonio Vaz Salles. A Dataspace Odyssey: The iMeMex Personal Dataspace Management System
CIDR 2007, Asilomar, USA
2005
Jens-Peter Dittrich, Marcos Antonio Vaz Salles, Donald Kossmann, Lukas Blunschi. iMeMex: Escapes from the Personal Information Jungle
VLDB 2005, Trondheim, Norway