Table of Contents

Foreword

 

 

 

Sensor Networks and Messaging

 

SwissQM: Next Generation Data Processing in Sensor Networks

1

Rene Mueller, Gustavo Alonso, Donald Kossmann (ETH Zurich)

 

 

 

Data-Driven Processing in Sensor Networks    

10

Adam Silberstein, Gregory Filpus, Kamesh Munagala, Jun Yang (Duke University)  

 

 

 

Rethinking Data Management for Storage-centric Sensor Networks        

22

Deepak Ganesan, Gaurav Mathur, Prashant Shenoy (University of Massachusetts Amherst)

 

 

 

Demaq: A Foundation for Declarative XML Message Processing       

33

Alexander Bšhm, Carl-Christian Kanne, Guido Moerkotte (University of Mannheim)

 

 

 

Unconventional Query Processing

 

Cache-Oblivious Query Processing     

44

Bingsheng He, Qiong Luo (HKUST)

 

 

 

A Black-Box Approach to Query Cardinality Estimation

56

Tanu Malik, Randal Burns (JHU), Nitesh Chawla (University of ND)

 

 

 

Database Cracking      

68

Stratos Idreos, Martin Kersten, Stefan Manegold (CWI)

 

 

 

Database Servers on Chip Multiprocessors: Limitations and Opportunities   

79

Nikos Hardavellas, Ippokratis Pandis, Ryan Johnson, Naju Mancheril, Anastassia Ailamaki, Babak Falsafi (Carnegie Mellon University)

 

 

 

DB&IR Integration

 

The CompleteSearch Engine: Interactive, Efficient, and Towards IR& DB Integration   

88

Holger Bast, Ingmar Weber (Max-Planck Institute of Informatics)

 

 

 

Efficient and Flexible Information Retrieval using MonetDB/X100 (Demo)

96

Sandor Heman, Marcin Zukowski, Arjen de Vries, Peter Boncz (CWI)

 

 

 

Predicate-based Indexing of Enterprise Web Applications (Demo)

102

Cristian Duda, David A. Graf, Donald Kossmann (ETH Zurich)

 

 

 

Entity Search Engine:  Towards Agile Best-Effort Information Integration over the Web

108

Tao Cheng, Kevin Chang (UIUC)

 

 

 

A Dataspace Odyssey: The iMeMex  Personal Dataspace Management System (Demo)

114

Lukas Blunschi, Jens-Peter Dittrich, Olivier Rene Girard, Shant Kirakos Karakashian, Marcos Antonio Vaz Salles (ETH Zurich)

 

 

 

Grid, P2P, & Communities

 

Turning Cluster Management into Data Management; A System Overview        

120

Eric Robinson, David J. DeWitt (University of Wisconsin-Madison)

 

 

 

Life beyond Distributed Transactions: an ApostateÕs Opinion           

132

Pat Helland (Amazon) 

 

 

 

One table stores all: Enabling painless free-and-easy data publishing and sharing  

142

Beng Chin Ooi, Bei Yu (National University of Singapore), Guoliang Li (Tsinghua University

 

 

 

The Data Ring: Community Content Sharing     

154

Serge Abiteboul (INRIA), Neoklis Polyzotis (University of California Santa Cruz)

 

 

 

P2P Web Search: Make It Light, Make It Fly (Demo)

164

Matthias Bender, Sebastian Michel, Josiane Xavier Parreira, Tom Crecelius (Max-Planck Institute of Informatics)

 

 

 

DBLife: A Community Information Management Platform for the Database Research Community (Demo)

169

Pedro DeRose (University of Illinois Urbana Champaign), Warren Shen, Fei Chen, Yoonkyong Lee, Doug Burdick, AnHai Doan (University of Wisconsin-Madison), Raghu Ramakrishnan (Yahoo Research)

 

 

 

Keynote

 

One Size Fits All?  Part 2: Benchmarking Studies

173

Mike Stonebraker (MIT), Chuck Bear (Vertica), Ugur Cetintemel (Brown University), Mitch Cherniack (Brandeis University), Tingjian Ge (Brown University), Nabil Hachem, Stavros Harizopoulos (MIT), John Lifter (Streambase), Jennie Rogers, Stan Zdonik (Brown University)

 

 

 

Scientific Data Management

 

Smoothing the ROI Curve for Scientific Data Management Applications    

185

Bill Howe, David Maier, Laura Bright (Portland State University)      

 

 

 

bdbms -- A Database Management System for Biological Data    

196

Walid G. Aref, M.Y. Eltabakh, Mourad Ouzzani (Purdue University)     

 

 

 

Spatial Indexing of Large Multidimensional Databases

207

Istvan Csabai, Marton Trencseni, Laszlo Dobos, Peter Jozsa, Geza Herczegh, Norbert Purger (Eotvos University), Tamas Budavari, Alexander Szalay (John Hopkins University)

 

 

 

Maitri Demonstration: Managing Large Scale Scientific Data (Demo)           

219

Rishi R Sinha, Arash Termehchy, Soumyadeb Mitra, Marianne Winslett (University of Illinois Urbana Champaign)

 

 

 

Data Uncertainty and Integration

 

Structured Querying of Web Text Data: A Technical Challenge        

225

Michael Cafarella, Christopher Re, Dan Suciu, Oren Etzioni (University of Washington)

 

 

 

Object-level Vertical Search    

235

Zaiqing Nie, Ji-Rong Wen, Wei-Ying Ma (Microsoft Research Asia)  

 

 

 

MOMA - A Mapping-based Object Matching System

247

Andreas Thor, Erhard Rahm (University of Leipzig)

 

 

 

XClean in Action (Demo)        

259

Melanie Weis (Humboldt University Berlin), Ioana Manolescu (INRIA)    

 

 

 

QUIC: A System for Handling Imprecision & Incompleteness in Autonomous Databases (Demo)

263

Garrett Wolf, Hemal Khatri, Yi Chen, Subbarao Kambhampati (Arizona State University)

 

 

 

Trio-One: Layering Uncertainty and Lineage on a Conventional DBMS (Demo)

269

Michi Mutsuzaki, Martin Theobald (Stanford University), Ander de Keijzer (University of Twente), Jennifer Widom, Parag Agrawal,  Omar Benjelloun, Anish Das Sarma,  Raghotham Murthy, (Stanford University), Tomoe Sugihara (NEC)

 

 

 

DB Kernel Issues

 

Managing Query Compilation Memory Consumption to Improve DBMS Throughput    

275

Boris Baryshnikov, Cipri Clinciu, Conor Cunningham, Leo Giakoumakis, Slava Oks, Stefano Stefani (Microsoft)

 

 

 

Rethinking Choices for Multi-dimensional Point Indexing: Making the Case for the Often Ignored Quadtree         

281

You Jung Kim, Jignesh Patel (University of Michigan)

 

 

 

Column Stores for Wide and Sparse Data

292

Daniel Abadi (MIT)

 

 

 

Fragmentation in Large Object Repositories     

298

Russell Sears (UC Berkeley), Catharine van Ingen (Microsoft Research) 

 

 

 

Challenges and Outrageous Ideas

 

An Architecture for Modular Data Centers

306

James Hamilton (Microsoft) 

 

 

 

Isolation Support for Service-based Applications: A Position Paper

314

Paul Greenfield, Alan Fekete (University of Sydney), Julian Jang (CSIRO ICT Centre), Dean Kuo (University of Manchester), Surya Nepal (CSIRO ICT Centre) 

 

 

 

Beyond Just Data Privacy        

324

Bob Mungamuru, Hector Garcia-Molina (Stanford University)

 

 

 

Public Health for the Internet (PHI)      

332

Joe Hellerstein, Tyson Condie (UC Berkeley), Minos Garofalakis (Intel Research, Berkeley), Boon Thau Loo  (UC Berkeley), Petros Maniatis, Timothy Roscoe, Nina Taft (Intel Research, Berkeley)

 

 

 

Keynote

 

Community Systems: The World Online

341

Raghu Ramakrishnan (Yahoo!)

 

 

 

Industry Visions

 

Web-Scale Data Integration: You can afford to Pay as You Go

342

Jayant Madhavan (Google), Shirley Cohen (University of Pennsylvania), Xin (Luna) Dong (University of Washington), Alon Halevy (Google), Shawn Jeffery (UC Berkeley), David Ko (Google), Cong Yu (University of Michigan)

 

 

 

Impliance: A Next Generation Information Management Appliance        

351

Bishwaranjan Bhattacharjee (IBM Watson Research Center), Joseph Glider, Richard Golding, Guy Lohman, Volke Markl, Hamid Pirahesh, Jun Rao, Robert Rees, Garret Swart (IBM Almaden Research Center)

 

 

 

Consistent Streaming Through Time: A Vision for Event Stream Processing

363

Roger Barga, Jonathan Goldstein, Mohamed Ali, Mingsheng Hong (Microsoft Research)

 

 

 

Event-oriented Computing

 

Moirae: History-Enhanced Monitoring  

375

Magdalena Balazinska, YongChul Kwon, Nathan Kuchta (University of Washington), Dennis Lee (Amazon)

 

 

 

Securing history: Privacy and accountability in database systems           

387

Gerome Miklau, Brian Levine (University of Massachusetts Amherst)     

 

 

 

The Case for a Signal-Oriented Data Stream Management System

397

Lewis Girod, Kyle Jamieson, Yuan Mei, Ryan Newton, Stanislav Rost, Arvind Thiagarajan, Hari Balakrishnan, Sam Madden (MIT)

 

 

 

SASE: Complex Event Processing over Streams (Demo)

407

Daniel Gyllstrom (University of Massachusetts Amherst), Eugene Wu (UC Berkeley), Hee-Jin Chae, Yanlei Diao, Patrick Stahlberg, Gordon Anderson (University of Massachusetts Amherst)

 

 

 

Cayuga: A General Purpose Event Monitoring System

412

Alan Demers, Johannes Gehrke, Biswanath Panda, Mirek Riedewald (Cornell University), Varun Sharma (IIT, Delhi), Walker White (Cornell University)