Unlike traditional databases, which are reliable and relatively unchanging, data streams arrive in an uninterrupted flow which must be continuously processed. This timely reference book explores and contrasts the old persistent data set model with today's seamless data stream environment, and proposes algorithms and systems that work over continuous data streams. This is the first in-depth treatment of this evolving topic, covering basic data stream techniques, data stream synopses, mining data streams, advanced data stream computations, and systems and architectures for data stream management systems.
About the Author
Minos Garofalakis is a Professor of Computer Science at the School of Electronic & Computer Engineering of the Technical University of Crete, and the Director of the Software Technology and Network Applications Lab (SoftNet). Previously, he worked as a Member of Technical Staff at Bell Labs, Lucent Technologies (1998-2005), as a Senior Researcher at Intel Research Berkeley (2005-2007), and as a Principal Research Scientist at Yahoo! Research (2007-2008). In parallel, he also held an Adjunct Associate Professor position at the EECS Department of the University of California, Berkeley (2006-2008). Minos's research interests include database systems, centralized/distributed data streams, data synopses and approximate query processing, uncertain databases, and big-data analytics and mining. He has published over 140 scientific papers in top-tier international conferences and journals in these areas. His work has resulted in 36 US Patent filings (29 patents issued) for companies such as Lucent, Yahoo!, and AT&T. Minos is an ACM Distinguished Scientist (2011), and a recipient of the Bell Labs President's Gold Award (2004) and a Marie-Curie International Reintegration Fellowship (2010). Johannes Gehrke is a Distinguished Engineer at Microsoft working as an architect and product visionary in the Applications and Services Group. From 1999 to 2015 he was the Tisch University Professor in the Department of Computer Science at Cornell University. Johannes' research interests are in the areas of database systems, data science, and data privacy. Johannes has received a National Science Foundation Career Award, an Arthur P. Sloan Fellowship, an IBM Faculty Award, the Cornell College of Engineering James and Mary Tien Excellence in Teaching Award, the Cornell University Provost's Award for Distinguished Scholarship, a Humboldt Research Award from the Alexander von Humboldt Foundation, the 2011 IEEE Computer Society Technical Achievement Award, and the 2011 Blavatnik Award for Young Scientists from the New York Academy of Sciences. He co-authored the undergraduate textbook Database Management Systems (currently in its third edition), used at universities all over the world. Johannes was Program co-Chair of ACM SIGKDD 2004, VLDB 2007, ICDE 2012, SOCC 2014, and ICDE 2015.Rajeev Rastogi is the Director of Machine Learning at Amazon. Previously, he was Vice President of Yahoo! Labs Bangalore and the founding Director of the Bell Labs Research Center in Bangalore, India. Rajeev is an ACM Fellow and a Bell Labs Fellow. He is active in the fields of databases, data mining, and networking, and has served on the program committees of several conferences in these areas. He currently serves on the editorial board of CACM, and has been an Associate editor for IEEE Transactions on Knowledge and Data Engineering in the past. He has published over 125 papers, and holds over 50 patents. >