ISG Talks are sponsored by Couchbase.
Xinyuan Lin: Data Science Tasks Implemented with Scripts versus GUI-Based Workflows: The Good, the Bad, and the Ugly.
DBH 4011Abstract: As leveraging large-scale data analytics becomes the norm for many applications, platforms for developing these capabilities have become increasingly important. This work compares the benefits and drawbacks of implementing […]
Mike Heddes: Efficient Cardinality Estimation of Multi-Join Queries using Count Sketches
DBH 4011Abstract: Cardinality estimates are a primary input to query optimizers to determine an appropriate join order. The seminal AMS sketch can estimate the cardinality of an equi-join between two relations […]
Pat Helland (Salesforce): Scalable OLTP in the Cloud: What’s the BIG DEAL?
DBH 4011Abstract: The pursuit of scalable OLTP systems has been the holy grail of my career. Because OLTP systems are typically split into applications and databases, the isolation semantics provided by […]
Mohammad Sadoghi (UC Davis): The Journey of Building Global-Scale Sustainable Blockchain Fabric
DBH 6011Abstract The inception of Bitcoin and blockchain has renewed the vision of a democratic and decentralized computational paradigm, that is, to ingrain integrity, transparency, and accountability into the very fabric […]
Aditya Parameswaran (Berkeley): Enhance, Don’t Replace: A Recipe for Success in Data Tooling
DBH 6011Enhance, Don't Replace: A Recipe for Success in Data Tooling Abstract: Most data analysis and data science is performed in human-centered tools, such as spreadsheets, visual analytics tools, and data science […]
Arnab Nandi (OSU): Data Exploration in a Camera-first World: Query and Result Challenges
DBH 4011Prof. Arnab Nandi Associate Professor, Computer Science and Engineering The Ohio State University Friday, October 11, 2024 at 11 a.m. Donald Bren Hall 6011 Title: "Data Exploration in a Camera-first […]
Nika Mansouri Ghiasi (ETH): Storage-Centric Computing for Genomics and Metagenomics
DBH 4011Title: Storage-Centric Computing for Genomics and Metagenomics Abstract Genomics and metagenomics applications have enabled significant advancements in many critical areas. The exponential growth of genomic data poses unprecedented challenges in […]
Yannis Papakonstantinou (Google): Vector Search and Databases
DBH 6011Yannis Papakonstantinou Distinguished Engineer, Query Processing and GenAI at Google Cloud Databases Abstract: Semantic search ability, via embedding (vectors) and vector indexing, has been added to Google Cloud Platform (GCP) […]
Michael Jungmair (TU Munich): A Compiler-Centric Query Engine Design for Mixed Workloads and Modern Hardware
DBH 3011A Compiler-Centric Query Engine Design for Mixed Workloads and Modern Hardware 11/1/2024, 1:00 PM 2 PM, DBH 3011 Michael Jungmair, Technical University of Munich, Germany Abstract: Relational query engines are increasingly expected to handle more than just relational queries and also run on modern hardware that is increasingly parallel and distributed. However, it is not clear how existing system designs can deal with these two challenges effectively. We propose a holistic, compiler-centric design for data processing systems that is designed for tightly integrated optimization and execution of relational queries, non-relational workloads and user-defined functions on modern hardware. Bio: Michael Jungmair is a third year PhD student at the Technical University of Munich. Supervised by Jana Giceva, he is performing research in the intersection of database engines and compiler technology. So far, this research culminated in the design and implementation of LingoDB (lingo-db.com), a novel query engine based on the MLIR compiler framework