ISG Talks are sponsored by Couchbase.
Aditya Parameswaran (Berkeley): Enhance, Don’t Replace: A Recipe for Success in Data Tooling
DBH 6011Enhance, Don't Replace: A Recipe for Success in Data Tooling Abstract: Most data analysis and data science is performed in human-centered tools, such as spreadsheets, visual analytics tools, and data science […]
Arnab Nandi (OSU): Data Exploration in a Camera-first World: Query and Result Challenges
DBH 3011Prof. Arnab Nandi Associate Professor, Computer Science and Engineering The Ohio State University Friday, October 11, 2024 at 11 a.m. Donald Bren Hall 6011 Title: "Data Exploration in a Camera-first […]
Nika Mansouri Ghiasi (ETH): Storage-Centric Computing for Genomics and Metagenomics
DBH 3011Title: Storage-Centric Computing for Genomics and Metagenomics Abstract Genomics and metagenomics applications have enabled significant advancements in many critical areas. The exponential growth of genomic data poses unprecedented challenges in […]
Yannis Papakonstantinou (Google): Vector Search and Databases
DBH 6011Yannis Papakonstantinou Distinguished Engineer, Query Processing and GenAI at Google Cloud Databases Abstract: Semantic search ability, via embedding (vectors) and vector indexing, has been added to Google Cloud Platform (GCP) […]
Michael Jungmair (TU Munich): A Compiler-Centric Query Engine Design for Mixed Workloads and Modern Hardware
DBH 3011A Compiler-Centric Query Engine Design for Mixed Workloads and Modern Hardware 11/1/2024, 1:00 PM 2 PM, DBH 3011 Michael Jungmair, Technical University of Munich, Germany Abstract: Relational query engines are increasingly expected to handle more than just relational queries and also run on modern hardware that is increasingly parallel and distributed. However, it is not clear how existing system designs can deal with these two challenges effectively. We propose a holistic, compiler-centric design for data processing systems that is designed for tightly integrated optimization and execution of relational queries, non-relational workloads and user-defined functions on modern hardware. Bio: Michael Jungmair is a third year PhD student at the Technical University of Munich. Supervised by Jana Giceva, he is performing research in the intersection of database engines and compiler technology. So far, this research culminated in the design and implementation of LingoDB (lingo-db.com), a novel query engine based on the MLIR compiler framework
Kunwoo Park: CloudMapper: A Pay-as-you-go Solution for Accelerating Genomics Sequence Alignment Using Public Clouds
DBH 3011CloudMapper: A Pay-as-you-go Solution for Accelerating Genomics Sequence Alignment Using Public Clouds Abstract: Single-cell RNA sequencing (scRNA-seq) alignment remains a computational bottleneck in bioinformatics data analysis. As datasets grow in size […]
Sainyam Galhotra (Cornell): Context-aware Responsible Data Science
DBH 6011Abstract: Data-based systems are increasingly used in applications that have far-reaching consequences and long-lasting societal impact. However, the development process remains highly specialized, tedious, and unscalable. This produces a manually […]
Binbin Gu: PoneglyphDB: Efficient Non-interactive Zero-Knowledge Proofs for Arbitrary SQL Queries Verification
DBH 3011Abstract: In database applications involving sensitive data, the dual imperatives of data confidentiality and provable (verifiable) query processing are important. This paper introduces PoneglyphDB, a database system that leverages non-interactive zero-knowledge proofs […]
Shengquan Ni: IcedTea: Efficient and Responsive Time-Travel Debugging in Dataflow Systems
DBH 3011Abstract: As data analytics grow in popularity, the increasing volume of data and complexity of jobs require users to wait longer to see results, hindering productivity and causing frustration. To address […]
Abhishek Singh: LogPoseDB: Transaction Handoff and Agreement in Edge-Cloud Systems
DBH 3011Abstract: Emerging IoT and edge applications demand fast response times that cannot be achieved by faraway cloud datacenters. This motivates building edge-cloud systems where nodes on the edge can participate in […]
Xiaodong Zhang (The Ohio State University): Data Management: Interactions with Computer Architecture and Systems
DBH 6011Abstract: We have entered a data-centric computing era, characterized by the coexistence of diverse parallel and specialized hardware accelerators along with general-purpose processors. In this ecosystem, minimizing data movement has become […]
Yicong Huang: Building Data Systems to Broaden the Access of Data Science, AI, and ML
DBH 3011Abstract In an era where data-driven decision-making shapes industries, governments, and everyday life, the ability to leverage data science has become an essential skill. Modern data science tools—encompassing data collection, […]
Amr El Abbadi (UCSB): Practical Approaches for Private and Scalable Information Data Management Systems
DBH 6011Practical Approaches for Private and Scalable Information Data Management Systems Amr El Abbadi Professor of Computer Science University of California at Santa Barbara Abstract. Increasingly countries and regions have […]
Jiadong Bai: TBD
DBH 3011Ketan C Maheshwari (Oak Ridge National Laboratory): Enacting Distributed HPC Workflows: Opportunities and Challenges
DBH 3011Abstract: The Dept of Energy (DOE) complex comprises of many science facilities that could be classified as data producing (eg. the Advanced Photon Source at Argonne National Laboratory) and consuming (eg. […]