ISG Talks are sponsored by Couchbase.
Talk
CS Seminar: Prof. Arun Kumar: The New DBfication of ML/AI
DBH 6011The Department of Computer Science, UC Irvine WELCOMES Prof. Arun Kumar UCSD 5/12/2023, Friday, 11:00 am - noon Place DBH 6011 Abstract: The recent boom in ML/AI applications has brought […]
Yiming Lin: Auto-BI: Automatically Build BI-Models Leveraging Local Join Prediction and Global Schema Graph
DBH 4011Abstract: Business Intelligence (BI) is crucial in modern enterprises and billion-dollar business. Traditionally, technical experts like database administrators would manually prepare BI-models (e.g., in star or snowflake schemas) that join […]
Qiushi Bai: Improving SQL Performance Using Middleware-Based Query Rewriting
DBH 4011Abstract: Query performance is critical in database-supported applications where users need answers quickly to make timely decisions. Traditional databases rely on rewriting queries to improve SQL performance. With the emergence […]
Saeed Kargar: Hamming Tree: The case for Energy-Aware Indexing for NVMs
DBH 4011Zoom Link: https://uci.zoom.us/j/8045933305 Abstract NVM technologies play a crucial role in data storage solutions as well as in battery-powered mobile and IoT devices. However, the challenges of wear-out and energy efficiency […]
Suyash Gupta(UC Berkeley): Dissecting BFT Consensus: In Trusted Components we Trust!
DBH 4011The Information Systems Group (ISG) at UC Irvine welcomes Suyash Gupta UC Berkeley Dissecting BFT Consensus: In Trusted Components we Trust! ABSTRACT The growing interest in reliable multi-party applications has fostered widespread adoption of Byzantine Fault-Tolerant (bft) consensus protocols. Existing bft protocols need f more replicas than Paxos-style protocols to prevent equivocation attacks. trust-bft protocols seek to minimize this cost by making use of trusted components at replicas. This paper makes two contributions. First, we analyze the design of existing trust-bft protocols and uncover three fundamental limitations that preclude most practical deployments. Some of these limitations are fundamental, while others are linked to the state of trusted components today. Second, we introduce a novel suite of consensus protocols, FlexiTrust, that attempts to sidestep these issues. We show that our FlexiTrust protocols achieve up to 185% more throughput than their trust-bft counterparts. BIO Suyash Gupta is a postdoctoral researcher at the SkyLab, University of California, Berkeley. He is also the Lead Architect of ResilientDB fabric. Prior to joining Berkeley, he received his Ph.D. degree from University of California, Davis. He also holds two Master of Science degrees; one from Purdue University and another from Indian Institute of Technology Madras. His current research focuses on attaining safe and efficient, fault tolerant distributed consensus and communication. He has also co-authored a book on fault-tolerant distributed transaction processing at Morgan & Claypool. He has been awarded the Best Graduate Researcher Award for 2021 by UC Davis and Best Paper Award at EuroSys'23. In his free time, Suyash likes to code and his team won Best Hacker Award at BostonHacks, HackIllinois, and HackPrinceton, among others.
Boon Thau Loo(UPenn): Towards Full-Stack Adaptivity in Permissioned Blockchain Systems
DBH 6011The Computer Science Department and Information Systems Group (ISG) at UC Irvine welcomes Boon Thau Loo University of Pennsylvania Towards Full-Stack Adaptivity in Permissioned Blockchain Systems October 20, 2023 […]
Ken Birman (Cornell): Cascade: A Platform for Fast Edge Intelligence
DBH 6011The Computer Science Department and Information Systems Group (ISG) at UC Irvine welcomes Ken Birman Cornell University Cascade: A Platform for Fast Edge Intelligence October 27, 2023 at 11:00AM DBH […]
Nada Lahjouji: ProBE: Proportioning Privacy Budget for Complex Exploratory Decision Support
DBH 4011ProBE: Proportioning Privacy Budget for Complex Exploratory Decision Support Nada Lahjouji PhD Student, UC, Irvine Abstract Decision support (DS) applications play a crucial role in analyzing large volumes of data […]
Vishal Chakraborty: Much Ado About Data-Undo: Semantically Meaningful Data Erasure
DBH 4011Title: Much Ado About Data-Undo: Semantically Meaningful Data Erasure Abstract: Data regulations, such as GDPR and CCPA, are increasingly being adopted globally to protect against unsafe data management practices. Such regulations […]
Shahram Ghandeharizadeh(USC): Intelligent 3D Multimedia Displays using Flying Light Specks
DBH 6011The Computer Science Department and Information Systems Group (ISG) at UC Irvine welcomes Shahram Ghandeharizadeh University of Southern California Intelligent 3D Multimedia Displays using Flying Light Specks January 12 […]
Henry F. Korth (Lehigh University): Blockchain: Computer Science Foundations, Positive Social and Business Impact, and Research Opportunities
DBH 6011The Computer Science Department and Information Systems Group (ISG) at UC Irvine welcomes Henry F. Korth Lehigh University Blockchain: Computer Science Foundations, Positive Social and Business Impact, and Research Opportunities […]
Volker Markl (TU Berlin): Mosaics of Big Data: Database Systems and Information Management – Trends and a Vision
DBH 4011Prof. Dr. Volker Markl Chair of the Database Systems and Information Management (DIMA) Group at TU Berlin Director of the Berlin Institute for the Foundations of Learning and Data (BIFOLD) […]
Shengquan Ni: Supporting time-travel debugging in Texera
Title: Supporting time-travel debugging in Texera Speaker: Shengquan Ni Abstract: Dataflow systems, traditionally used for relational analysis, now support a variety of tasks including complex user-defined functions. As dataflow jobs […]
Raul Castro Fernandez (U. Chicago): On Data Ecology, Data Markets, the Value of Data, and Dataflow Governance
DBH 4011Abstract: Data shapes our social, economic, cultural, and technological environments. Data is valuable, so people seek it, inducing data to flow. The resulting dataflows distribute data and thus value. For […]
Yunyan Ding: Efficient Mouse Brain Image Processing Using Collaborative Data Workflows on Texera
DBH 4011Abstract: In the field of neuroscience, accurately mapping the complex three-dimensional (3D) neural circuitry and architecture of the brain is crucial for advancing our understanding of brain functions and disorders. […]
Yinan Zhou: SpendableDB: A UTxO-based decentralized Database
DBH 4011Abstract: Blockchain technology has attracted a significant amount of attention ever since the Bitcoin blockchain's success. Currently, most of the research and engineering efforts have been centered around monetary transactions such as token exchange protocols. The potential of building databases on top of blockchains is largely overlooked and remains an open problem. The literature on blockchain databases is divided into permissioned blockchains and permissionless account-based blockchains. However, the former is not fully decentralized, and the latter suffers from challenges in performance and cost. We propose SpendableDB, a permissionless UTxO-based blockchain database as a novel approach to the problem of data decentralization. Our design integrates data into individual UTxOs to achieve true decentralization of data ownership that can be securely transferred and traded, similar to how the regular monetary UTxOs are protected by the underlying blockchain's decentralization protocol. Additionally, SpendableDB provides cryptographically secured data integrity and immutable data lineage that can be easily verified. Our implementation and experiments show that our design is economically practical as it incurs a small amount of blockchain transaction fees. Bio: Yinan Zhou is a second-year Ph.D. student in the Computer Science Department at UC Irvine. His primary research focus is on blockchain infrastructure and application developments.
Lukasz Golab (University of Waterloo): Understanding models and the data they learn from
DBH 4011Lukasz Golab (U. Waterloo) Understanding models and the data they learn from Abstract: The modern world is powered by data. However, as the capabilities of data-intensive systems grow, so does […]
Mohammed Al-Kateb (Amazon Redshift): The Evolution of Amazon Redshift
DBH 4011Abstract: In this talk, we will discuss the evolution of Amazon Redshift over the past 10 years. We’ll discuss the Amazon Redshift architecture. We’ll dive deep in the lifecycle of […]
Xinyuan Lin: Data Science Tasks Implemented with Scripts versus GUI-Based Workflows: The Good, the Bad, and the Ugly.
DBH 4011Abstract: As leveraging large-scale data analytics becomes the norm for many applications, platforms for developing these capabilities have become increasingly important. This work compares the benefits and drawbacks of implementing […]
Mike Heddes: Efficient Cardinality Estimation of Multi-Join Queries using Count Sketches
DBH 4011Abstract: Cardinality estimates are a primary input to query optimizers to determine an appropriate join order. The seminal AMS sketch can estimate the cardinality of an equi-join between two relations […]
Mohammad Sadoghi (UC Davis): The Journey of Building Global-Scale Sustainable Blockchain Fabric
DBH 6011Abstract The inception of Bitcoin and blockchain has renewed the vision of a democratic and decentralized computational paradigm, that is, to ingrain integrity, transparency, and accountability into the very fabric […]
Aditya Parameswaran (Berkeley): Enhance, Don’t Replace: A Recipe for Success in Data Tooling
DBH 6011Enhance, Don't Replace: A Recipe for Success in Data Tooling Abstract: Most data analysis and data science is performed in human-centered tools, such as spreadsheets, visual analytics tools, and data science […]
Sainyam Galhotra (Cornell): Context-aware Responsible Data Science
DBH 6011Abstract: Data-based systems are increasingly used in applications that have far-reaching consequences and long-lasting societal impact. However, the development process remains highly specialized, tedious, and unscalable. This produces a manually […]