Name: Michal Shmueli-Scheuer (IBM Research): GenAI Benchmarking and Evaluation
Start: 2025-07-16T11:00:00-07:00
End: 2025-07-16T12:00:00-07:00
Location: DBH 6011

ISG Talks are sponsored by Couchbase.

<< All Talks

List
Month

February 2024

February 9, 2024 @ 11:00 am - 12:00 pm

Joseph Hellerstein (UC Berkeley): Hydro: A Compiler Stack for Distributed Programs

DBH 6011

February 12, 2024 @ 1:00 pm - 2:00 pm

Raul Castro Fernandez (U. Chicago): On Data Ecology, Data Markets, the Value of Data, and Dataflow Governance

DBH 4011

March 2024

March 1, 2024 @ 1:00 pm - 2:00 pm

Yunyan Ding: Efficient Mouse Brain Image Processing Using Collaborative Data Workflows on Texera

DBH 4011

March 4, 2024 @ 11:00 am - 12:00 pm

Bratin Saha (AWS Amazon): Scaling Generative AI in the Enterprise

DBH 4011

March 15, 2024 @ 1:00 pm - 2:00 pm

Yinan Zhou: SpendableDB: A UTxO-based decentralized Database

DBH 4011

April 2024

April 5, 2024 @ 1:00 pm - 2:00 pm

Lukasz Golab (University of Waterloo): Understanding models and the data they learn from

DBH 4011

April 12, 2024 @ 1:00 pm - 2:00 pm

Juncheng Fang: ImmortalChopper: Real-Time and Resilient Distributed Transactions in the Edge-Cloud

DBH 4011

April 19, 2024 @ 1:00 pm - 2:00 pm

Mohammed Al-Kateb (Amazon Redshift): The Evolution of Amazon Redshift

DBH 4011

April 26, 2024 @ 1:00 pm - 2:00 pm

Xinyuan Lin: Data Science Tasks Implemented with Scripts versus GUI-Based Workflows: The Good, the Bad, and the Ugly.

DBH 4011

May 2024

May 10, 2024 @ 1:00 pm - 2:00 pm

Mike Heddes: Efficient Cardinality Estimation of Multi-Join Queries using Count Sketches

DBH 4011

May 17, 2024 @ 1:00 pm - 2:00 pm

Pat Helland (Salesforce): Scalable OLTP in the Cloud: What’s the BIG DEAL?

DBH 4011

May 31, 2024 @ 11:00 am - 12:00 pm

Mohammad Sadoghi (UC Davis): The Journey of Building Global-Scale Sustainable Blockchain Fabric

DBH 6011

September 2024

September 27, 2024 @ 11:00 am - 12:00 pm

Aditya Parameswaran (Berkeley): Enhance, Don’t Replace: A Recipe for Success in Data Tooling

DBH 6011

October 2024

October 11, 2024 @ 1:00 pm - 2:00 pm

Arnab Nandi (OSU): Data Exploration in a Camera-first World: Query and Result Challenges

DBH 3011

October 17, 2024 @ 3:00 pm - 4:00 pm

Nika Mansouri Ghiasi (ETH): Storage-Centric Computing for Genomics and Metagenomics

DBH 3011

October 18, 2024 @ 11:00 am - 12:00 pm

Yannis Papakonstantinou (Google): Vector Search and Databases

DBH 6011

November 2024

November 1, 2024 @ 1:00 pm - 2:00 pm

Michael Jungmair (TU Munich): A Compiler-Centric Query Engine Design for Mixed Workloads and Modern Hardware

DBH 3011

November 15, 2024 @ 1:00 pm - 2:00 pm

Kunwoo Park: CloudMapper: A Pay-as-you-go Solution for Accelerating Genomics Sequence Alignment Using Public Clouds

DBH 3011

November 22, 2024 @ 11:00 am - 12:00 pm

Sainyam Galhotra (Cornell): Context-aware Responsible Data Science

DBH 6011

December 2024

December 6, 2024 @ 1:00 pm - 2:00 pm

Binbin Gu: PoneglyphDB: Efficient Non-interactive Zero-Knowledge Proofs for Arbitrary SQL Queries Verification

DBH 3011

January 2025

January 10 @ 1:00 pm - 2:00 pm

Shengquan Ni: IcedTea: Efficient and Responsive Time-Travel Debugging in Dataflow Systems

DBH 3011

January 17 @ 1:00 pm - 2:00 pm

Abhishek Singh: LogPoseDB: Transaction Handoff and Agreement in Edge-Cloud Systems

DBH 3011

January 24 @ 11:00 am - 12:00 pm

Xiaodong Zhang (The Ohio State University): Data Management: Interactions with Computer Architecture and Systems

DBH 6011

January 31 @ 1:00 pm - 2:00 pm

Yicong Huang: Building Data Systems to Broaden the Access of Data Science, AI, and ML

DBH 3011

February 2025

February 7 @ 11:00 am - 12:00 pm

Amr El Abbadi (UCSB): Practical Approaches for Private and Scalable Information Data Management Systems

DBH 6011

February 14 @ 1:00 pm - 2:00 pm

Jiadong Bai: Supporting Data Science Education Using Texera with a Cloud Infrastructure

DBH 3011

February 21 @ 1:00 pm - 2:00 pm

Ketan C Maheshwari (Oak Ridge National Laboratory): Enacting Distributed HPC Workflows: Opportunities and Challenges

DBH 3011

February 28 @ 11:00 am - 12:00 pm

Sainyam Galhotra (Cornell): Context-aware Responsible Data Science

DBH 3011

March 2025

March 7 @ 1:00 pm - 5:00 pm

Lukas Lokowski: Knowledge Graphs and AI: Bridging Enterprise Data and Knowledge Graphs to Leverage AI Applications

DBH 3011

April 2025

April 11 @ 11:00 am - 12:00 pm

Jiawei Han (distinguished lecture): A Retrieval-and-Structuring Approach for LLM-Enhanced, Theme-Focused Scientific Exploration

DBH 6011

“A Retrieval-and-Structuring Approach for LLM-Enhanced, Theme-Focused Scientific Exploration” Abstract: Large Language Models (LLMs) may bring unprecedented power for scientific exploration. However, current LLMs may still encounter major challenges for effective scientific exploration due to their lack of in-depth, theme-focused data and knowledge. Retrieval augmented generation (RAG) has recently become an interesting approach for augmenting LLMs with grounded, theme-specific datasets. We discuss the challenges of RAG and propose a retrieval and structuring (RAS) approach, which enhances RAG by improving retrieval quality and mining structures (e.g., extracting entities and relations and building knowledge graphs) to ensure its effective integration of theme-specific data with LLM. We show the promise of this approach at augmenting LLMs and discuss its potential power for LLM-enabled science exploration. Bio: Jiawei Han is Michael Aiken Chair Professor in the Siebel School of Computing and Data Science, University of Illinois Urbana-Champaign. He received ACM SIGKDD Innovation Award (2004), IEEE Computer Society Technical Achievement Award (2005), IEEE Computer Society W. Wallace McDowell Award (2009), Japan's Funai Achievement Award (2018), and being elevated to Fellow of Royal Society of Canada (2022). He is Fellow of ACM and Fellow of IEEE and served as the Director of Information Network Academic Research Center (INARC) (2009-2016) supported by the Network Science-Collaborative Technology Alliance (NS-CTA) program of U.S. Army Research Lab and co-Director of KnowEnG, a Center of Excellence in Big Data Computing (2014-2019), funded by NIH Big Data to Knowledge (BD2K) Initiative. Currently, he is serving on the executive committees of two NSF funded research centers: MMLI (Molecular Make Research Institute)—one of NSF funded national AI centers since 2020 and I-Guide—The National Science Foundation (NSF) Institute for Geospatial Understanding through an Integrative Discovery Environment (I-GUIDE) since 2021.