ISG Talks are sponsored by Couchbase.

<< All Talks

Loading Events

« All Events

  • This event has passed.

Xinyuan Lin: Data Science Tasks Implemented with Scripts versus GUI-Based Workflows: The Good, the Bad, and the Ugly.

April 26, 2024 @ 1:00 pm - 2:00 pm

Abstract: As leveraging large-scale data analytics becomes the norm for many applications, platforms for developing these capabilities have become increasingly important. This work compares the benefits and drawbacks of implementing two commonly used data science platform paradigms: code-based scripts and GUI-based workflows. We implement tasks in both paradigms that provide examples of phases in the typical life cycle of a data science project, including data wrangling, machine learning (ML) model training, and inference. In this talk, we will examine the relative performance of the implementations under each paradigm in various experimental settings. We will discuss the benefits and drawbacks of each platform implementation and provide a foundation for future work in comparing data science platform paradigms.

Bio: Xinyuan Lin is a third-year Ph.D student in the Computer Science Department at UC Irvine. His research interests include data processing systems and big data analytics.

Details

Date:
April 26, 2024
Time:
1:00 pm - 2:00 pm
Event Tags:

Venue

DBH 4011