BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Information Systems Group - ECPv6.4.0.1//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-ORIGINAL-URL:https://isg.ics.uci.edu
X-WR-CALDESC:Events for Information Systems Group
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/Los_Angeles
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20250309T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20251102T090000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20250214T130000
DTEND;TZID=America/Los_Angeles:20250214T140000
DTSTAMP:20260505T112235
CREATED:20250211T005519Z
LAST-MODIFIED:20250225T032621Z
UID:2167-1739538000-1739541600@isg.ics.uci.edu
SUMMARY:Jiadong Bai:  Supporting Data Science Education Using Texera with a Cloud Infrastructure
DESCRIPTION:Abstract\nIn this talk\, we will first present our paper at the DSE-K12 conference with the title “DS4ALL: Teaching High-School Students Data Science and AI/ML Using the Texera Workflow Platform as a Service.” Traditional data science education often requires students to have programming experience and install local software. It also makes collaboration inefficient and slows down the feedback loop when students need help from TAs. To address these challenges\, we developed a new teaching paradigm using Texera\, and successfully hosted DS4ALL summer programs in 2023 and 2024. As a result\, students with no prior coding experience were able to perform data analysis using AI/ML techniques on the platform.The second part of the talk will focus on how we are scaling to a broader audience based on the success of the Texera system used in DS4ALL\,. There are several challenges of building such a cloud  infrastructure\, such as handling versatile service traffic\, supporting concurrent isolated workflow executions\, and managing diverse data storage needs. To address these challenges\, we develop Texera to be able to run using Kubernetes to achieve scalable service management; For workflow execution\, we run each workflow in an isolated Kubernetes pod to ensure performance and security. On the data storage side\, we design a storage layer that integrates LakeFS for managing versioned datasets and Apache Iceberg for handling versioned tables with support for concurrent read/write operations. These solutions allow Texera to be deployed as a cloud service to provide a scalable\, secure\, and efficient environment for data science workflows.Bio\nJiadong Bai is a second-year Ph.D. student in the Computer Science Department at UC Irvine\, with research interests in data systems\, data science\, and big data analysis. He’s supervised by Prof. Chen Li.Shengquan Ni is a sixth-year Ph.D. student in the Department of Computer Science advised by Professor Chen Li. His research interests include big data processing\, distributed systems\, data analytics\, and data science.
URL:https://isg.ics.uci.edu/event/jiadong-bai-tbd/
LOCATION:DBH 3011
END:VEVENT
END:VCALENDAR