BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Information Systems Group - ECPv6.4.0.1//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-ORIGINAL-URL:https://isg.ics.uci.edu
X-WR-CALDESC:Events for Information Systems Group
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/Los_Angeles
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20240310T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20241103T090000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20241017T150000
DTEND;TZID=America/Los_Angeles:20241017T160000
DTSTAMP:20260425T075850
CREATED:20241011T010505Z
LAST-MODIFIED:20250211T004309Z
UID:2128-1729177200-1729180800@isg.ics.uci.edu
SUMMARY:Nika Mansouri Ghiasi (ETH): Storage-Centric Computing for Genomics and Metagenomics
DESCRIPTION:Title: Storage-Centric Computing for Genomics and Metagenomics \nAbstract \nGenomics and metagenomics applications have enabled significant advancements in many critical areas. The exponential growth of genomic data poses unprecedented challenges in genomics and metagenomic applications. These applications suffer from significant data movement overheads from the storage system. To fundamentally address these overheads\, we make a case for storage-centric computing. \nFirst\, we propose GenStore\, the first in-storage processing system designed for genome sequence analysis that greatly reduces both data movement and computational overheads of genome sequence analysis by exploiting low-cost and accurate in-storage filters. We address the challenges of in-storage processing\, supporting reads with 1) different read lengths and error rates\, and 2) different degrees of genetic variation. Through rigorous analysis of read mapping processes\, we design low-cost hardware accelerators and data/computation flows inside a NAND flash-based SSD. Our evaluation using a wide range of real genomic datasets shows that GenStore significantly improves the read mapping performance of state-of-the-art software (hardware) baselines by 2.07-6.05× (1.52-3.32×) for read sets with high similarity to the reference genome and 1.45-33.63× (2.70-19.2×) for read sets with low similarity to the reference genome. \nSecond\, we propose MegIS\, the first in-storage processing system designed to significantly reduce the data movement overhead of the end-to-end metagenomic analysis pipeline. MegIS is enabled by our lightweight design that effectively leverages and orchestrates processing inside and outside the storage system. Through our detailed analysis of the end-to-end metagenomic analysis pipeline and careful hardware/software co-design\, we address \nin-storage processing challenges for metagenomics via specialized and efficient 1) task partitioning\, 2) data/computation flow coordination\, 3) storage technology-aware algorithmic optimizations\, 4) data mapping\, and 5) lightweight in-storage accelerators. MegIS’s design is flexible\, capable of supporting different types of metagenomic input datasets\, and can be integrated into various metagenomic analysis pipelines. Our evaluation shows that MegIS outperforms the state-of-the-art performance- and accuracy-optimized software metagenomic tools by 2.7×–37.2× and 6.9×–100.2×\, respectively\, while matching the accuracy of the accuracy-optimized tool. MegIS achieves 1.5×–5.1× speedup compared to the state-of-the-art metagenomic hardware-accelerated (using processing-in-memory) tool\, while achieving significantly higher accuracy. \n Bio \nNika Mansouri Ghiasi is a Ph.D. candidate in the SAFARI Research Group at ETH Zürich\, working with Professor Onur Mutlu. Her current research interests are in computer architecture and bioinformatics\, focusing on 1) large-scale bioinformatics applications\, storage systems\, and their interactions\, and 2) emerging technologies such as ultra-dense 3D integrated systems. Nika has co-authored several works on these topics in major computer architecture venues such as ISCA\, ASPLOS\, and MICRO\, as well as major bioinformatics venues such as ISMB\, Bioinformatics\, and Nature Reviews. \n 
URL:https://isg.ics.uci.edu/event/nika-mansouri-ghiasi-eth-storage-centric-computing-for-genomics-and-metagenomics/
LOCATION:DBH 3011
END:VEVENT
END:VCALENDAR