- MSST 2024

Since the conference was founded, in 1974, by the leading national laboratories, MSST (the International Conference on Massive Storage Systems and Technology) has been a venue for massive-scale storage system designers and implementers, storage architects, researchers, and vendors to share best practices and discuss building and securing the world’s largest storage systems for high-performance computing, web-scale systems, and enterprises.

MSST 2024 will be returning on its 50th anniversary to its new home at Santa Clara University, where it will be hosted by the School of Engineering, and held from June 3rd through June 7th, 2024

Registration is available here (the early-bird registration rates end after May 14th, 2024).

Directions to the venue and hotel information is available here.

The Research CFP (now closed for paper submissions) is archived here, and the poster CFP here.

MSST 2024 is possible thanks to sponsorship and support from SCU’s School of Engineering, and Hammerspace.

June 3rd

Tutorials

8:30 – 9:30

Breakfast & Badge Pickup

A Continental breakfast will be provided from 8:30 to 9:30am.

Badge pick-up will continue from 8:30 to 4:30pm.

9:15 – 9:30

Welcome to Analytics for the Storage Practitioner

Gary Grider (LANL)

9:30 – noon

Columnar Analytics

Overview (what, how, ecosystem, implications on storage)	Fernanda Foertter (Voltron Data)
Object Computational Storage analytics offload	Qing Zheng (LANL)
Erasured Object Computational Storage analytics offload	Donpaul Stephens (AirMettle)
Hardware Accelerated Computational Storage analytics offload	Krishna Maheshwari (Neuroblade)

noon – 1:00

Lunch

1:00 – 3:00

Row Analytics

Overview (what, how, ecosystem, implications on storage)	Rob Johnson (Broadcom)
Hardware Accelerated Row analytics offload	Edward Bortnikov (Pliops)
Hardware Accelerated Row analytics offload	Phil Chan (Eideticom)

3:30 – 4:30

GraphDB / Analytics

Overview (what/how/market/ecosystem/storage implications)

Amine Mhedhbi (Polytechnique Montréal)

June 4th

Invited Track

8:00 – 9:00

Breakfast & Badge Pickup

A Continental breakfast will be provided from 8:00 to 9:00am.

Badge pick-up will continue from 8:30 to 5:00pm.

8:30 – 8:45

Welcome & Opening Remarks

8:45 – 10:15

Computational Storage 1

Open Object Computational Storage Pushdown	Jongryool.Kim (Skhynix)
Open PNFS Pushdown	Dominic Manno (LANL)
Hardware Accelerated Computational Storage analytics offload	Krishna Maheshwari (Neuroblade)

10:30 – 11:30

Computational Storage 2

Erasured Object Computational Storage analytics	Donpaul Stephens (AirMettle)
Ceph Computational Storage	Stephen Bates (Huawei)

11:30 – 12:30

Keynote: Michael Cornwell (Pure) – “Storage Landscapes”

12:30 – 1:30

Lunch

Badge pick-up will continue from 8:30 to 5:00pm.

1:30 – 3:00

Media Trends

Tape (Spectra)	Nathan Thompson (Spectra)
Disk (WDC/Seagate)	Paul Peck (Western Digital)
Flash (Samsung)	Young Paik (Samsung)

3:15 – 4:45

Data Lakes

Apache Data Lake Ecosystem	Fernanda Foertter (Voltron Data)
Iceberg (Apple/LinkedIn/Stripe)	Alex Merced (Dremio)
Tackling I/O Challenges in Modern Data Lakes	Hope Wang (Alluxio)

4:45 – 5:00

Invited Track Day 1 Wrap-Up

Adam Manzanares (Samsung)

June 5th

Invited Track

8:00 – 9:00

Breakfast & Badge Pickup

A Continental breakfast will be provided from 8:00 to 9:00am.

Badge pick-up will continue from 8:00 to 5:00pm.

8:30 – 8:45

Welcome & Introduction for Day 2

8:45 – 9:45

A.I. Storage Case Studies 1

AI storage case studies	Hari Kennan (Pure)
AI storage case studies	David Flynn (Hammerspace)

10:00 – 11:30

A.I. Storage Case Studies 2

All Flash Storage Systems	Devasena Inupakutika (Samsung)
AI storage case studies	Caden Bradbury
AI storage case studies	Andy Pernsteiner (VAST)

11:30 – 12:30

Keynote: Garth Gibson – “Storage for A.I.”

12:30 – 1:30

Lunch

Badge pick-up will continue from 8:30 to 5:00pm.

1:30 – 3:00

Heterogenous Workloads

Efficient Erasured NVMEoF Targets	Sergey Platonov (Xinnor)
PNFS/NFS4.2 Diverse Workloads	Trond Myklebust (Hammerspace)
Scalable Composable NVME Storage	Matthew Williams (Cerio)

3:15 – 4:45

Distributed Storage Management

Open Community Distributed Storage Managment	Dominic Manno (LANL)
Beyond BeeOND: A Proposal for Composable Storage	Joe McCormick (ThinkparQ)

4:45 – 5:00

Invited Track Day 2 Wrap-Up

Rohan Puri (AirMettle)

June 6th

Research Track

8:00 – 9:00am

Breakfast & Badge Pickup

A Continental breakfast will be provided from 8:00 to 9:00am.

Badge pick-up will continue from 8:00 to 5:00pm.

9:45 – 10:45am

Session 1: Archival Data

Revisiting HDD Rules of Thumb: 1/3 Is Not (Quite) the Average Seek Distance

BURST: A Chunk-Based Data Deduplication System with Burst-Encoded Fingerprint Matching

A Generic and Efficient Framework for Estimating Lossy Compressibility of Scientific Data

11:00 – noon

Session 2: Long Live the Data

Repair I/O Optimization for Clay Codes via Gray-Code Based Sub-Chunk Reorganization in Distributed Storage Systems

Cauchy-Merge: An Efficient Cauchy Matrix based Stripe Merging Method for Reed-Solomon Codes

Minimizing Performance Degradation of RAID Recovery Through Pre-Failure Prediction

noon – 1:30pm

Lunch

Badge pick-up will continue from 8:30 to 5:00pm.

1:30 – 2: 30pm

Session 3: Log Structured Session

Prophet: Optimizing LSM-Based Key-Value Store on ZNS SSDs with File Lifetime Prediction and Compaction Compensation

SAS-Cache: A Semantic-Aware Secondary Cache for LSM-based Key-Value Stores

A GPU-accelerated Compaction Strategy for LSM-based Key-Value Store System

2:45 – 3: 45pm

Session 4: Getting Real

Dissecting I/O Burstiness in Machine Learning Cloud Platform: A Case Study on Alibaba’s MLaaS

Answering the Call to ARMs with PACER: Power-Efficiency in Storage Servers

Mitigating Write-ahead Log Contention on Shared Storage Devices

4:00 – 5:00pm

Session 5: Cloudy With A Chance of Serverless

FastStore: Optimization of Distributed Block Storage Services for Cloud Computing

FuncStore: Resource Efficient Ephemeral Storage for Serverless Data Sharing

Balancing Costs and Durability for Serverless Data

June 7th

Research Track

8:00 – 9:00 both days

Breakfast & Badge Pickup

A Continental breakfast will be provided from 8:00 to 9:00am.

Badge pick-up will continue from 8:00 to 5:00pm.

9:00 – 10:00am

Session 6: Heterogeneity

Learning to Coordinate Read-Write Cache Policies in SSD

TieredMMS: A Portable Tiered Memory Management System

Minding the Semantic Gap for Effective Storage-Based Ransomware Defense

10:15-11:15am

Session 7: Flashy Session

Adaptive Selection of Parity Chunk Update Methods in RAID-enabled SSDs

AUGEFS: A Scalable Userspace Log-Structured File System for Modern SSDs

Fully Harnessing the Performance Potential of DRAM-less Mobile Flash Storage

11:30-12:30pm

Session 8: All Alone with the Memory

LodgeTree: A Last-Level Distributed and Surrogate Buffer Tree for Non-Volatile Memories

Dolphin: A Resource-efficient Hybrid Index on Disaggregated Memory

H2KV: A Hotspot Awareness based Hybrid Fault-tolerant In-memory Key-Value Store

12:30 – 1:30pm

Lunch

Badge pick-up will continue from 8:30 to 5:00pm.

1:30-2:30pm

Session 9: Even More Flashy

PhasedRR: Read Reclaim Scheduling without Page-level Access Counting

Ensuring Compaction and Zone Cleaning Efficiency through Same-Zone Compaction in ZNS Key-Value Store

PhatKV: Towards an Efficient Metadata Engine for KV-based File Systems on Modern SSD