3rd Parallel Data Storage Workshop

held in conjunction with
Supercomputing '08

Chair: Garth Gibson, CMU

Monday, November 17, 2008
8:30 a.m. - 5:00 p.m.
Austin Convention Center, Austin, Texas

SC08 Workshop Web Page

IEEE Xplore Digital Library Proceedings


workshop abstract

Petascale computing infrastructures make petascale demands on information storage capacity, performance, concurrency, reliability, availability, and manageability. This one-day workshop focuses on the data storage problems and emerging solutions found in petascale scientific computing environments, with special attention to issues in which community collaboration can be crucial, problem identification, workload capture, solution interoperability, standards with community buy-in, and shared tools.


AGENDA

Papers also available through IEEE Xplore.

8:25am - 8:30am
Welcome - Garth Gibson, Workshop Chair
8:30am - 10:00am
SESSION 1: Structures for Out-of-order and Random Access
Session Chair: Carlos Maltzahn, University of California, Santa Cruz
  Input/Output APIs and Data Organization for High Performance Scientific Computing
Jay Lofstead, Fang Zheng, Scott Klasky, Karsten Schwan, Georgia Tech and
Oak Ridge National Laboratory
Paper | Slides

Fast Log-based Concurrent Writing of Checkpoints
Milo Polte, Jiri Simsa, Wittawat Tantisiriroj, Garth Gibson, CMU
Paper | Slides

Zest: Checkpoint Storage System for Large Supercomputers
Paul Nowoczynski, Nathan Stone, Jared Yanovich, Jason Sommerfield, PSC
Paper | Slides

Scalable Full-Text Search for Petascale File Systems
Andrew W. Leung and Ethan L. Miller, University of California, Santa Cruz
Paper | Slides
10:00am - 10:30am
POSTER SESSION 1 - List of participants and links to posters
10:30am - 12:00pm
SESSION 2: Tools and Devices
Session Chair: Evan Felix, Pacific Northwest National Laboratory
  Performance of RDMA-capable Storage Protocols on Wide-Area Network
Weikuan Yu, Nageswara S.V. Rao, Pete Wyckoff, Jeffrey S. Vetter,
Oak Ridge National Laboratory
Paper | Slides

Comparing Performance of Solid State Devices and Mechanical Disks
Milo Polte, Jiri Simsa, Garth Gibson, CMU
Paper | Slides

Arbitrary Dimension Reed-Solomon Coding and Decoding for
Extended RAID on GPUs

Matthew L. Curry, H. Lee Ward, Anthony Skjellum, and Ron Brightwell,
University of Alabama at Birmingham and Sandia National Laboratory
Paper | Slides

Pianola: A Script-based I/O Benchmark
John May, Lawrence Livermore National Laboratory
Paper | Slides
12:00pm - 1:00pm
Lunch
1:00pm - 2:30pm
SESSION 3: Systems and Application Support
Session Chair: Bill Kramer, Lawrence Berkeley National Laboratory
  Introducing Map-Reduce to High End Computing
Grant Mackey, Saba Sehrish, Julio Lopez, John Bent, Salman Habib, Jun Wang, University of Central Florida, Carnegie Mellon University, and Los Alamos National Laboratory
Paper | Slides

Logan: Automatic Management for Evolvable, Large-Scale, Archival Storage
Mark W. Storer, Kevin M. Greenan, Ian F. Adams, Ethan L. Miller, Darrell D. E. Long, Kaladhar Voruganti, University of California, Santa Cruz
Paper | Slides

Just-in-time Staging of Large Input Data for Supercomputing Jobs
Henry M. Monti,  Ali R. Butt, Sudharshan S. Vazhkudai, Virginia Tech, ORNL
Paper | Slides

Revisiting the Metadata Architecture of Parallel File Systems
Nawab Ali, Ananth Devulapalli, Dennis Dalessandro, Pete Wyckoff, P. Sadayappan, Ohio State University
Paper | Slides
2:30pm -3:00pm
Short Announcements (sign up onsite)
3:00pm - 3:30pm
POSTER SESSION 2 - List of participants and links to posters
3:30pm - 4:30pm

PANEL: Rewarding the Public Release of Valuable
Data and Resources

Panel Speakers:
Clem Cole, Intel Corp & USENIX Association
Garth Gibson, Carnegie Mellon University and Panasas Inc - slides
Gary Grider, Los Alamos National Laboratory
John May, Lawrence Livermore National Laboratory
Moderator:
Ethan L. Miller, UC Santa Cruz

4:30pm - 5:00pm
POSTER SESSION 3 - List of participants and links to posters

 


COMMITTEE:

Garth A. Gibson, Carnegie Mellon University and Panasas Inc.
Darrell Long, University of California, Santa Cruz
J. Bruce Fields, University of Michigan, Ann Arbor,
    Center for Information Technology Integration
Gary A. Grider, Los Alamos National Laboratory
William T. C. Kramer, National Energy Research Scientific Computing Center,
    Lawrence Berkeley National Laboratory
Philip C. Roth, Oak Ridge National Laboratory
Evan J. Felix, Pacific Northwest National Laboratory
Lee Ward, Sandia National Laboratory
Rob Ross, Argonne National Laboratory
Karsten Schwan, Georgia Institute of Technology


Other Workshops & Panels of Interest at SC08

Exa and Yotta Scale Data - Are We Ready?
Panel Chair: Bill Kramer, NERSC
Friday, Nov 21, 2008
10:30AM - 12:00PM, Ballroom E
Austin Convention Center, Austin, Texas

pNFS Protocol after Final Draft and before RFC
Primary Session Leader: Sorin Faibish (EMC)
Wed, November 19, 2008
5:30PM - 7:00PM, Ballroom F
Austin Convention Center, Austin, Texas