Participating Institutions:

Oak Ridge National Laboratory
Virginia Tech.

/Scratch As a Cache

Description.

To sustain emerging data-intensive scientific applications, High Performance Computing (HPC) centers invest a notable fraction of their operating budget in a specialized fast storage system, the scratch space, which is designed to store the data of currently running and soon-to-run HPC jobs. In practice, however, it is often used as a standard file system, in which users store their data arbitrarily, without regard for the center's overall performance. To remedy this, centers periodically scan the scratch space in an attempt to purge transient and stale data. This practice of supporting a cache workload with a file system and disjoint staging and purging tools results in suboptimal use of the scratch space.

In this work, we address the above issues by proposing a new perspective, in which the HPC scratch space is treated as a cache, and the data population, retention, and eviction tools are integrated with scratch management. In our approach, data is moved to the scratch space only when it is needed, and unneeded data is removed as soon as possible. We also design a new job-workflow-aware caching policy that leverages user-supplied hints for managing the cache.
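As a rough illustration of the idea, the following minimal Python sketch treats scratch as a fixed-capacity cache whose eviction order is driven by workflow hints. It is not the policy from the paper: the `ScratchCache` and `CachedFile` names, the `next_use` hint (the workflow step that will next read a file, or `None` if no job needs it again), and the eviction rule (unneeded files first, then the file whose hinted use is farthest away) are all assumptions made for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class CachedFile:
    name: str
    size: int                  # size in arbitrary units (hypothetical)
    next_use: Optional[int]    # hinted workflow step that next reads the file;
                               # None means no job is expected to need it again


@dataclass
class ScratchCache:
    capacity: int
    files: Dict[str, CachedFile] = field(default_factory=dict)

    def used(self) -> int:
        """Total size of all files currently held in scratch."""
        return sum(f.size for f in self.files.values())

    def _eviction_order(self) -> List[CachedFile]:
        # Assumed rule: files with no hinted future use are evicted first,
        # then files whose next hinted use is farthest in the workflow.
        def key(f: CachedFile):
            return (0, 0) if f.next_use is None else (1, -f.next_use)
        return sorted(self.files.values(), key=key)

    def stage_in(self, name: str, size: int, next_use: Optional[int]) -> List[str]:
        """Stage a file into scratch; return the names evicted to make room."""
        if size > self.capacity:
            raise ValueError("file is larger than the scratch capacity")
        self.files.pop(name, None)  # re-staging replaces any older entry
        evicted = []
        for victim in self._eviction_order():
            if self.used() + size <= self.capacity:
                break
            del self.files[victim.name]
            evicted.append(victim.name)
        self.files[name] = CachedFile(name, size, next_use)
        return evicted

    def release_unneeded(self) -> List[str]:
        """Eagerly remove files that no job is hinted to read again."""
        gone = [f.name for f in self.files.values() if f.next_use is None]
        for name in gone:
            del self.files[name]
        return gone
```

For example, with a capacity of 100 units, staging `a` (40 units, needed at step 1) and `b` (40 units, needed at step 5) fits without eviction. Staging `c` (40 units, needed at step 2) then evicts `b`, because its hinted use is farthest away. Once the hint for `a` is cleared to `None`, `release_unneeded()` removes it immediately instead of waiting for a periodic purge.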

People.

Sudharshan Vazhkudai (ORNL)
Ali R. Butt (Virginia Tech.)
Henry Monti (VT: PhD Student)

Research Publications.

H. Monti, A.R. Butt, S.S. Vazhkudai, "/Scratch as a Cache: Rethinking HPC Center Scratch Storage", Proceedings of the 23rd Int'l Conference on Supercomputing 2009 (ICS'09), Yorktown Heights, New York, June 2009.

Talks.

Job Opportunities.

URL http://www.csm.ornl.gov/~vazhkuda/ScratchAsCache.html
Updated: Friday, 27-Aug-2010 14:29:16 EDT