What is the right balance for performance and isolation with virtualization in HPC?Naughton, T., Smith, G., Engelmann, C., Valle ́e, G., Aderholdt, F. and Scott, S. L. (2014) What is the right balance for performance and isolation with virtualization in HPC? In: 7th Workshop on Resiliency in High Performance Computing (Resilience) in Clusters, Clouds, and Grids in conjunction with the 20th International European Conference on Parallel and Distributed Computing (Euro-Par 2014), 25-29 August 2014, Porto, Portugal. Full text not archived in this repository. It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing. Official URL: http://dx.doi.org/10.1007/978-3-319-14325-5_49 Abstract/SummaryThe use of virtualization in high-performance computing (HPC) has been suggested as a means to provide tailored services and added functionality that many users expect from full-featured Linux cluster environments. The use of virtual machines in HPC can offer several benefits, but maintaining performance is a crucial factor. In some instances the performance criteria are placed above the isolation properties. This selective relaxation of isolation for performance is an important characteristic when considering resilience for HPC environments that employ virtualization. In this paper we consider some of the factors associated with balancing performance and isolation in configurations that employ virtual machines. In this context, we propose a classification of errors based on the concept of “error zones”, as well as a detailed analysis of the trade-offs between resilience and performance based on the level of isolation provided by virtualization solutions. Finally, a set of experiments are performed using different virtualization solutions to elucidate the discussion.
Deposit Details University Staff: Request a correction | Centaur Editors: Update this record |