Please use this identifier to cite or link to this item:
http://hdl.handle.net/1893/30029
Appears in Collections: | Computing Science and Mathematics Conference Papers and Proceedings |
Author(s): | Maier, Patrick; Stewart, Robert; Trinder, Phil |
Contact Email: | patrick.maier@stir.ac.uk |
Title: | The HdpH DSLs for scalable reliable computation |
Citation: | Maier P, Stewart R & Trinder P (2014) The HdpH DSLs for scalable reliable computation. In: Haskell '14: Proceedings of the 2014 ACM SIGPLAN symposium on Haskell. 2014 ACM SIGPLAN symposium on Haskell, Gothenburg, Sweden, 04.09.2014-05.09.2014. New York: ACM Press, pp. 65-76. https://doi.org/10.1145/2633357.2633363 |
Issue Date: | 3-Sep-2014 |
Date Deposited: | 15-Aug-2019 |
Conference Name: | 2014 ACM SIGPLAN symposium on Haskell |
Conference Dates: | 2014-09-04 - 2014-09-05 |
Conference Location: | Gothenburg, Sweden |
Abstract: | The statelessness of functional computations facilitates both parallelism and fault recovery. Faults and non-uniform communication topologies are key challenges for emergent large scale parallel architectures. We report on HdpH and HdpH-RS, a pair of Haskell DSLs designed to address these challenges for irregular task-parallel computations on large distributed-memory architectures. Both DSLs share an API combining explicit task placement with sophisticated work stealing. HdpH focuses on scalability by making placement and stealing topology aware whereas HdpH-RS delivers reliability by means of fault tolerant work stealing. We present operational semantics for both DSLs and investigate conditions for semantic equivalence of HdpH and HdpH-RS programs, that is, conditions under which topology awareness can be transparently traded for fault tolerance. We detail how the DSL implementations realise topology awareness and fault tolerance. We report an initial evaluation of scalability and fault tolerance on a 256-core cluster and on up to 32K cores of an HPC platform. |
Status: | AM - Accepted Manuscript |
Rights: | © ACM, 2014. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Haskell '14: Proceedings of the 2014 ACM SIGPLAN symposium on Haskell, (2014) http://doi.acm.org/10.1145/2633357.2633363 |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
Maier_Stewart_Trinder_Haskell2014_preprint.pdf | Fulltext - Accepted Version | 436.75 kB | Adobe PDF |

This item is protected by original copyright.
Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/
If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk with details; we will remove the Work from public display in STORRE and investigate your claim.