Abstract
In this paper we introduce the Concurrent Collections programming
model, which builds on past work on TStreams [8]. In this model,
programs are written in terms of high-level application-specific
operations. These operations are partially ordered only according to
their semantic constraints. These partial orderings correspond to data
flow and control flow.
This approach supports an important separation of concerns. There are
two roles involved in implementing a parallel program. One is the role of
a domain expert, the developer whose interest and expertise is in the
application domain, such as finance, genomics, or numerical analysis. The
other is the tuning expert, whose interest and expertise is in performance,
including performance on a particular platform. These may be distinct
individuals or the same individual at different stages in application
development. The tuning expert may in fact be software (such as a static
or dynamic optimizing compiler). The Concurrent Collections programming
model separates the work of the domain expert (the expression of
the semantics of the computation) from the work of the tuning expert
(selection and mapping of actual parallelism to a specific architecture).
This separation simplifies the task of the domain expert. Writing in this
language does not require any reasoning about parallelism or any
understanding of the target architecture. The domain expert is concerned
only with his or her area of expertise (the semantics of the application).
This separation also simplifies the work of the tuning expert. The tuning
expert is given the maximum possible freedom to map the computation
onto the target architecture and is not required to have any
understanding of the domain (as is often the case for compilers).
We describe two implementations of the Concurrent Collections
programming model. One is Intel® Concurrent Collections for C/C++,
based on Intel® Threading Building Blocks. The other is an X10-based
implementation from the Habanero project at Rice University. We compare
the implementations by showing the results achieved on multi-core SMP
machines when executing the same Concurrent Collections application,
Cholesky factorization, in both these approaches.