ridm@nrct.go.th   ระบบคลังข้อมูลงานวิจัยไทย   รายการโปรดที่คุณเลือกไว้

Machine learning based mapping of data and streaming parallelism to multi-cores

หน่วยงาน Edinburgh Research Archive, United Kingdom

รายละเอียด

ชื่อเรื่อง : Machine learning based mapping of data and streaming parallelism to multi-cores
นักวิจัย : Wang, Zheng
คำค้น : compilers , multi-cores , machine learning , parallelism , parallel programming languages
หน่วยงาน : Edinburgh Research Archive, United Kingdom
ผู้ร่วมงาน : O'Boyle, Michael
ปีพิมพ์ : 2554
อ้างอิง : http://hdl.handle.net/1842/5664
ที่มา : -
ความเชี่ยวชาญ : -
ความสัมพันธ์ : Zheng Wang and Michael O’Boyle. “Mapping Parallelism to Multi-cores: A Machine Learning Based Approach”. In: 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), February 2009, 75- 84. , Georgios Tournavitis, Zheng Wang, Björn Franke and Michael O’Boyle. “Towards a Holistic Approach to Auto-Parallelization—Integrating Profile-Driven Parallelism Detection and Machine-Learning Based Mapping”. In: ACM SIGPLAN 2009 Conference on Programming Language Design and Implementation (PLDI), June 2009, 177-187. , Zheng Wang and Michael O’Boyle. “Partitioning Streaming Parallelism for Multi-cores: A Machine Learning Based Approach”. In: 19th Intl. Conference on Parallel Architectures and Compilation Techniques (PACT), September 2010, 307-318.
ขอบเขตของเนื้อหา : -
บทคัดย่อ/คำอธิบาย :

Multi-core processors are now ubiquitous and are widely seen as the most viable means of delivering performance with increasing transistor densities. However, this potential can only be realised if the application programs are suitably parallel. Applications can either be written in parallel from scratch or converted from existing sequential programs. Regardless of how applications are parallelised, the code must be efficiently mapped onto the underlying platform to fully exploit the hardware’s potential. This thesis addresses the problem of finding the best mappings of data and streaming parallelism—two types of parallelism that exist in broad and important domains such as scientific, signal processing and media applications. Despite significant progress having been made over the past few decades, state-of-the-art mapping approaches still largely rely upon hand-crafted, architecture-specific heuristics. Developing a heuristic by hand, however, often requiresmonths of development time. Asmulticore designs become increasingly diverse and complex, manually tuning a heuristic for a wide range of architectures is no longer feasible. What are needed are innovative techniques that can automatically scale with advances in multi-core technologies. In this thesis two distinct areas of computer science, namely parallel compiler design and machine learning, are brought together to develop new compiler-based mapping techniques. Using machine learning, it is possible to automatically build highquality mapping schemes, which adapt to evolving architectures, with little human involvement. First, two techniques are proposed to find the best mapping of data parallelism. The first technique predicts whether parallel execution of a data parallel candidate is profitable on the underlying architecture. On a typical multi-core platform, it achieves almost the same (and sometimes a better) level of performance when compared to the manually parallelised code developed by independent experts. For a profitable candidate, the second technique predicts how many threads should be used to execute the candidate across different program inputs. The second technique achieves, on average, over 96% of the maximum available performance on two different multi-core platforms. Next, a new approach is developed for partitioning stream applications. This approach predicts the ideal partitioning structure for a given stream application. Based on the prediction, a compiler can rapidly search the program space (without executing any code) to generate a good partition. It achieves, on average, a 1.90x speedup over the already tuned partitioning scheme of a state-of-the-art streaming compiler.

บรรณานุกรม :
Wang, Zheng . (2554). Machine learning based mapping of data and streaming parallelism to multi-cores.
    กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom .
Wang, Zheng . 2554. "Machine learning based mapping of data and streaming parallelism to multi-cores".
    กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom .
Wang, Zheng . "Machine learning based mapping of data and streaming parallelism to multi-cores."
    กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom , 2554. Print.
Wang, Zheng . Machine learning based mapping of data and streaming parallelism to multi-cores. กรุงเทพมหานคร : Edinburgh Research Archive, United Kingdom ; 2554.