| Title | : | Application composition and communication optimization in iterative solvers using FPGAs. |
| Researchers | : | Rafique, Abid; Kapre, Nachiket; Constantinides, George A. |
| Keywords | : | DRNTU::Engineering::Computer science and engineering::Computing methodologies. |
| Institution | : | Nanyang Technological University, Singapore |
| Collaborators | : | - |
| Year of publication | : | 2013 (B.E. 2556) |
| Reference | : | Rafique, A., Kapre, N., & Constantinides, G. A. (2013). Application Composition and Communication Optimization in Iterative Solvers Using FPGAs. 2013 IEEE 21st Annual International Symposium on Field-Programmable Custom Computing Machines, pp. 153-160. http://hdl.handle.net/10220/17397 , http://dx.doi.org/10.1109/FCCM.2013.16 |
| Source | : | - |
| Expertise | : | - |
| Relation | : | - |
| Coverage | : | - |
| Abstract/Description | : | We consider the problem of minimizing communication with off-chip memory and composing multiple linear algebra kernels in iterative solvers for large-scale eigenvalue problems and linear systems of equations. While GPUs may offer higher throughput for individual kernels, overall application performance is limited by their inability to support on-chip sharing of data across kernels. In this paper, we show that higher on-chip memory capacity and superior on-chip communication bandwidth enable FPGAs to better support the composition of a sequence of kernels within these iterative solvers. We present a time-multiplexed FPGA architecture which exploits the on-chip capacity to store dependencies between kernels and the high communication bandwidth to move data. We propose a resource-constrained framework to select the optimal value of an algorithmic parameter which governs the tradeoff between communication and computation cost for a particular FPGA. Using the Lanczos Method as a case study, we show how to minimize communication on FPGAs through this tight algorithm-architecture interaction and achieve superior performance over a GPU despite its ~5x larger off-chip memory bandwidth and ~2x greater peak single-precision floating-point performance. |
| Bibliography | : |
Rafique, Abid; Kapre, Nachiket; Constantinides, George A. (2013). Application composition and communication optimization in iterative solvers using FPGAs. Bangkok: Nanyang Technological University, Singapore.
Rafique, Abid; Kapre, Nachiket; Constantinides, George A. 2013. "Application composition and communication optimization in iterative solvers using FPGAs." Bangkok: Nanyang Technological University, Singapore.
Rafique, Abid; Kapre, Nachiket; Constantinides, George A. "Application composition and communication optimization in iterative solvers using FPGAs." Bangkok: Nanyang Technological University, Singapore, 2013. Print.
Rafique, Abid; Kapre, Nachiket; Constantinides, George A. Application composition and communication optimization in iterative solvers using FPGAs. Bangkok: Nanyang Technological University, Singapore; 2013.
|
