
【2014-11-23】Heterogeneous Parallel Programming – Section 1

Posted: 2014-11-23 22:52:21


  1. Latency devices (CPU cores)
  2. Throughput devices (GPU cores)
  3. Use the best match for the job (heterogeneity in mobile SoCs)
  4. CPU: Latency-Oriented Design
    • Powerful ALU
      • Reduced operation latency
    • Large caches
      • Convert long-latency memory accesses to short-latency cache accesses
    • Sophisticated control
      • Branch prediction for reduced branch latency
      • Data forwarding for reduced data latency
  5. GPU: Throughput-Oriented Design
    • Small caches
      • To boost memory throughput
    • Simple control
      • No branch prediction
      • No data forwarding
    • Energy-efficient ALUs
      • Many ALUs, each with long latency but heavily pipelined for high throughput
  6. Scalability
  7. Portability
  8. SPMD – Single Program, Multiple Data
  9. Threads within a block cooperate via shared memory, atomic operations, and barrier synchronization
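
The SPMD model and the three cooperation mechanisms named in the list above (shared memory, atomic operations, barrier synchronization) can be illustrated with a small CUDA sketch. This is not taken from the course material: the kernel name `blockSumKernel`, the host helper `sumOnGpu`, and the block size of 256 are hypothetical choices made for illustration. Every thread runs the same kernel on its own element; threads of one block reduce their values in shared memory with barriers between steps, and blocks combine their partial sums with one atomic add each.

```cuda
#include <cassert>
#include <vector>
#include <cuda_runtime.h>

// Illustrative block size (an assumption); the reduction below needs a power of two.
constexpr int kThreadsPerBlock = 256;

// SPMD: every thread executes this same program on a different data element.
__global__ void blockSumKernel(const int* in, int n, int* total) {
    // Shared memory: one array per block, visible to all threads of that block.
    __shared__ int partial[kThreadsPerBlock];

    int tid = threadIdx.x;
    int i = blockIdx.x * blockDim.x + tid;  // this thread's global element index

    partial[tid] = (i < n) ? in[i] : 0;     // out-of-range threads contribute 0
    __syncthreads();                        // barrier: all loads are visible

    // Tree reduction inside the block; a barrier closes every step.
    for (int stride = blockDim.x / 2; stride > 0; stride /= 2) {
        if (tid < stride) {
            partial[tid] += partial[tid + stride];
        }
        __syncthreads();
    }

    // Atomic operation: one thread per block adds the block's sum to the total.
    if (tid == 0) {
        atomicAdd(total, partial[0]);
    }
}

// Hypothetical host helper: copies data to the GPU, launches the kernel,
// and returns the sum. Error handling is reduced to a single assert.
int sumOnGpu(const std::vector<int>& host) {
    int n = static_cast<int>(host.size());
    if (n == 0) return 0;

    int* dIn = nullptr;
    int* dTotal = nullptr;
    cudaMalloc(&dIn, n * sizeof(int));
    cudaMalloc(&dTotal, sizeof(int));
    cudaMemcpy(dIn, host.data(), n * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemset(dTotal, 0, sizeof(int));

    int blocks = (n + kThreadsPerBlock - 1) / kThreadsPerBlock;
    blockSumKernel<<<blocks, kThreadsPerBlock>>>(dIn, n, dTotal);

    int result = 0;
    // This blocking copy also waits for the kernel to finish.
    cudaError_t err = cudaMemcpy(&result, dTotal, sizeof(int),
                                 cudaMemcpyDeviceToHost);
    assert(err == cudaSuccess);
    (void)err;

    cudaFree(dIn);
    cudaFree(dTotal);
    return result;
}
```

Calling `sumOnGpu(std::vector<int>(1000, 1))` launches four blocks of 256 threads. Because the kernel never depends on how many blocks exist or in what order they run, the same code can be scheduled on GPUs with different numbers of cores, which is the property the Scalability item refers to.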


Original article: http://www.cnblogs.com/sjtujoe/p/4117512.html
