Researchers from the University of Warwick’s Performance Computing and Visualization Department and Oxford University’s eResearch Centre have put together a study: should we install a GPGPU-based system or a more traditional IMB Blue Gene-like supercomputer? The team’s research will present their work at the 1st International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems at the SC10 conference in New Orleans. The Khronos Group will be at Booth # 1132 at SC10.
Currently there are several ways to feed data to the GPU no matter of what API we use and what type of application we develop. In case of OpenGL we have uniform buffers, texture buffers, texture images, etc. The same is true for OpenCL and other compute APIs that even provide more fine-grained memory management taking advantage of the local data store (LDS) available on today’s hardware. In this article I’ll present the memory access performance characteristics of AMD’s Evergreen-class GPUs focusing on what this all means from OpenGL point of view. While most of the data is about the HD5870, the general principles and relative performance characteristics are valid for other GPUs, including ones from other vendors.
Only a few days since AMD released Catalyst 10.10a, they have released Catalyst 10.10c Hotfix with beta support for OpenGL 4.1. NVIDIA also recently released drivers for OpenGL 4.1. See how each driver does on G-Truc's website.
What would you have if you put over 7000 NVIDIA Tesla GPUs together? Chances are you would have the worlds fastest computer. Chinas National University of Defense Technology has put together 7,168 NVIDIA Tesla M2050 (Fermi) GPUs, 14,336 CPUs, 262TB of memory and 2PB of storage, giving them the worlds fastest super computer with a Linpack performance of 2.5 petaflops. Peak performance of the new Tianhe-1A super computer is 4.7 petaflops. Oh, did I mention it draws 4 Megawatts of power! The Tianhe-1A super computer means the Us has lost its top spot in the TOP500.
Rob Farber has an in-depth two part tutorial on OpenCL. The first part of the tutorial will get you going using the ATI Stream software development kit (SDK). Part two of the OpenCL tutorial covers memory spaces and the OpenCL memory hierarchy, as well, how to start thinking in terms of work items and work groups. Both parts contain lots of code examples to help the novice OpenCL programmer get started.
ArcSoft is collaborating with AMD using their Radeon™ HD 6800 series GPUs on OpenCL™ technology with the pre-release of its best-selling multimedia player application, TotalMedia Theatre 5. ArcSoft's products are optimized with OpenCL, include the upscaling technology SimHD® on TotalMedia Theatre 5 and H.264 encoders across many applications such as TotalMedia ShowBiz, TotalMedia Studio, and MediaConverter 7.
HUONE, Imagination Technologies and Rightware will be demonstrating in the "Khronos Pavilion" at SIGGRAPH ASIA. We are dramatically increasing our participation at SIGGRAPH Asia 2010 in Seoul over last year. We're hosting another Khronos Developer University co-located with SIGGRAPH Asia. We are looking forward to seeing all the familiar faces as well as lots of new ones.
Vincent Hindriksen, owner of StreamComputing will be speaking at the Thalesian seminar on the Usability of OpenCL for Financial Computations. The meetup will be held in Canary Wharf, London, at 7:30 p.m. on Wednesday, 27th October, 2010. Founded in 2008. the Thalesians are a think tank of dedicated professionals with an interest in quantitative finance, economics, mathematics, physics, computer science, and synergetics.