Currently there are several ways to feed data to the GPU no matter of what API we use and what type of application we develop. In case of OpenGL we have uniform buffers, texture buffers, texture images, etc. The same is true for OpenCL and other compute APIs that even provide more fine-grained memory management taking advantage of the local data store (LDS) available on today’s hardware. In this article I’ll present the memory access performance characteristics of AMD’s Evergreen-class GPUs focusing on what this all means from OpenGL point of view. While most of the data is about the HD5870, the general principles and relative performance characteristics are valid for other GPUs, including ones from other vendors.
Only a few days since AMD released Catalyst 10.10a, they have released Catalyst 10.10c Hotfix with beta support for OpenGL 4.1. NVIDIA also recently released drivers for OpenGL 4.1. See how each driver does on G-Truc's website.
What would you have if you put over 7000 NVIDIA Tesla GPUs together? Chances are you would have the worlds fastest computer. Chinas National University of Defense Technology has put together 7,168 NVIDIA Tesla M2050 (Fermi) GPUs, 14,336 CPUs, 262TB of memory and 2PB of storage, giving them the worlds fastest super computer with a Linpack performance of 2.5 petaflops. Peak performance of the new Tianhe-1A super computer is 4.7 petaflops. Oh, did I mention it draws 4 Megawatts of power! The Tianhe-1A super computer means the Us has lost its top spot in the TOP500.
Rob Farber has an in-depth two part tutorial on OpenCL. The first part of the tutorial will get you going using the ATI Stream software development kit (SDK). Part two of the OpenCL tutorial covers memory spaces and the OpenCL memory hierarchy, as well, how to start thinking in terms of work items and work groups. Both parts contain lots of code examples to help the novice OpenCL programmer get started.
ArcSoft is collaborating with AMD using their Radeon™ HD 6800 series GPUs on OpenCL™ technology with the pre-release of its best-selling multimedia player application, TotalMedia Theatre 5. ArcSoft's products are optimized with OpenCL, include the upscaling technology SimHD® on TotalMedia Theatre 5 and H.264 encoders across many applications such as TotalMedia ShowBiz, TotalMedia Studio, and MediaConverter 7.
HUONE, Imagination Technologies and Rightware will be demonstrating in the "Khronos Pavilion" at SIGGRAPH ASIA. We are dramatically increasing our participation at SIGGRAPH Asia 2010 in Seoul over last year. We're hosting another Khronos Developer University co-located with SIGGRAPH Asia. We are looking forward to seeing all the familiar faces as well as lots of new ones.
Vincent Hindriksen, owner of StreamComputing will be speaking at the Thalesian seminar on the Usability of OpenCL for Financial Computations. The meetup will be held in Canary Wharf, London, at 7:30 p.m. on Wednesday, 27th October, 2010. Founded in 2008. the Thalesians are a think tank of dedicated professionals with an interest in quantitative finance, economics, mathematics, physics, computer science, and synergetics.
In the Catalyst Driver 10.10 for Windows release AMD introduces the Accelerated Parallel Processing (APP) technology edition. The APP technology edition includes everything from the standard driver and adds the OpenCL driver component. You can still download the Catalyst components individually. The version also includes Blu-Ray 3D support, Stereo 3D gaming support and video acceleration for HD WMV video content (Windows 7).
StreamComputing.nl has posted a third instalment of their OpenCL–The Battle series. Part I covered the hardware companies and operating systems, Part II the software companies. The latest entry focuses on Games. Covering many aspects of games and gaming with a focus on the physics of material. Well written and lots video examples, the article is informative and a fast read.
MacResearch is an open and independent community for scientists using Mac OS X and related hardware in their research. MacResearch has a good collection of OpenCL tutorials as well as tutorials covering many other topics.
MVTec is a developer of software products for machine vision, used in, for example, the semiconductor industry for board, wafer and die Inspection. Halcon 10 will take advantage of GPUs based on the OpenCL standard to make optimum use of available compute power.
General 3D announced the first web-based 3D stereoscopic system to stream 3D stereoscopic videos using only a browser. This new system uses the HTML5 and WebGL standards being built into Mozilla FireFox, Google Chrome and Apple Safari. Currently the site appears to work only with Firefox 4 beta.