PDA

View Full Version : OpenCL



Pages : 1 2 3 4 5 6 7 [8] 9

  1. Convergence test
  2. uint size
  3. opencl graphs and data structures
  4. CUDA & OpenCL similarities
  5. partial reductions
  6. application porting to OpenCL
  7. warp size vs # of SPs per SM
  8. copying a variable from host memory to device memory
  9. Problems with deinitialization of intel openCL
  10. Work Group and Work Item sizes
  11. texture memory
  12. clSetKernelArg(....)
  13. declare opencl data type in host
  14. explicit copy from host to device
  15. set image format
  16. Fermi: overlapping kernels from an in-order queue
  17. empty texture object
  18. texel element definition
  19. Probable usage of texture memory
  20. clWaitForEvents vs clEnqueueWaitForEvents
  21. data partitioning?
  22. Image Type : IMAGE2D vs IMAGE2D_ARRAY
  23. Infinite loop invalidating command queue
  24. Barrier Function in while loop
  25. write_imagef and image1d
  26. Regarding the work group size
  27. First NVIDIA OpenCL Driver Version?
  28. Collaboration between host and devices
  29. register usage : float3 vs float4
  30. Segmentation fault (core dumped )
  31. ocland, the OpenCL cloud computing interface
  32. platform differentiation
  33. Dive into Registers
  34. graphics card (GPU)distuinguish
  35. NVIDIA OpenCL SDK (without using CUDA)
  36. Working with bits and perfomance issue
  37. Precomputing array of 16 elements
  38. clEnqueueWriteBuffer causes segmentation fault
  39. Compiling Kernel With Struct Pointer in Argument
  40. OpenCL implementation for Multiple platforms
  41. get_global_id is undefined
  42. OpenCL (CUDA v5) and Visual Studio express 2010
  43. what is CL_INVALID_KERNEL ?
  44. Kernel launch time
  45. clCreateKernel Gives OUT_OF_HOST_MEMORY
  46. doubt on OpenCL's C++ feature
  47. Private variables compiler optimization?
  48. cl::vector vs std::vector with cl::Events
  49. Kernel always returns the same array of single value
  50. Profiling of kernel code
  51. OPENCL Kernel Debugging
  52. Increase the global memory allocation on gpu Nvidia and Inte
  53. constant memory issue
  54. clCreateProgramWithSource - could be a file?
  55. CL_DEVICE_GLOBAL_MEM_SIZE less than spec
  56. GPU_MAX_HEAP_SIZE doesn't work
  57. Difference between rated peak performance and actual performance
  58. The following Exceptions have occurred : clEnqueueNDRangeKernel(-54)
  59. regarding clEnqueueCopyBufferRect
  60. Multiplication Matrix with OpenCL in JAVA
  61. OpenCL Requirements for Newbies on Linux
  62. Accessing single GPU buffer using multiple CPUs with MPI
  63. write_imagef only writes correct color if read_imagef is used (OpenCL/OpenGL)
  64. Matching nvidia-smi and OpenCL devices
  65. What performance increase can we expect for a problem, e.g. generating NCn combinatio
  66. Concurrency between a CPU kernel execution and a GPU data transfer
  67. clarification request: barrier()
  68. read_imagef : CL_RGBA vs CL_R
  69. Intel OpenCL can't find my 3rd generation CPU
  70. 2D Subbuffer
  71. Where can I find the definition of _cl_platform_id ?
  72. clGetPlatformIDs is hanging/not returning
  73. Barrier?
  74. multiple command queue on a device
  75. subbuffer understanding (alignment / usefulness)
  76. Deadlock occurs with N threads, N command queues & N Kernels
  77. Subdivision of image data
  78. OpenCL.dll needed for running RealfFLow under Parallels' Virtual Machines on Mac
  79. Mmaped buffers: Memory leaks and GART errors
  80. Warning when returning local address in non kernel function...
  81. Handling event objects for synchronization points
  82. OpenCL computing (dimension avarage) with doubles
  83. copy 2d image to the OpenCL 3D image object
  84. Is non-blocking clEnqueueWriteBuffer really immediately return?
  85. clGetDeviceIDs question
  86. replace the keyword
  87. _inline_ function
  88. Problem for structures containing arrays.
  89. OpenCL rectangular-copy function is either slow or crashes on a AMD machine
  90. Enqueue/Finish Scheme for FFT
  91. empty struct / enum layout compatibility between host and OpenCL code
  92. OpenCL functions undefined
  93. Initializing __local variables
  94. OpenCL compilation process and obtaining intermediate files
  95. Neural networks in openCl with visual studio
  96. Releasing Memory, Kernels, Devices etc
  97. Implication of Apple patent on OpenCL
  98. Basic questions about OpenCL
  99. OpenCL performance issue
  100. Nested loops with dependancies and series operations
  101. Compile for other GPU
  102. memory bank, opencl and host objects
  103. OpenCL general optimization for neighbors accessing
  104. clEnqueueTask : -52
  105. Array Size Change Implies Error
  106. How to convert array of cl_uint into array of cl_uchar in C++?
  107. Performance Questions with regards to image processing.
  108. Using Local Memory with Large Data Buffer
  109. Synchronization between OpenCL and OpenGL using OpenGL sync object
  110. OpenCL write_imagei problem
  111. OpenCL ridiculous CL_OUT_OF_RESOURCES
  112. Undefined type in extensions spec
  113. Graphic driver
  114. How does the OpenCL model map to CPU hardware?
  115. Opencl support hardware requirements
  116. Command queue synchroniztion
  117. program object .
  118. cl_khr_fp64 becoming an "optional core feature"
  119. OpenCL kernel performs only once
  120. slow addition of vector-components using float16 ?!
  121. Official OpenCL 2.0 Feedback thread
  122. Official OpenCL SPIR 1.2 feedback thread
  123. clGetDeivceIDs fails...driver issue?
  124. cross compilation and remotely test your application
  125. OpenCL for crosss-platform and multiple devices ?
  126. read_imagef and floating point coordinates, what's the range?
  127. Use of Template kernel in opencl
  128. Why no template support?
  129. Quoting for -I include paths
  130. Include OpenCV in OpenCL kernel code
  131. paged memory vs pinned memory
  132. How do you represent an image in OpenCL
  133. Passing 2 times the same buffer as input ?
  134. Modify a float4 in a function
  135. Host overhead on data transfer?
  136. Not able to read the image using clcreateimage & read_imagei in OpenCL
  137. Texture with Interoperability, tutorial?
  138. Multiple kernel invocations within kernel or via host
  139. Facing problem in error -46 (CL_INVALID_KERNEL_NAME)
  140. Accessing CAL buffers from OpenCL?
  141. Parallel Reduction combined with other Operations
  142. Open CL Support -> HD 4890 ATI + windows xp
  143. error while mapping the pinned memory
  144. __global vs __constant qualifier in OpenCL
  145. Vector Vector addition
  146. Regarding memory sections in openCL
  147. Solve for Ax=B where A is a sparse matrix
  148. CL_OUT_OF_RESOURCES when crate opencl context
  149. How OpenCL __private address space is mapped on GPU?
  150. Questions on OpenCL Built-in functions?
  151. opencl sdk 1.5 is missing
  152. Passing buffers (pointers) in a structure to a kernel
  153. CL_INVALID_ARG_INDEX causes
  154. No improvement seen when moved to latest GPU which has more max_work_item_size
  155. Manually optimizing OpenCL/CUDA intermediate code !
  156. CL_OUT_OF_RESOURCES - what are the usual usual causes of this error
  157. error while creating program object from source
  158. NULL or !NULL
  159. OpenCL leading with OpenCV
  160. CL GL interoperability
  161. hardware problem related to OpenCL
  162. CL GL Context creation
  163. Weird error code when calling clGetPlatformIDs (not -1001)
  164. Buffer Offsets are not allowed in OpenCL
  165. Looking for a better explanation of CL_DEVICE_MEM_BASE_ADDR_ALIGN
  166. Adding support for a new hardware in OpenCL
  167. Getting pocl to function with microblaze running linux
  168. Multiple cl:::Program within one context with identical kernel names
  169. clCreateUserEvent set reference count to 2
  170. New open source HandsOnOpenCL course now available; examples in C, C++ and PyOpenCL
  171. OpenCL program freezes when high number of kernels are launched within a loop
  172. OpenCL-Z and OpenCL Mandelbrot
  173. OpenCL Kernel debugging on NVIDIA GPUs
  174. Please explain CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE
  175. OpenCL in Android NDK r9
  176. Colaesed reading taking more time
  177. Issue in OpenCL Kernel function
  178. CPU vs GPU performance
  179. Host compile-time test for OpenCL version? Macro?
  180. Does Nvidia API grow an unbounded command queue unless clFinish() is called?
  181. Hello World not working in Linux
  182. OpenCL/GL Interop Setup Problem
  183. Local memory in matrix multiplication
  184. context validation
  185. pow(float4,float)
  186. How to protect/hide my OpenCL C code?
  187. Open CL C library
  188. How can I know that GPU is free for my Open CL code and not for other tasks ?
  189. clCreateEventFromGLsyncKHR link error
  190. device status
  191. Memory management with cl.hpp
  192. [C++] Setting up SDL2 und OpenCL 1.2
  193. Shader to OpenCL Kernel
  194. program hangs on clBuildProgram with an empty kernel or takes > 1 hour
  195. OpenCL gDebugger
  196. Using non-square rectangular blocking for a matrix multiplication kernel
  197. OpenCL vs. CUDA for NVIDIA GPGPU
  198. Running with AMD's OpenCL SDK on NVIDIA's device
  199. OpenCL YouTube Training Videos
  200. how to call two functions within a kernel
  201. NVIDIA's GeForce 9400 GT: CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT = 1
  202. for loops in kernel
  203. Problem synchronizing between multiple WorkGroups
  204. How matrix dimentions are passed to kernel ?
  205. Problem with setting buffers as kernel parameters
  206. Const memory and buffer size in openCL
  207. syntax of 'dot' routine
  208. error in OpenCL
  209. OpenGL shared textures
  210. Extremely slow when returning value from device to host
  211. get_global_id(0) changes when moving from float * to float4 * in kernel
  212. Cache miss in kernel
  213. How to measure GPU performance
  214. cl.hpp on Mac OS X fails with expected unqualified-id error
  215. Determine global_work_size
  216. Moving from float to float4: What should be changed in the host code ?
  217. How to implement the matrix inversion in OpenCL?
  218. OpenCL workgroup synch min/max values for a group
  219. Is it possible to transfer value from kernel to other kernel?
  220. how to obtain part of buffer when a kernel is working with the buffer
  221. how to obtain part of buffer when a kernel is working with the buffer
  222. How to reset global_id for multiple kernel?
  223. Using clEnqueueWriteBuffer with c++ vectors
  224. Determine which Compute Unit the workgroup is running on (on AMD ATI Radeon series)
  225. Subdividing gloabl Workgoup Size
  226. GPU monitoring tools (UI/commandline)
  227. Can somebody tell me an example of How OpenCL works with subbuffer?
  228. Format restrictions on OpenCL kernel
  229. How to allocate a task to GPU
  230. How to avoid unnecessary memory copying if using a CPU as device?
  231. clBuildProgram hangs on some plateform - how to retrieve the CL capabilities?
  232. Element-by-element multiplication of complex matrix
  233. Operations on parts of different arrays
  234. OpenCL finer grained parallelism
  235. OpenCL finer grained follow up
  236. CL_MEM_ALLOC_HOST_PTR and 'ordinary' buffer reads/writes
  237. Screen capture with openCL?
  238. clFFT 2D question
  239. cl.hpp resource leaks (memory leaks) (but not too important).
  240. GL/CL interop on multiple GPUs
  241. Low performance when writing results back to global memory
  242. error -13 when I using clCreateSubBuffer
  243. cl_out_of_resources
  244. OpenCL kernel not found
  245. Generic overlap of data transfers and kernel computation
  246. State of a kernel when it's running?
  247. Info GPU
  248. Synchronization problems?
  249. fatal: si_isa_DS_WRITE_B32_impl: invalid address
  250. How to pass an array to the kernel