I want to do an evaluation of JOCL, JogAmp JOCL, JavaCL and APARAPI using matrix multiplication. I have done the Aparapi version. Can someone help me how to implement Matrix Multiplication in JOCL, ...

I'm a little confused about terms of an AMD wavefront and workgroupsize of OpenCL.
I found different sources were different statements are done.
My question is: How much is the wavefront-size of AMDs ...

I am going through the sample code of NVIDIA provided at link
In the sample kernels code (file oclReduction_kernel.c) reduce4 uses the technique of
1) unrolling and removing synchronization barrier ...

Whatever program I ran on GPU, even if programs that ran successfully before, my GPU throws this error: CL_OUT_OF_RESOURCES for the clEnqueueReadBuffer function.
Then I remembered that I ran a deep ...

everyone! I'm not really clear about the meaning of CL_UNORM_INT8, which is one of the available choice of value of cl_image_format.image_channel_data_type; what's specific about this type, and what's ...

I am new to OpenCL and I tried to run example code in OpenCL Programming Guide.
I got an error code -5(which corresponds to CL_OUT_OF_RESOURCES) when trying to read filtered image back to host using ...

I'm working on an openCL kernel that loads up some points, decides which is the highest, and returns it. All good there, but I want to add a calculation before the highest evaluation. This compares ...

I have the latest Ubuntu LTS and Intel OpenCL drivers for it. I need to build ImageMagick library so it would have OpenCL enabled but I cant figure out how to do this. I'd like to build it from the ...

I'm new to OpenCL, and I'm curious as to how to read in data input to perform simple operations (e.g. cross/dot product) on.
For a particular example, I've compiled and am trying to run this simple ...

I am working with the OpenCL reduction example provided by Apple here
After a few days of dissecting it, I understand the basics; I've converted it to a version that runs more or less reliably on c++ ...

I have an application that I designed to run on AMD GPU's with OpenCL. Finally got the app running and bug free (haha) yesterday, targted on a single GPU. Now that the app works, it's time to scale it ...

I have an array of 2M+ points (planned to be increased to 20M in due course) that I am running calculations on via OpenCL. I'd like to delete any points that fall within a random triangle geometry.
...

So I installed OpenCV WITH_OPENCL_SVM=ON thinking that I was going to eventually get a GPU with OpenCL 2.0 on it (currently only 1.1.) However now when I try to run any programs with OpenCV I get an
...

I have a massive array I need to search (actually it's a massive array of smaller arrays, but for all intents and purposes, lets consider it one huge array). What I need to find is a specific series ...

I have a __local int* pointer which I want to copy the data from a __global int* to it. To make the copy faster, I cast both to long16*, I know all the arrays (input, output and local memory) are of ...

I got confused a about how to pass arrays as arguments to OpenCL kernel, my reason for doing this is to use my GPU which is Radeon HD 6870 to do some calculations on images, any help will be greatly ...

I'm working on a solver for a differential equation for a particle smulation using Pyopencl.
To solve this equation each particle must access it's neighbors information.
The arrays I'm using are numpy ...

I am trying to combine Opencv with OpenCL for creating image buffer and pass it to GPU.
I have imx6 which uses vivante core (GPU).
Do not support OCL feature of opencv.
I am using OpenCV for reading ...

I've just begun to experiment with OpenCL. I'm trying to make a kernel which will multiply two 2-d arrays. I've already done this with vectors, however in 2-d I get only results from the first row. ...

I am running a sorting algorithm in a kernel, and the sorting part uses about 36 VGPR, thus resulting in 12.5% occupancy and awful performance.
The code segment is as follows:
typedef struct {
float ...

I'm writing an OpenCL app to do some number crunching, and the app has some specific hurdles and issues I've gotten past, but I am sure there is a better way to do it.
First hurdle: The app crunches ...