Hi Alex,
Are you normalizing the values correctly ?
--
You received this message because you are subscribed to the Google Groups "clmath" group.
To unsubscribe from this group and stop receiving emails from it, send an email to clmath+un...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
Number of platforms: 2 Platform Profile: FULL_PROFILE Platform Version: OpenCL 1.1 Platform Name: Intel(R) OpenCL Platform Vendor: Intel(R) Corporation Platform Extensions: cl_khr_fp64 cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_storecl_intel_printf cl_ext_device_fission cl_intel_exec_by_local_thread cl_khr_gl_sharing cl_intel_dx9_media_sharing Platform Profile: FULL_PROFILE Platform Version: OpenCL 1.2 AMD-APP (1348.4) Platform Name: AMD Accelerated Parallel Processing Platform Vendor: Advanced Micro Devices, Inc. Platform Extensions: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing
Platform Name: Intel(R) OpenCLNumber of devices: 1 Device Type: CL_DEVICE_TYPE_CPU Device ID: 32902 Max compute units: 8 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 1024 Preferred vector width char: 16 Preferred vector width short: 8 Preferred vector width int: 4 Preferred vector width long: 2 Preferred vector width float: 4 Preferred vector width double: 2 Native vector width char: 16 Native vector width short: 8 Native vector width int: 4 Native vector width long: 2 Native vector width float: 4 Native vector width double: 2 Max clock frequency: 3400Mhz Address bits: 64 Max memory allocation: 4288605184 Image support: Yes Max number of images read arguments: 480 Max number of images write arguments: 480 Max image 2D width: 8192 Max image 2D height: 8192 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 480 Max size of kernel argument: 3840 Alignment (bits) of base address: 1024 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: Yes Quiet NaNs: Yes Round to nearest even: Yes Round to zero: No Round to +ve and infinity: No IEEE754-2008 fused multiply-add: No Cache type: Read/Write Cache line size: 64 Cache size: 262144 Global memory size: 17154420736 Constant buffer size: 131072 Max number of constant args: 480 Local memory type: Global Local memory size: 32768 Kernel Preferred work group size multiple: 128 Error correction support: 0 Unified memory for Host and Device: 1 Profiling timer resolution: 301 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: Yes Queue properties: Out-of-Order: Yes Profiling : Yes Platform ID: 00000000005F6470 Name: Intel(R) Core(TM) i7-2600 CPU @ 3.40GHz Vendor: Intel(R) Corporation Device OpenCL C version: OpenCL C 1.1 Driver version: 1.1 Profile: FULL_PROFILE Version: OpenCL 1.1 (Build 31360.31441) Extensions: cl_khr_fp64 cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_storecl_intel_printf cl_ext_device_fission cl_intel_exec_by_local_thread cl_khr_gl_sharing cl_intel_dx9_media_sharing
Platform Name: AMD Accelerated Parallel ProcessingNumber of devices: 2 Device Type: CL_DEVICE_TYPE_GPU Device ID: 4098 Board name: AMD Radeon HD 6900M Series Device Topology: PCI[ B#1, D#0, F#0 ] Max compute units: 12 Max work items dimensions: 3 Max work items[0]: 256 Max work items[1]: 256 Max work items[2]: 256 Max work group size: 256 Preferred vector width char: 16 Preferred vector width short: 8 Preferred vector width int: 4 Preferred vector width long: 2 Preferred vector width float: 4 Preferred vector width double: 0 Native vector width char: 16 Native vector width short: 8 Native vector width int: 4 Native vector width long: 2 Native vector width float: 4 Native vector width double: 0 Max clock frequency: 680Mhz Address bits: 32 Max memory allocation: 536870912 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 8 Max image 2D width: 16384 Max image 2D height: 16384 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 1024 Alignment (bits) of base address: 2048 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: No Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: None Cache line size: 0 Cache size: 0 Global memory size: 1073741824 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Scratchpad Local memory size: 32768 Kernel Preferred work group size multiple: 64 Error correction support: 0 Unified memory for Host and Device: 0 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: No Queue properties: Out-of-Order: No Profiling : Yes Platform ID: 000007FED53D4E10 Name: Barts Vendor: Advanced Micro Devices, Inc. Device OpenCL C version: OpenCL C 1.2 Driver version: 1348.4 (VM) Profile: FULL_PROFILE Version: OpenCL 1.2 AMD-APP (1348.4) Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_storecl_khr_gl_sharing cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_dx9_media_sharing cl_amd_image2d_from_buffer_read_only
Device Type: CL_DEVICE_TYPE_CPU Device ID: 4098 Board name: Max compute units: 8 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 1024 Preferred vector width char: 16 Preferred vector width short: 8 Preferred vector width int: 4 Preferred vector width long: 2 Preferred vector width float: 8 Preferred vector width double: 4 Native vector width char: 16 Native vector width short: 8 Native vector width int: 4 Native vector width long: 2 Native vector width float: 8 Native vector width double: 4 Max clock frequency: 3400Mhz Address bits: 64 Max memory allocation: 4288605184 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 8 Max image 2D width: 8192 Max image 2D height: 8192 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 4096 Alignment (bits) of base address: 1024 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: Yes Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 32768 Global memory size: 17154420736 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Global Local memory size: 32768 Kernel Preferred work group size multiple: 1 Error correction support: 0 Unified memory for Host and Device: 1 Profiling timer resolution: 301 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: Yes Queue properties: Out-of-Order: No Profiling : Yes Platform ID: 000007FED53D4E10 Name: Intel(R) Core(TM) i7-2600 CPU @ 3.40GHz Vendor: GenuineIntel Device OpenCL C version: OpenCL C 1.2 Driver version: 1348.4 (sse2,avx) Profile: FULL_PROFILE Version: OpenCL 1.2 AMD-APP (1348.4) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_storecl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharingOpenCL platform [ 0 ]: CL_PLATFORM_PROFILE: FULL_PROFILE CL_PLATFORM_VERSION: OpenCL 1.1 CUDA 6.0.1 CL_PLATFORM_NAME: NVIDIA CUDA CL_PLATFORM_VENDOR: NVIDIA Corporation CL_PLATFORM_EXTENSIONS: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll
OpenCL platform [ 1 ]: CL_PLATFORM_PROFILE: FULL_PROFILE CL_PLATFORM_VERSION: OpenCL 1.2 AMD-APP (1084.4) CL_PLATFORM_NAME: AMD Accelerated Parallel Processing CL_PLATFORM_VENDOR: Advanced Micro Devices, Inc. CL_PLATFORM_EXTENSIONS: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices cl_khr_d3d10_sharing cl_khr_d3d11_sharing
OpenCL platform [ 2 ]: CL_PLATFORM_PROFILE: FULL_PROFILE CL_PLATFORM_VERSION: OpenCL 1.2 CL_PLATFORM_NAME: Intel(R) OpenCL CL_PLATFORM_VENDOR: Intel(R) Corporation CL_PLATFORM_EXTENSIONS: cl_khr_fp64 cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_intel_printf cl_ext_device_fission cl_intel_exec_by_local_thread cl_khr_gl_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing
OpenCL devices [ 0 ]: CL_DEVICE_NAME: Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz CL_DEVICE_VERSION: OpenCL 1.2 (Build 63463) CL_DRIVER_VERSION: 1.2 CL_DEVICE_TYPE: CPU CL_DEVICE_MAX_CLOCK_FREQUENCY: 3400 CL_DEVICE_ADDRESS_BITS: 64 CL_DEVICE_AVAILABLE: TRUE CL_DEVICE_COMPILER_AVAILABLE: TRUE CL_DEVICE_OPENCL_C_VERSION: OpenCL C 1.2 CL_DEVICE_MAX_WORK_GROUP_SIZE: 1024 CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3 Dimension[ 0 ] 1024 Dimension[ 1 ] 1024 Dimension[ 2 ] 1024 CL_DEVICE_HOST_UNIFIED_MEMORY: TRUE CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 131072 ( 128 KB ) CL_DEVICE_LOCAL_MEM_SIZE: 32768 ( 32 KB ) CL_DEVICE_GLOBAL_MEM_SIZE: 34295431168 ( 32706 MB ) CL_DEVICE_MAX_MEM_ALLOC_SIZE: 8573857792 ( 8176 MB ) CL_DEVICE_EXTENSIONS: cl_khr_fp64 cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_intel_printf cl_ext_device_fission cl_intel_exec_by_local_thread cl_khr_gl_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing
Internal Client Test *****PASS*****OPENCL_V_THROWERROR< CLFFT_DEVICE_NOT_FOUND > (385): Getting OpenCL devices ( ::clGetDeviceIDs() )clFFT error condition reported:OPENCL_V_THROWERROR< CLFFT_DEVICE_NOT_FOUND > (385): Getting OpenCL devices ( ::clGetDeviceIDs() )