Comments (4)
I think PERF_N (1024) is too small to provide an adequate performance comparison for the inner product. For vectors this small the performance is dominated by OpenCL overheads. Here the boost from OpenCL/CUDA was only apparent starting from vectors longer than 10e5-10e6 elements.
from compute.
Yeah, performance shouldn't be judged with such small data sizes. @roshanr95 At what data sizes did you see the performance differences?
I'll look into bumping up the default number. In my own testing I usually set PERF_N to 16000000 (by specifying it on the command line). Also @roshanr95, I assume you found the perf.py script, which runs the algorithm over a range of data sizes and also displays a graph if you run it with the --plot-time or --plot-rate arguments.
Yeah, I used the python script. The performance differences are for a size of 33554432 elements.
Fixed. See issue #73 and the discussion in issue #69.
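For anyone reproducing the size dependence discussed above, here is a minimal host-only sketch of a size sweep. It is not the library's perf code: it times std::inner_product on the CPU, and the names Timing, time_inner_product and sweep are hypothetical helpers made up for this illustration. A device-side version would additionally pay for transfers and kernel launches, which is the fixed overhead that hides any speedup at 1024 elements.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <iostream>
#include <numeric>
#include <vector>

// Result of timing one problem size (hypothetical helper type).
struct Timing {
    double value;    // inner product result, returned so the work is observable
    double seconds;  // best wall-clock time over all trials
};

// Time std::inner_product on two host vectors of length n and
// keep the fastest of `trials` runs to reduce timer noise.
Timing time_inner_product(std::size_t n, int trials = 5) {
    assert(n > 0 && trials > 0);
    std::vector<float> a(n, 1.0f), b(n, 2.0f);
    Timing best{0.0, 0.0};
    for (int t = 0; t < trials; ++t) {
        auto start = std::chrono::steady_clock::now();
        // Accumulate in double so the sum stays exact for large n.
        double v = std::inner_product(a.begin(), a.end(), b.begin(), 0.0);
        auto stop = std::chrono::steady_clock::now();
        double s = std::chrono::duration<double>(stop - start).count();
        if (t == 0 || s < best.seconds) best = Timing{v, s};
    }
    return best;
}

// Print the best time for sizes min_n, 8*min_n, 64*min_n, ... up to max_n.
void sweep(std::size_t min_n, std::size_t max_n) {
    for (std::size_t n = min_n; n <= max_n; n *= 8) {
        Timing r = time_inner_product(n);
        std::cout << n << " elements: " << r.seconds << " s\n";
    }
}
```

Calling sweep(1024, 33554432) from main covers the range mentioned in this thread, from the old default up to the size where the differences were reported.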