Comments (5)
And again, it's also an issue of how you display the results. I'm much more likely to believe a speedup of 20x from .1s to 0.005s than from 1h to 3m. If something is instantaneous, we don't really try to optimize much more usually.
from scikit-learn_bench.
It would also be great to have the absolute times, not only the relative times. Some of these algorithms take .5s. In that case our input validation overhead probably is possibly dominating the work.
from scikit-learn_bench.
Hi @amueller,
We can definitely try both tall and wide data and report absolute timing. As for input validation, we disable it entirely here. That basically calls sklearn.set_config(assume_finite=True)
.
Currently, sparse inputs will always cause our patches to fall back to scikit-learn or convert the sparse matrix to a dense one.
from scikit-learn_bench.
@bibikar enabling assume_finite
is definitely the right way to go. Still, I don't expect anything that takes .5s to be optimized in sklearn. Can you run something that takes like 10s or 1m?
from scikit-learn_bench.
In last several years datasets sizes get more variety and we are working on including more datasets with introduction of GPU support
from scikit-learn_bench.
Related Issues (20)
- Make use of "--device(s)" for XGBoost HOT 3
- svm.py fails with IndexError HOT 3
- xgboost benchmark datasets missing HOT 1
- reporting format of benchmarks
- lot of memory allocations becomes bottleneck HOT 4
- Facing issues while running benchmark on Ubuntu 18.04 HOT 4
- Add support for single row inference cases
- Integrate support for competitive model compilation frameworks(TVM and ONNX)
- Datasets used for producing benchmarks in scikit-learn intelex HOT 7
- The error messages when running benchmark are ignored in some cases HOT 1
- benchmarking the linear kernel in SVC HOT 2
- Unable to run scikit-learn_bench on EMR system
- requirements list for environment setup. HOT 1
- Some links in README.md are broken HOT 1
- Error installing reqs for scikit-learn bench HOT 2
- Benchmark linear models in higher dimensions HOT 4
- Benchmarks silently execute stock version if scikit-learn-intelex is not installed HOT 1
- HistGradientBoostingEstimator HOT 3
- Code quality improvements - adress findings from www.codefactor.io HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from scikit-learn_bench.