Comments (4)
Your hypothesis is probably correct. If the model doesn't return a prediction in time, you will see a 1 as output. The selection policies are sort of broken right now and we are in the process of removing them in #89 until they can be fixed.
You can try setting the application latency requirement at 150ms and see if you get better results. That shouldn't slow down your predictions because they will be returned before the deadline if they are available.
These are great ideas for debugging. Adding more visibility to the system should be at least partially addressed by CLIPPER-114. Once #89 is completed, we will add metrics to report how many predictions returned in time and how many used the default value. Also, #89 makes the default prediction configurable on a per-application basis which should make it easier to debug as well.
from clipper.
Does this mean their container is running slowly? We might want to add a basic dashboard for Clipper to monitor this stuff.
from clipper.
"Slowly" is somewhat subjective. Their container is running more slowly than the latency objective for their application. The metrics already show this. I agree a monitoring dashboard would be good. I created CLIPPER-142 to track this for the 0.2 release.
from clipper.
#116 now explicitly marks predictions as being the result of a default prediction or a prediction from the model container as part of the response object. The metrics at http://<clipper-url>:1338/metrics
also report the ratio of default predictions.
from clipper.
Related Issues (20)
- Batch_size issue HOT 5
- Metrics
- Debugging input and output
- Number of predictions mismatch
- Potential deadlock when deleting model container replica queue
- Issue with speed of clipper with custom PyTorch model HOT 1
- Model Selection HOT 4
- Study the source code and development
- Dynamic Batching HOT 2
- discussion question HOT 2
- Failed to retrieve a prediction response within the specified latency SLO HOT 2
- No connected models found for query HOT 1
- all input-type (except double) do not work.
- Kubernetes Connection - the server could not find the requested resource
- Question for python_deployer.deploy_python_closure - case of multiple outputs from func
- About the source code HOT 1
- basic query example HOT 1
- develop test failed on win10: qf startup error *Option ‘thread_pool_size’ does not exist*
- Unregistering an application, and then registering again with different output doesn't change the output
- Documentation website unavailable(expired) HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from clipper.