Comments (4)
Actually, it's not really the structure that is inconvenient; it's more about writing the ormbfile.yaml artifact config file. I opened a separate issue (#180) to discuss this further. I'm closing this one, as nothing prevents users from publishing other ML artifacts such as transformers.
from ormb.
Hi @gbolmier
ML models often require stateful transformers to process data for them (e.g. standard scaler). Unfortunately, this kind of artifact isn't supported as of now.
Do you mean the model with transformer structure, or some transformation functions to process the data?
Also, some ML frameworks aren't supported yet? Especially frameworks that don't use specific serialization formats but rely on, e.g., the pickle protocol.
https://github.com/kleveross/ormb/blob/master/pkg/model/format.go The format is defined here. You can add a new format pickle.
And, welcome contributions!
Hi @gaocegege, thanks a lot for the prompt answer.
Do you mean the model with transformer structure, or some transformation functions to process the data?
I'm referring to the second (e.g. standard scaler, PCA, TF-IDF vectorizer). These transformers are closely tied to the model: they often have hyperparameters that affect the model's performance and, like models, they hold a state that is updated while processing the training data. The model's performance on unseen data depends on the transformers used during training, which is why stateful transformers are persisted: unseen data can then be processed in the same way as the training data.
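To make the point about state concrete, here is a minimal sketch using only the Python standard library. The helper names `fit_scaler` and `apply_scaler` are hypothetical stand-ins for a real transformer such as a standard scaler, and plain `pickle` is used only because the pickle protocol is mentioned in this thread; it is not how ormb itself stores artifacts.

```python
import pickle
import statistics


def fit_scaler(values):
    """Learn the scaler state (mean and standard deviation) from training data."""
    mean = statistics.fmean(values)
    # Fall back to 1.0 so constant training data does not cause a division by zero.
    scale = statistics.pstdev(values) or 1.0
    return {"mean": mean, "scale": scale}


def apply_scaler(state, values):
    """Scale values with a previously learned state."""
    return [(v - state["mean"]) / state["scale"] for v in values]


# Training phase: the state is derived from the training data only.
train = [1.0, 2.0, 3.0, 4.0, 5.0]
state = fit_scaler(train)

# Persist the fitted state, as one would before publishing the artifact.
payload = pickle.dumps(state)

# Later (e.g. at serving time): restore the state and process unseen data
# with the statistics learned during training, not statistics of the new data.
restored = pickle.loads(payload)
unseen = [3.0, 6.0]
scaled_unseen = apply_scaler(restored, unseen)
print(scaled_unseen)
```

If the restored state were missing, the unseen data could not be scaled consistently with the training data, which is why the transformer has to travel with the model.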
https://github.com/kleveross/ormb/blob/master/pkg/model/format.go The format is defined here. You can add a new format pickle.
And, welcome contributions!
Thanks a lot for the pointer. Cool, this looks pretty straightforward.
Follow-up question: let's say I want to share and publish some transformers tied to my ML model. Do I have to create a similar tree structure for each transformer alongside the model's?
$ tree .
.
├── sklearn_model
│   ├── model
│   │   └── sklearn_model.joblib
│   └── ormbfile.yaml
├── sklearn_transformer_a
│   ├── model
│   │   └── transformer_a.joblib
│   └── ormbfile.yaml
└── sklearn_transformer_b
    ├── model
    │   └── transformer_b.joblib
    └── ormbfile.yaml

6 directories, 6 files
If that's the case, could we make it more convenient in practice?
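One way to reduce the manual work, sketched here under assumptions, is to script the layout. The helper `scaffold_artifacts` is hypothetical and not part of ormb or its Python SDK; it only recreates the directory tree shown above, with empty placeholder files where the joblib dumps and the real ormbfile.yaml contents would go.

```python
import tempfile
from pathlib import Path


def scaffold_artifacts(root, artifacts):
    """Create <root>/<name>/model/<file> and <root>/<name>/ormbfile.yaml per artifact."""
    created = []
    for name, filename in artifacts.items():
        artifact_dir = Path(root) / name
        model_dir = artifact_dir / "model"
        model_dir.mkdir(parents=True, exist_ok=True)
        # Placeholder only: a real run would copy the serialized joblib file here.
        (model_dir / filename).touch()
        # Placeholder only: the actual ormbfile.yaml fields are not covered in this thread.
        (artifact_dir / "ormbfile.yaml").write_text("# artifact metadata goes here\n")
        created.append(artifact_dir)
    return created


# Artifact names and file names taken from the tree listing above.
artifacts = {
    "sklearn_model": "sklearn_model.joblib",
    "sklearn_transformer_a": "transformer_a.joblib",
    "sklearn_transformer_b": "transformer_b.joblib",
}

with tempfile.TemporaryDirectory() as tmp:
    scaffold_artifacts(tmp, artifacts)
    # Collect every created path, relative to the temporary root.
    layout = sorted(p.relative_to(tmp).as_posix() for p in Path(tmp).rglob("*"))

for entry in layout:
    print(entry)
```

This still leaves one ormbfile.yaml to fill in per artifact, which is the part that remains inconvenient.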
If that's the case, could we make it more convenient in practice?
What's your favorite structure? As you know, OCI supports layer-based storage, like Docker images. Maybe we could discuss it further.