Comments (1)
I think this would be a cool addition, I would say to two things we would want in an implementation would be:
- For it to be possible to override the checkpoint handler (for example, the
GIT_THETA_CHECKPOINT_TYPE
env variable defaults to"sniff"
which does autodetect, but you can manually set it to force git-theta to use a specific one) - We would want to avoid having to enumerate checkpoint types in git-theta itself. Maybe we would need something like a new
git_theta.plugins.checkpoints_sniff
plugin entry point and then each handler implementation can provide and register a sniff function. Then the git-theta sniff function would load/run the sniffer for each plugin until one of the sniffers says "yeah, this is a checkpoint I can open"
As a further extension to this idea (which def isn't needed in an initial implementation), some checkpoint formats have magic numbers (pytorch does https://pytorch.org/docs/stable/_modules/torch/serialization.html) which might be possible to use in a plugin's sniffer to do fancier things than just looking at the filenames (although I'm not sure how easy it is to peek at the stdin pipe (used in the filters) without consuming it).
I'm pretty busy atm, but I would be happy to help anyone who wants to implement this and it get merged.
from git-theta.
Related Issues (20)
- Add an "apply to all" option to merge actions
- Parameter groups that are more than just tensors? HOT 3
- Add a way to script merges
- Functionality for partial model loading HOT 3
- Method to tell if git-theta wasn't installed? HOT 4
- Pytorch Checkpoint reading
- Git Add can have high memory usage.
- Finer-grained control of `git theta install` HOT 1
- Tensorflow model loading/saving seems bugged
- `git theta ls-files` HOT 1
- Git-Theta Clean
- Hanging when crashing
- More intelligent concurrency limits
- Investigate using cffi to speed up git lfs interface
- Configurable Serialization, Combining, and Saving to a backend
- Add `__str__` to metadata object HOT 1
- Update CI to handle MacOS
- Add retry to end2end tests
- [end2ends] push repos to Hugging Face Hub (and git clone from there) to ensure it works HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from git-theta.