Comments (5)
Are you trying to lower the model to CoreML by passing --coreml
? We're still actively working on enabling llama2 7b with CoreML. The xnnpack backend is ready for llama2 7b model.
from executorch.
Are you trying to lower the model to CoreML by passing
--coreml
? We're still actively working on enabling llama2 7b with CoreML. The xnnpack backend is ready for llama2 7b model.
@cccclai ah ok sweet thank you for letting me know!! I would have still been trying haha
Is the XNNPACK a .mlpackage? I have to build the xnnpack stuff, I just did mps and coreml.
Do you have more info on what models you have ready for CoreML?
Is there any .mlmodel/.mlpackage model configs (or any end products of converting) for in executorch?
from executorch.
xnnpack (https://github.com/google/XNNPACK) is a software library with a list highly optimized operators in CPU. It can work on iOS too.
Regarding CoreML questions, I'd defer to @cymbalrush and @YifanShenSZ to answer.
from executorch.
Will also cc: @shoumikhin for iOS/MacOS related inquiries.
from executorch.
Hey @antmikinka, would this simpler export work for you?
python -m examples.models.llama2.export_llama --checkpoint /Users/anthonymikinka/executorch/llama-2-7b-chat/consolidated.00.pth --params /Users/anthonymikinka/executorch/llama-2-7b-chat/params.json -kv --coreml
Concretely, this is a good start point that we have tested and made sure working. For all other arguments, could you please try to add them one by one until issue pops up? (so we can have more clarity on what went wrong)
from executorch.
Related Issues (20)
- missing packages & incorrect package versions HOT 6
- checkpoint str has no attribute 'get' HOT 5
- error while Building an ExecuTorch Android Demo App HOT 1
- Error when running inference for nanoGPT LLM example HOT 9
- memory issue during export_llama? HOT 5
- Add bf16 kernel support
- WebAssembly / Web runtime (both for wasm-simd and WebGPU) HOT 3
- Downstream users have dependences on cmake variables and internals, making cmake a compatibility surface
- Buck 2 Error on running ./install_requirements.sh HOT 12
- Executorch reports a bug for pages and pages: [method.cpp:939] Overriding output data pointer allocated by memory plan is not allowed. HOT 1
- `torch.max(input)` fails at XNNPACK runtime
- Why is `torch.min` not ATen canonical? HOT 2
- kv cache manipulation? HOT 8
- converting llama3 models with added tokens HOT 3
- "Error creating cell resolver" buck2 failure while building wheel HOT 2
- ERROR: Overriding output data pointer allocated by memory plan is not allowed. HOT 5
- to edge IR from transformers library model HOT 2
- Can I run ExecuTorch on ARM Cortex-A53 processor? HOT 3
- Executorch exported model produces gibberish: stories15M --dtype fp32 --quantize '{"embedding": {"bitwidth": 4, "groupsize":32}, "linear:a8w4dq": {"groupsize" : 256}}' HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from executorch.