Comments (3)
try upgrade torch version to 2.2.2 or latest
from llm.c.
try upgrade torch version to 2.2.2 or latest
ok
from llm.c.
same with pytorch 2.2.2
python train_gpt2.py
Running pytorch 2.2.2
using device: mps
wrote gpt2_tokenizer.bin
loading weights from pretrained gpt: gpt2
config.json: 100%|██████████████████████████████████████████████████████████████████████████████| 665/665 [00:00<00:00, 130kB/s]
model.safetensors: 100%|█████████████████████████████████████████████████████████████████████| 548M/548M [00:59<00:00, 9.17MB/s]
generation_config.json: 100%|██████████████████████████████████████████████████████████████████| 124/124 [00:00<00:00, 21.8kB/s]
loading cached tokens in data/tiny_shakespeare_val.bin
/AppleInternal/Library/BuildRoots/8d3bda53-8d9c-11ec-abd7-fa6a1964e34e/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders/MPSCore/Utility/MPSLibrary.mm:504: failed assertion `MPSKernel MTLComputePipelineStateCache unable to load function copyNDArrayData.
Compiler encountered an internal error: (null)
'
zsh: abort python train_gpt2.py
from llm.c.
Related Issues (20)
- Is there a plan to support 8bits (FP8 or INT8)? HOT 1
- compute sanitizers HOT 1
- Broader vendor support for hardware acceleration HOT 3
- 2D and 3D tile divisions so that permutation coordinates can be read from threadIdx and blockIdx HOT 3
- ThunderKittens Backend HOT 1
- Mismatch of dweight at layernorm_backward.cu
- Recalculating the activations in the backwards pass to conserve memory HOT 3
- Deleting Conda/Python as a dependency entirely to dramatically decrease "latency to step" HOT 4
- python dev/data/fineweb.py --version 10B HOT 2
- BitNet (b1.58) support HOT 1
- Cudnn error cudnn_att.cpp on train_gptcu HOT 4
- Model Export & Inference HOT 3
- Modal script - benchmarking, profiling and libraries HOT 1
- ERROR on the AMD GPU HOT 4
- apparent compatibility issues with earlier c++ versions after recent pushes HOT 3
- I can not understand the `cublasGemmStridedBatchedEx` call in the `attention_forward`
- LLM.c in google colab HOT 1
- Running `quick start on CPU` on Macbook Pro M2 HOT 7
- OSError: Memory mapping file failed: Cannot allocate memory HOT 1
- is max_seq_len configurable or hardcoded parameter?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from llm.c.