Comments (6)
Hi,
There is an extra blank space in front of Common here:
-DCUDA_ARCH_LIST=" Common"
Remove it and try again.
from ctranslate2.
Ignore these warnings; they are not the problem. Check your variables carefully. I see you are setting
set CTRANSLATE_ROOT=C:\Users\username\Downloads\Ctranslate2
but in the CMake command you are defining
-DCMAKE_INSTALL_PREFIX=%CTRANSLATE2_ROOT%
so the two names do not match (CTRANSLATE_ROOT vs. CTRANSLATE2_ROOT). Also, take it one step at a time: first try to build only, without installing:
cmake --build . --config Release --parallel 6
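To catch this kind of name mismatch before running CMake, a small check along these lines may help. It is only a sketch: `undefined_vars` is a hypothetical helper (not part of CMake or CTranslate2), written in Python for convenience, and it assumes the command uses cmd-style `%NAME%` references.

```python
import re


def undefined_vars(command, defined):
    """Return %VAR% references in a cmd-style command that are not in `defined`."""
    # cmd.exe treats variable names case-insensitively, so compare in upper case.
    known = {name.upper() for name in defined}
    referenced = re.findall(r"%([A-Za-z_][A-Za-z0-9_]*)%", command)
    return [name for name in referenced if name.upper() not in known]


# The name that was actually set with `set ...` in the thread.
defined = {"CTRANSLATE_ROOT"}
# The name that the CMake command refers to.
command = "cmake -DCMAKE_INSTALL_PREFIX=%CTRANSLATE2_ROOT% .."

print(undefined_vars(command, defined))  # prints ['CTRANSLATE2_ROOT']
```

An empty list means every referenced variable has been set under the same name.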
Hi @panosk,
Thanks for the quick reply.
After fixing the space in the -DCUDA_ARCH_LIST flag, the cmake configuration step works, but the cmake build seems to go into an infinite loop and never completes.
I mostly see warnings such as:
Command line warning D9025: overriding '/W3' with '/W4' [CTranslate2\build\ctranslate2.vcxproj]
The command used for the build is:
cmake --build . --target install --config Release --parallel 6
Any suggestions? Thanks again.
Thanks @panosk for pointing that out. I did as suggested. Yet when I run a fresh cmake, I get this warning:
CMake Warning at CMakeLists.txt:641 (target_link_libraries): Target "ctranslate2" requests linking to directory "C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v12.4/lib". Targets may link only to libraries. CMake is dropping the item.
And I'm still facing the same situation where the build never completes. Any thoughts?
The message is self-explanatory: you are defining
-DCUDNN_LIBRARY="%CUDA_PATH%\lib"
as a library, but it is a directory. Try to simplify your CMake command. If you have installed everything properly (oneAPI, CUDA, etc.), then all paths should already be set in your system and you don't have to set them again in the CMake command. Try this, for example (make sure to first set CTRANSLATE2_ROOT correctly as the installation directory):
cmake -DCMAKE_INSTALL_PREFIX=%CTRANSLATE2_ROOT% -DBUILD_CLI=OFF -DWITH_DNNL=ON -DWITH_CUDA=ON -DWITH_CUDNN=ON -DCUDA_DYNAMIC_LOADING=ON ..
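If you do need to pass a library path explicitly, you can first check whether the value is a file or a directory, since the warning above says targets may link only to libraries. A rough sketch in Python follows; `classify_library_path` and the file name `cudnn.lib` are hypothetical and used only for illustration, with a temporary directory standing in for the real CUDA path.

```python
import tempfile
from pathlib import Path


def classify_library_path(value):
    """Report whether a path is a 'directory', a 'file', or 'missing'."""
    path = Path(value)
    if path.is_dir():
        return "directory"
    if path.is_file():
        return "file"
    return "missing"


# Demo with a temporary directory standing in for something like %CUDA_PATH%\lib.
with tempfile.TemporaryDirectory() as tmp:
    lib_file = Path(tmp) / "cudnn.lib"  # hypothetical library file name
    lib_file.write_bytes(b"")
    print(classify_library_path(tmp))            # prints directory
    print(classify_library_path(str(lib_file)))  # prints file
```

A value that comes back as "directory" is the case CMake is complaining about.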
Thank you @panosk. Reducing the number of flags being passed fixed the issue. Also, I wasn't using CUDA, so removing those flags helped too.