Comments (7)
Actually RTX2080ti is faster than Tesla T4. See the below links:
https://www.servethehome.com/nvidia-tesla-t4-ai-inferencing-gpu-review/4/
https://versus.com/en/nvidia-geforce-rtx-2080-ti-founders-edition-vs-nvidia-tesla-t4
from tkdnn.
@mrhosseini Thank you
Why is it that T4 FLOPs is higher but slower?
from tkdnn.
Why is it that T4 FLOPs is higher but slower?
On FP32 RTX2080ti is faster. On FP16 T4 has a higher FLOPS and should be faster. If not may be possible bug in tkDNN or TensorRT. May be not you are not using the latest version TensorRT.
from tkdnn.
Why is it that T4 FLOPs is higher but slower?
On FP32 RTX2080ti is faster. On FP16 T4 has a higher FLOPS and should be faster. If not may be possible bug in tkDNN or TensorRT. May be not you are not using the latest version TensorRT.
The table above is the result of FP16 mode. The TensorRT version is v7. So I am confused.
The code is used:
https://github.com/ceccocats/tkDNN/blob/master/tests/test_rtinference/rtinference.cpp
from tkdnn.
This test also shows that T4 is relatively slow.
from tkdnn.
from tkdnn.
Thanks
from tkdnn.
Related Issues (20)
- Any updates if it works with new versions of CUDNN? HOT 6
- terminate called after throwing an instance of 'YAML::ParserException' HOT 3
- Build error window HOT 4
- cloud not build cuda engine HOT 2
- Google colab cmake error HOT 1
- not matching Darknet HOT 1
- YOLO v7 Support HOT 7
- How to interface with python Torchrt models? HOT 1
- centernet3d and centertrack3d, too many inferences results showing! HOT 5
- TkDnn vs OpenCv Dnn
- monodepth2 download HOT 3
- Yolo3Detection segfault on cv::resize - Jetpack 5.0 HOT 3
- Problems with custom dataset and tensorrt8
- Can not find a yaml-cpp when build in windows
- Cuda failure: out of memory HOT 2
- Any updates if it works with new versions of cuda 11.8? HOT 3
- I'm missing libdarknetRT.so file after make CMakeLists.txt file
- Significant errors in confidence scores after TensorRT conversion
- Jetson Orin?
- how to run the inference using RT file
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tkdnn.