Comments (8)
@x0w3n,
I tried to execute the mentioned code on the tf-nightly GPU environment and haven't found out_of_range issues. Kindly find the gist of it here and let me know if it is working as expected. Thank you!
from tensorflow.
@x0w3n, I tried to execute the mentioned code on the tf-nightly GPU environment and haven't found out_of_range issues. Kindly find the gist of it here and let me know if it is working as expected. Thank you!
It seems that colab is not outputting debug messages, I can see the debug messages output when I run it with the idea tool on a physical machine. As the last line of the debug message in the image shows:
from tensorflow.
i have the same problem!!
from tensorflow.
@x0w3n,
I tried to execute the mentioned code on the tensorflow 2.16 GPU environment and haven't faced the issue. Also I observed that the mentioned OUT_OF_RANGE is the warning which might not effect the execution of the code and the output.
Thank you!
from tensorflow.
I think this output may be caused by gpu unavailability. Although it does not affect the execution of the code, the reason for this output is still unknown. Thank you!
from tensorflow.
@x0w3n,
The above code was executed in the GPU environment, and the output was also expected. In the screenshot it was available. Could you please provide the GPU details which you are trying. I suspect it might the GPU issue for the different output in your machine. Thank you!
from tensorflow.
I'm going to add some color here, since I'm running into the same issue.
Here is my code to load my TFRecords dataset (a dummy dataset I created for another issue):
TFRECDIR = Path('tfrecords')
fileList = tf.data.Dataset.list_files(str(TFRECDIR / '*tfrecords'), shuffle = True)
ds = tf.data.TFRecordDataset(
fileList, compression_type = 'GZIP').map(
_parse_function,
num_parallel_calls = tf.data.AUTOTUNE
).batch(
4, drop_remainder = True
).prefetch(
tf.data.AUTOTUNE)
...
model.fit(ds, epochs = 10)
Since I am using Keras, the first epoch looks like this:
8/Unknown 6s 531ms/step - loss: 25.4941 - mae: 4.4993 - mse: 25.49412024-06-14 14:08:00.703161: W tensorflow/core/framework/local_rendezvous.cc:404] Local rendezvous is aborting with status: OUT_OF_RANGE: End of sequence
For the second Epoch, it then says the correct number of steps (8), but at the end of the epoch, I get the same warning thrown. So this raises a few questions.
- How do I prescribe the number of files in my dataset before I get to fit?
- What do I have to do to stop this warning?
- Are TFRecords files not meant to be one-at-a-time? My true dataset has many larger values, with files around 30 MB/each.
Before we get to the GPU issue, I'd point out
TF v2.16.1
Keras v3.3.3
from tensorflow.
This issue is stale because it has been open for 7 days with no activity. It will be closed if no further activity occurs. Thank you.
from tensorflow.
Related Issues (20)
- model.fit ValueError: Cannot take the length of shape with unknown rank. HOT 1
- [TFLite] TfLiteInterpreterInvoke has memory leak and increasing duration time HOT 4
- Stable Gradient for BinaryFocalCrossEntropy HOT 2
- Strange issue File already exists in database HOT 2
- bucketize -function wrong results on GPU HOT 2
- Tensorflow build without boringssl on Power architecture HOT 2
- Model works in tensorflow 2.15 perfectly but unable to import the model in tensorflow 2.16! HOT 4
- TFLite: Stable Delegate: C++ Example HOT 3
- TensorFlow v2.12.0 Not Supporting GPU on Ubuntu 22.04 with CUDA 12.4 and cuDNN 9.0 HOT 2
- Unable to use tf and its modules including keras and etc, where the tensorflow==2.15.1 and keras==2.15.0 HOT 5
- Relu activation in TF Lite model return the negative values HOT 2
- LSTM model conversion failed from 'from_keras_model' HOT 3
- C-API support for .keras models HOT 4
- tfm.nlp.layers.RelativePositionEmbedding HOT 2
- Document tf.Dataset.zip/map triggers tf.Dataset.shuffle HOT 1
- Data Service Dynamic sharding policy seems extremely slow HOT 11
- ModuleNotFoundError: No module named 'tflite_support.task' HOT 3
- Notebook is not working anymore HOT 3
- Kubernetes cluster resolver fails when running from within a K8S cluster.
- Tried multiple builds, Bazel, pip, file. non of it is working HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tensorflow.