Comments (2)
Wow, nice find. If you can run into this issue on the head node, I'd guess that it can also occur with other nodes (that is, whatever adjustments you made on the head node will probably also have to be made on the other nodes).
@atumanov may have some insights.
Closing for now because I haven't seen this problem. Please reopen if the issue comes up again.