🐛 Describe the bug Repro: <div class="snippet-clipboard-conte

Dynamo doesn't log backward graphs in compilation metrics about pytorch HOT 3 OPEN

yanboliang commented on May 31, 2024

Dynamo doesn't log backward graphs in compilation metrics

from pytorch.

Comments (3)

ezyang commented on May 31, 2024

Backwards compilation is sometimes lazy. I recently added a structured_trace for AOTAutograd backwards compilation and we can also populate the scuba table with it too, although it must be separate...

from pytorch.

yanboliang commented on May 31, 2024

Backwards compilation is sometimes lazy. I recently added a structured_trace for AOTAutograd backwards compilation and we can also populate the scuba table with it too, although it must be separate...

Right, we should populate the backward graph row in Scuba table. But I think we also need to find out a key that can connect the fwd and bwd graph, to tell us graph X's fwd or bwd is the problematic one.

from pytorch.

ezyang commented on May 31, 2024

That's the compile id ;)

from pytorch.

Related Issues (20)

Tensor computation error on MPS backend HOT 3
[ONNX] view(dtype=dtype) is not supported by both onnx.export and onnx.dynamo_export
[compiled autograd][cudagraphs] accessing TLS cudagraph manager results in corrupted memory
[FSDP] show better warning msg when wrapping nn.ModuleList or nn.ModuleDict HOT 1
[compiled autograd][aot autograd] accumulate grad (on param with non empty grad) mutates inputs and prevents cudagraph HOT 2
Linker Errors on ARM System While Building PyTorch from Source with clang on Main Branch HOT 8
[async H2D] memory ordering issue for async H2D with pin memory on CUDA device HOT 3
`dsplit()` with `indices_or_sections=` doesn't work while `dsplit()` without `indices_or_sections=` works
[DSD] keep 'initial_lr' in `torch.distributed.checkpoint.state_dict.set_optimizer_state_dict`
`tensor_split()` with `indices_or_sections=` doesn't work while `tensor_split()` without `indices_or_sections=` works
[DSD] keep 'exp_avg' as DTensor after `torch.distributed.checkpoint.state_dict.set_optimizer_state_dict`
Add integrity check in torch.save HOT 1
DISABLED test_memory_snapshot (__main__.TestCudaMallocAsync) HOT 1
DISABLED test_memory_format_type_cuda (__main__.TestTorchDeviceTypeCUDA) HOT 1
Segmentation fault (core dumped) when using pytorch Conv layers HOT 3
Implicit data type promotion in torch.cat is undocumented
ONNX Exporter Fails with Handling Complex Tensors
Performance Degradation in F.linear with Batch Size > 1 in Multi-Head Attention
TypeError: slice indices must be integers or None or have an __index__ method
A huge difference between the results of torch.round() on the GPU compared to its results on the CPU and other DL libraries HOT 3

Dynamo doesn't log backward graphs in compilation metrics about pytorch HOT 3 OPEN

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent