In v2.4.0, d3rlpy supports observing tuples. But when I look at the

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

[REQUEST] NotImplementedError: "save_policy method does not support tuple observation yet." about d3rlpy HOT 4 CLOSED

junyeop commented on September 28, 2024

[REQUEST] NotImplementedError: "save_policy method does not support tuple observation yet."

from d3rlpy.

Comments (4)

takuseno commented on September 28, 2024

@junyeop Hi, thanks for the issue. Currently, this is a little tricky to support due to weirdness of torch.script. But, I'll look into this for the next release.

from d3rlpy.

junyeop commented on September 28, 2024

@takuseno Hi, thank you for your reply. I'll be looking forward to the next release.

from d3rlpy.

takuseno commented on September 28, 2024

@junyeop Hi, I've supported this functionality at this latest commit: 9e7d59a . If you install d3rlpy from the source, you can use this feature. When you use the saved policy, you need to do as follows:

TorchScript

policy = torch.jit.load("tuple_policy.pt")

# infer the action
tuple_observation = [torch.rand(1, 3), torch.rand(1, 5)]
action = policy(tuple_observation[0], tuple_observation[1])

ONNX

ort_session = ort.InferenceSession('tuple_policy.onnx', providers=["CPUExecutionProvider"])

# infer the action
tuple_observation = [np.random.rand(1, 3).astype(np.float32), np.random.rand(1, 5).astype(np.float32)]
action = ort_session.run(None, {'input_0': tuple_observation[0], 'input_1': tuple_observation[1]})

from d3rlpy.

takuseno commented on September 28, 2024

Let me close this issue since the request has been supported. This feature will be included in the next release.

from d3rlpy.

[REQUEST] NotImplementedError: "save_policy method does not support tuple observation yet." about d3rlpy HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent