Comments (6)
Paddle's Dropout(mode="downscale_in_infer") behavior:
mode (str, optional): ['upscale_in_train' (default) | 'downscale_in_infer']
1. upscale_in_train (default): upscale the output at training time
- train: out = input * mask / (1.0 - p)
- inference: out = input
2. downscale_in_infer: downscale the output at inference time
- train: out = input * mask
- inference: out = input * (1.0 - p)
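The two conventions above produce the same expected activation; they only differ in which phase applies the 1/(1 - p) scaling. A minimal sketch with plain Python lists (values and p chosen for illustration):

```python
import random

p = 0.5
x = [1.0, 2.0, 3.0, 4.0]

random.seed(0)
# Bernoulli mask: each element is dropped with probability p
mask = [0.0 if random.random() < p else 1.0 for _ in x]

# upscale_in_train: scale kept elements at training time, identity at inference
train_up = [xi * mi / (1.0 - p) for xi, mi in zip(x, mask)]
infer_up = list(x)

# downscale_in_infer: plain masking at training time, scale down at inference
train_down = [xi * mi for xi, mi in zip(x, mask)]
infer_down = [xi * (1.0 - p) for xi in x]
```

Note that `train_down` equals `train_up` multiplied by (1 - p), which is the constant-factor relationship discussed later in this thread.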
from tensorlayerx.
I compared the dropout implementations in TensorFlow, PyTorch, and MindSpore; none of them uses downscale_in_infer. I suggest testing, with the model otherwise unchanged, whether switching downscale_in_infer to upscale_in_train affects the results.
Tested: it has no effect on the inference result, but it does change each layer's output, which is simply scaled by a constant overall factor.
Paddle's downscale_in_infer output:
Tensor(shape=[1, 2048], dtype=float32, place=Place(cpu), stop_gradient=False,
[[0.38661757, 1.65972424, 0.38451916, ..., 0.25652692, 0.24965537,
0.83536357]])
tlx's output:
Tensor(shape=[1, 2048], dtype=float32, place=Place(cpu), stop_gradient=False,
[[0.77323514, 3.31944847, 0.76903832, ..., 0.51305383, 0.49931073,
1.67072713]])
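The two printouts differ by exactly the factor 1/(1 - p); assuming the layer used the default p = 0.5 (an assumption, not stated in the thread), that is a constant 2x scaling, which is why the final inference result is unaffected. A quick check on the visible values:

```python
# first and last visible elements of the two printed tensors
paddle_out = [0.38661757, 1.65972424, 0.38451916, 0.25652692, 0.24965537, 0.83536357]
tlx_out = [0.77323514, 3.31944847, 0.76903832, 0.51305383, 0.49931073, 1.67072713]

p = 0.5  # assumption: default dropout probability
scale = 1.0 / (1.0 - p)
ratios = [t / pd for t, pd in zip(tlx_out, paddle_out)]
```

Every ratio comes out as 2 to within float32 print precision.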
Changing the interface parameters requires changing the backends as well. If we add a downscale_in_infer parameter to the interface, then tf, torch, and mindspore also have to implement the corresponding functionality, and those changes touch the underlying libraries' source code. Unless downscale_in_infer is strictly required, we recommend using the upscale_in_train mode during training.
OK, understood. Thanks for the explanation.
I made a change to support the downscale_in_infer mode:
```python
import tensorlayerx as tlx
from tensorlayerx import logging
from tensorlayerx.nn.core import Module


class Dropout(Module):
    """During training, randomly zeroes some of the elements of the input
    tensor with probability p, using samples from a Bernoulli distribution.
    Each channel will be zeroed out independently on every forward call.

    Parameters
    ----------
    p : float
        Probability of an element to be zeroed. Default: 0.5.
    seed : int or None
        The seed for random dropout.
    mode : str
        'upscale_in_train' (default) or 'downscale_in_infer'.
    name : None or str
        A unique layer name.

    Examples
    --------
    >>> net = tlx.nn.Input([10, 200])
    >>> net = tlx.nn.Dropout(p=0.2)(net)
    """

    def __init__(self, p=0.5, seed=0, mode="upscale_in_train", name=None):
        super(Dropout, self).__init__(name)
        self.p = p
        self.seed = seed
        self.mode = mode
        if mode not in ('downscale_in_infer', 'upscale_in_train'):
            raise ValueError(
                "mode argument should be 'downscale_in_infer' or 'upscale_in_train'")
        self.build()
        self._built = True
        logging.info("Dropout %s: p: %f " % (self.name, self.p))

    def __repr__(self):
        s = '{classname}(p={p}'
        if self.name is not None:
            s += ', name=\'{name}\''
        s += ')'
        return s.format(classname=self.__class__.__name__, **self.__dict__)

    def build(self, inputs_shape=None):
        self.dropout = tlx.ops.Dropout(p=self.p, seed=self.seed)

    def forward(self, inputs):
        if self.is_train:
            # the backend op upscales by 1/(1 - p); multiplying by (1 - p)
            # recovers the plain masked output for downscale_in_infer
            outputs = self.dropout(inputs)
            if self.mode != 'upscale_in_train':
                outputs = outputs * (1.0 - self.p)
        else:
            outputs = inputs
            if self.mode != 'upscale_in_train':
                outputs = outputs * (1.0 - self.p)
        if not self._nodes_fixed and self._build_graph:
            self._add_node(inputs, outputs)
            self._nodes_fixed = True
        return outputs
```
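Assuming the backend `tlx.ops.Dropout` follows the upscale_in_train convention (out = input * mask / (1 - p) at training time), the extra multiplication by (1 - p) collapses the training branch back to plain masking, matching Paddle's downscale_in_infer semantics. A standalone sketch of that algebra for a single surviving element (values are hypothetical):

```python
p = 0.5
x = 4.0
mask = 1.0  # hypothetical: this element survived dropout

# what the backend op returns at training time (upscale_in_train convention)
backend_train = x * mask / (1.0 - p)

# the modified forward() with mode='downscale_in_infer'
train_out = backend_train * (1.0 - p)  # collapses back to x * mask
infer_out = x * (1.0 - p)
```

The (1 - p) factors cancel exactly, so no backend source changes are needed to emulate the Paddle mode.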
Related Issues (20)
- tensorlayerx.nn.PRelu() does not have built-in initialisation; self.build() -> inputs_shape has no description HOT 1
- tensorlayerx.nn.MaxPool2d() lacks the return_mask parameter that paddle.nn.MaxPool2D has HOT 1
- tlx.nn.BatchNorm's act parameter does not support the Swish activation function
- tlx.nn.AvgPool2d() has no ceil_mode=True parameter HOT 1
- DataLoader() raises an error: training works on 0.5.6, but the latest version errors out as follows: HOT 1
- tensorlayerx has no operator with the same functionality as paddle.nn.functional.interpolate; will one be added later?
- tensorlayerx.argmax lacks the keepdim parameter that paddle.argmax has
- No method corresponding to F.unfold() found in tensorlayerx HOT 2
- No method corresponding to paddle.index_select() found in tensorlayerx HOT 2
- tensorlayerx.nn with the paddle backend has no MaxUnPool2D/Pad2d, but paddle.nn does
- tensorlayerx.nn.ConvTranspose2d lacks the output_padding parameter that paddle.nn.Conv2DTranspose has HOT 1
- tlx has no max_pool2d, avg_pool2d, conv2d_transpose, max_unpool2d, or normalize operators corresponding to the paddle.nn.functional module HOT 1
- tensorlayerx.nn.UpSampling2d with data_format="channels_first" produces output dimensions inconsistent with paddle.nn.Upsample
- tensorlayerx has no base class for optimizers; one can only check against tlx.optimizers.paddle_optimizers.Optimizer
- tensorlayerx.nn has no operator corresponding to paddle.nn.InstanceNorm2D
- Does tensorlayerx have an operator corresponding to paddle.expand? HOT 1
- tensorlayerx.ops.Pad does not support the "channels_first" data_format; will this format be added later?
- net.set_eval() seems not to work well HOT 1
- NHWC vs NCHW issue with the PyTorch backend
- ImportError: cannot import name 'distributed_init' from 'tensorlayerx.backend.ops.load_backend' HOT 5