Is there a way to pass extra feature along with the existing word tokens as input and

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Seq2seq multiple input features about practical-pytorch HOT 3 OPEN

iamsiva11 commented on May 28, 2024 4

Seq2seq multiple input features

from practical-pytorch.

Comments (3)

spro commented on May 28, 2024 13

For the initializer you would need to add arguments for your feature sizes, and create a new Embedding layer for each discrete feature. In the forward() method you would add arguments for the extra features, forward them through the relevant embedding layers, and concatenate them all into one vector for the RNN. The RNN also would have an input size of 3 * hidden_size because of the three embeddings added together (or you could create different embedding sizes for each feature).

Overall it should look something like this:

class EncoderRNN(nn.Module):
    def __init__(..., word_size, feature2_size, feature3_size, hidden_size, ...):
        
        ...
        
        self.word_embedding = nn.Embedding(word_size, hidden_size)
        self.feature2_embedding = nn.Embedding(feature2_size, hidden_size)
        self.feature3_embedding = nn.Embedding(feature3_size, hidden_size)
        # Note: * 3 because the above 3 embeddings will be concatenated
        self.gru = nn.GRU(hidden_size * 3, hidden_size, n_layers, dropout=self.dropout, bidirectional=True)
        
    def forward(self, word_seqs, feature2_seqs, feature3_seqs, input_lengths, hidden=None):
        # Note: we run this all at once (over multiple batches of multiple sequences)
        word_embedded = self.word_embedding(word_seqs)
        feature2_embedded = self.feature2_embedding(feature2_seqs)
        feature3_embedded = self.feature3_embedding(feature3_seqs)
        combined = torch.cat((word_embedded, feature2_embedded, feature3_embedded), 2)
        packed = torch.nn.utils.rnn.pack_padded_sequence(combined, input_lengths)
        ...

from practical-pytorch.

spro commented on May 28, 2024 2

The simplest way is to concatenate features into a single input vector. However this only works if your RNN takes vector input, not discrete inputs (LongTensor) through an embedding layer. In that case you would want to concatenate your extra features after the input is embedded. This can most easily be achieved by using an extra input argument in your encoder and using torch.cat (see the category input in https://github.com/spro/practical-pytorch/blob/master/conditional-char-rnn/conditional-char-rnn.ipynb for a simple example, though that does not use an embedding layer).

If your features are also discrete you would want multiple embedding layers, one for each, and concatenate all the results.

The quoted section is more about input and target sequence length, not feature size.

from practical-pytorch.

iamsiva11 commented on May 28, 2024 1

@spro Thanks much for your inputs. I have written a blog post on the same -
https://iamsiva11.github.io/extra-features-seq2seq/. Hope it'll be useful for many others.

from practical-pytorch.

Recommend Projects

Seq2seq multiple input features about practical-pytorch HOT 3 OPEN

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent