arogozhnikov / einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Home Page: https://einops.rocks
License: MIT License
The beginnings of the "Improving RNN language modelling" and "CNNs for text classification" blocks aren't visible in https://arogozhnikov.github.io/einops/pytorch-examples.html
And a typo: "Improving RNN language modilling" -> modelling. ^_^
This makes reduce() more flexible, e.g., you could just pass in tf.reduce_logsumexp instead of requiring it to be a built-in (#12).
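For illustration, a sketch of how passing a callable could look (hedged: this assumes reduce accepts a function called as fn(tensor, reduced_axes), as proposed):
import tensorflow as tf
from einops import reduce
x = tf.random.normal([8, 64, 32])
# hypothetical usage: a callable instead of a built-in reduction string;
# tf.reduce_logsumexp accepts a tuple of axes as its second argument
y = reduce(x, 'b t c -> b c', tf.reduce_logsumexp)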
My sense is that in many cases the size of a new axis should match the size of an existing axis on a different tensor.
I wonder if a helper function that used the same syntax as the rest of einops for extracting axis sizes into a dict would work well?
e.g.,
>>> einops.sizes(input, 'b h w c')
{'b': 32, 'h': 192, 'w': 192, 'c': 3}
This could be naturally extended into a multi-argument version that verifies consistent sizes, e.g.,
>>> einops.sizes(input, 'b h w c_in', weights, 'w h c_in c_out')
{'b': 32, 'h': 192, 'w': 192, 'c_in': 3, 'c_out': 16}
The alternative is manual unpacking of shape, e.g., b_size, h_size, w_size, c_size = input.shape. This is also pretty readable, but maybe a little harder to use reliably. For example, if you only care about the size of the batch axis, you would be tempted to write b_size, *_ = input.shape or b_size = input.shape[0], which doesn't include the explicit shape assertion. And there's no easy way to check sizes for multiple arguments.
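A minimal sketch of such a helper (the name sizes and its behaviour are hypothetical; einops.parse_shape covers the single-tensor case in a similar way):
def sizes(tensor, pattern):
    # map space-separated axis names to the corresponding sizes
    names = pattern.split()
    shape = tuple(tensor.shape)
    assert len(names) == len(shape), f"pattern {pattern!r} does not match rank {len(shape)}"
    return dict(zip(names, shape))
# sizes(input, 'b h w c') -> {'b': 32, 'h': 192, 'w': 192, 'c': 3}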
Hello, I'm just throwing out an idea. I'm not sure it fits in the scope of einops, and it will probably require a lot of work, but I think it would be useful: what about allowing manipulation of elements along a dimension, for example the r, g, b channels of an image?
I could imagine a syntax that would look like this, with everything inside brackets referring to elements rather than dimensions:
# reorder color channels rgb to bgr OpenCV-style:
rearrange(imgs, 'batch [r g b] h w -> batch [b g r] h w')
Extending the existing syntax, grouping elements would look like this:
# reorder color channels rgb to brg:
rearrange(imgs, 'batch [rg b] h w -> batch [b rg] h w', rg=2)
This could also allow dropping elements:
# remove alpha channel:
rearrange(imgs, 'batch [rgb a] h w -> batch [rgb] h w', rgb=3)
@arogozhnikov if you like einsum you will love capsule networks: https://github.com/michaelklachko/CapsNet/blob/master/capsnet_cifar.py#L68-L91 someone should do that in pytorch.
subj. For now concentrating on a single framework.
Is it possible to get a new release of einops out on PyPI? It seems like the version installable by pip doesn't include repeat (which is a very useful op).
This is admittedly a bit of a crazy idea that I think could be interesting to explore. Interested to hear anybody's thoughts on this.
Basically, I think it would be even more readable and concise if one could chain together multiple operations in a single string. Would also eliminate intermediate variables and multiple einops calls.
Let me explain with an example:
out = einops.chain("""
x1=x: b h w c -> b h w; mean
x2=y: b h w c -> b h w; mean
x1, x2: b h1 w, b h2 w -> b h1 h2
""", x=x, y=y)
So, here x and y are set by keyword args, which would let you pass any number of variables into the chain of expressions. You could potentially run an entire pipeline through a single einops chain.
The syntax x1=x: stores the output of the following computation in the "register" x1. That would be passed to the next calculations (shown on line 3 of the string). And similarly for x2.
The syntax ; mean would tell einops that a reduction using the mean operation should be performed.
The syntax x1, x2: would take the arrays in the temporary variables x1 and x2, and execute the following command using einsum. Since there's no =, it just means to output it as the result of the chain.
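For reference, the same computation spelled out with existing operations (a sketch; torch.einsum stands in for the final contraction, since einops itself has no einsum here):
import torch
from einops import reduce
# x and y are the input tensors from the example above
x1 = reduce(x, 'b h w c -> b h w', 'mean')
x2 = reduce(y, 'b h w c -> b h w', 'mean')
out = torch.einsum('bhw,biw->bhi', x1, x2)  # i.e. 'b h1 w, b h2 w -> b h1 h2'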
If the keyword arg is an integer rather than an array, it would be interpreted as a size, e.g.:
einops.chain("ims: (b1 b2) h w c -> h (b1 b2 w) c", ims=ims, b1=2)
would do the normal rearrange operation with b1 set to 2.
Excluding einsum, which is discussed in #73, I don't think this would require additional operations. It would just be string parsing and meta-programming of other operations.
This would also be a unified way of doing the other operations, as well as einsum.
Curious to hear opinions on this! And any other syntax ideas for doing this.
Cheers,
Miles
mxnet issue (MXNET_SPECIAL_MAX_NDIM). After digging into mxnet:
Great work!
I discovered in https://github.com/arogozhnikov/einops/blob/master/einops/einops.py#L199 that you also support ellipsis. It's an important feature, so you may want to add it to the documentation.
CI will not have GPUs, thus we need to test without cupy.
Need to decide on a policy for keeping reference documentation for previous releases.
In pytorch, we have 'expand_as', which checks dims before expanding.
I'm aware of the 'repeat' layer as a replacement for 'expand', but could you add 'repeat_as' as a replacement for 'expand_as'?
Thanks.
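A sketch of how expand_as-like behaviour can already be assembled from repeat and parse_shape (assuming a 1-D input and a 4-D reference; a real repeat_as would generalize this):
import torch
from einops import repeat, parse_shape
x = torch.randn(16)               # shape (c,)
ref = torch.randn(8, 16, 32, 32)  # shape (b, c, h, w)
# take the missing axis sizes from the reference tensor ('_' skips an axis)
y = repeat(x, 'c -> b c h w', **parse_shape(ref, 'b _ h w'))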
I just read through the pytorch 1.3 release notes and found their named tensor feature
https://pytorch.org/docs/stable/named_tensor.html
it looks similar to einops but quite limited and not as powerful - will continue to use einops
I was wondering what einops integration with the named tensor feature could look like; e.g. in
>>> imgs = torch.randn(1, 2, 2, 3, names=('N', 'C', 'H', 'W'))
>>> imgs.names
('N', 'C', 'H', 'W')
>>> rearrange(imgs, "() c h w -> c h w")
should einops check that the names of the input tensors are matching the pattern? What else can be done here?
I understand the named tensors are an experimental feature right now; this ticket is more about starting a discussion from the einops point of view. Thanks!
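One hypothetical form such a check could take (sketch only, not an existing einops API): compare the left side of the pattern against tensor.names.
def check_named_axes(tensor, pattern):
    # hypothetical sketch: verify tensor.names against the pattern's left side
    left = pattern.split('->')[0].split()
    for axis, name in zip(left, tensor.names):
        if name is None or axis in ('()', '1'):
            continue  # unnamed tensor axis or anonymous unit axis: nothing to check
        if axis.lower() != name.lower():
            raise ValueError(f"pattern axis {axis!r} does not match tensor name {name!r}")
# check_named_axes(imgs, '() c h w -> c h w')  # ('N','C','H','W') vs ('()','c','h','w')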
Some revisiting of recipes will be required, but it is a missing ingredient for complete uniformity.
Sometimes when the internet is slow, GitHub takes a long time to open Jupyter notebooks, and sometimes it fails to open them.
I would suggest adding nbviewer links for the docs/xxx.ipynb files, which open fast and also seem more pleasant (IMO).
can't access the einops fundamentals ipynb
Need to investigate whether backend packages make strides available for analysis (or at least as_contiguous). This may help with optimizations.
Is there a reason that einops does not support uppercase Latin letters? I would like to use both upper- and lowercase letters.
Documentation should live on a shorter link
Continuing discussion started in pull-request #25 .
So far: tf.keras and keras are different things now; they work on different inputs and have different recommendations for creating custom layers.
This version seems to work for me with tensorflow.
import tensorflow as tf
from einops.layers.keras import RearrangeMixin, ReduceMixin, UnknownSize
class Rearrange(RearrangeMixin, tf.keras.layers.Layer):
    def call(self, inputs):
        return self._apply_recipe(inputs)

class Reduce(ReduceMixin, tf.keras.layers.Layer):
    def call(self, inputs):
        return self._apply_recipe(inputs)
Example for eager execution
tf.enable_eager_execution()
x = tf.zeros([4, 5], dtype='float32')
Rearrange('i j -> j i')(x).shape
Reduce('i j -> j', 'max')(x).shape
And example without eager execution
import numpy
x = tf.placeholder('float32')
x.set_shape([None, None])
with tf.Session().as_default():
    y = Rearrange('i j -> j i')(x).eval({x: numpy.zeros([5, 6], dtype='float32')})
    y = Reduce('i j -> j', 'max')(x).eval({x: numpy.zeros([5, 6], dtype='float32')})
At least this seems to comply with tf guide
https://www.tensorflow.org/tutorials/eager/custom_layers
My env:
python 3.6 (should not affect)
In [2]: tensorflow.__version__
Out[2]: '1.10.0'
In [4]: keras.__version__ (should not affect)
Out[4]: '2.2.4'
This seems like it should be (intuitively) plausible:
rearrange(x, 'b -> a b c', a=1, c=1)
to essentially push a vector to be compatible with some other tensors (for broadcasting operations). Currently this throws an error.
One (sort of ugly) workaround is:
rearrange(x, '(a b c) -> a b c', a=1, c=1)
However, it seems like this is a bit redundant and it obfuscates the intent a bit. Thoughts?
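For what it's worth, later einops releases accept anonymous unit axes written as a literal 1, which covers this case without the grouping trick (a hedged sketch, assuming such a version):
import numpy as np
from einops import rearrange
x = np.random.randn(5)
# add leading and trailing singleton axes for broadcasting
y = rearrange(x, 'b -> 1 b 1')   # shape (1, 5, 1)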
Currently, concatenation as in the example is done by calling stack_on_zeroth_dimension() first and then rearranging the tensor into the appropriate shape. However, most backend.stack() implementations require that all dimensions except the stacked one be the same, so simple concatenation along a dimension with different lengths is not possible.
For example, if we were to stack an image with 3 channels with an image with a single channel to create a 4-channel image:
img1 = np.random.randn(300, 200, 3)
img2 = np.random.randn(300, 200, 1)
np.concatenate([img1, img2], axis=2).shape
# (300, 200, 4) as expected
rearrange([img1, img2], 'b w h c -> w h (b c)')
# np.stack error: all input arrays must have the same shape
It would be ideal if, when such cases occur, concatenation methods like np.concatenate or torch.cat were called instead of stack. I am not sure how this might break the simplicity of the rest of the code.
A way to split images into patches like im2col and col2im, where they're inverse operations of each other (unlike PyTorch, which does summation).
eg:
x = torch.rand(1, 3, 64, 64)
y = im2col(x, kernel_size=5, stride=1)
z = col2im(y, kernel_size=5, stride=1)
and x == z.
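Not an einops answer, but a sketch of how the exact inverse can be obtained in PyTorch today: normalize fold's summation by the per-pixel contribution count (assumes torch.nn.functional.unfold/fold):
import torch
import torch.nn.functional as F
x = torch.rand(1, 3, 64, 64)
# im2col: extract 5x5 patches -> (N, C*5*5, L)
y = F.unfold(x, kernel_size=5, stride=1)
# col2im: fold sums overlapping patches, so divide by how many patches touch each pixel
ones = torch.ones_like(x)
divisor = F.fold(F.unfold(ones, kernel_size=5, stride=1), output_size=(64, 64), kernel_size=5, stride=1)
z = F.fold(y, output_size=(64, 64), kernel_size=5, stride=1) / divisor
assert torch.allclose(x, z)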
Repo description contains the word "rethinked", which is grammatically incorrect. The correct form is "rethought".
Like torch.tensor.expand
Can we use this library only for numpy operations when we do not have tensorflow/torch/etc?
I was looking for the requirements.txt file and it was missing from the GitHub repo.
It would be helpful for starters if there were info about library requirements.
Thank you!
It would be nice to have it, but there are problems with backends.
A custom implementation through exp and max would probably take much more memory for a backward pass.
Integrating einsum with einops is a good direction.
Summary: currently, relying on backends is hard.
Other option: implement a minimalistic version for two operands based on rearrange / diagonal slicing and dot product. This may turn out to be inefficient.
I am using pytorch. Suppose I want to rearrange a tensor and change some of its elements in place. But I don't know whether rearrange will create a view or not. So I think there must either be an argument which means "raise an error iff this rearrange can't be performed using a view", or there must be an easy way to determine whether my rearrange will create a view or not.
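One heuristic that works today (a sketch, not an einops feature): compare the underlying storage of input and output. Pure permutations come back as views; compositions/decompositions of axes may require a copy.
import torch
from einops import rearrange
x = torch.arange(24).reshape(2, 3, 4)
y = rearrange(x, 'a b c -> a c b')
# shared storage means rearrange returned a view of x rather than a copy
shares_storage = y.storage().data_ptr() == x.storage().data_ptr()
print(shares_storage)  # True for a pure permutation like this one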
opt_einsum: https://optimized-einsum.readthedocs.io/en/latest/
Not sure what integration would look like.
Maybe with a module flag for an "einsum optimizer" (EINSUM_OPT in ['opt_einsum', None]). The einsum part should work the same for all backends it supports, but the rest of the operations need to be specified per-backend.
Just opening it for discussion :)
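For context (not an einops API), plain opt_einsum usage looks roughly like this; it accepts numpy, torch, and other backends' tensors:
import numpy as np
import opt_einsum as oe
a = np.random.rand(16, 32)
b = np.random.rand(32, 64)
c = np.random.rand(64, 8)
# opt_einsum picks an efficient contraction order for the chain of products
out = oe.contract('ij,jk,kl->il', a, b, c)
print(out.shape)  # (16, 8)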
Thank you for making our lives easier when working with tensors. I have the following suggestions based on #50 and #20.
As suggested in #50, it would indeed be useful to have an operation for reordering the elements of channels, especially for those working on images with different libraries (OpenCV, PIL). It is much better than doing it with boring indices.
I totally agree with @remisphere that we can use reorder without misleading users.
# instead of doing this
out = imgs[:, [2, 0, 1, 3], :, : ]
# we can use the below
einops.reorder(imgs, 'batch [rg b a -> b rg a] h w', rg=2, b=1, a=1)
Since we only perform operations on a single dimension, we can perform the concatenation of multiple items with different sizes on that dimension. This will easily handle the case mentioned in #20 and is extremely useful for those who use concatenate in their code. I use this function many times to concatenate tensors of different shapes. For example:
# three below tensors have different size on the 2nd dim
print(x.shape) # [b, 10]
print(y.shape) # [b, 15]
print(z.shape) # [b, 20]
# we can concatenate them as
inputs = [x, y, z]
out = einops.reorder(inputs, 'batch [x y z -> x y z]', x=10, y=15, z=20)
The above call is consistent with einops.rearrange for concatenating inputs whose items have the same shape.
It is possible to split out into its components x, y, z in three lines using the chunk function below:
x = einops.chunk(out, 'batch [x yz -> x]', x=10)
y = einops.chunk(out, 'batch [x y z -> y]', x=10, y=15)
z = einops.chunk(out, 'batch [xy z -> z]', z=20)
In contrast with #50, I don't think it is a good idea to merge chunking into reorder.
We can separate these functionalities into the above reorder and chunk. Chunking is used frequently when we want to sample parts of datasets and features.
Example in #50:
# remove the alpha channel and the bottom half of 256*256 images:
einops.chunk(imgs, 'batch [rg b a -> b rg] [top bottom -> top] w', rg=2, b=1, top=128, batch=10)
Split dataset into train and val:
train_len = int(len(dataset) * 0.8)
train_split = einops.chunk(dataset, '[train val -> train] c h w', train=train_len)
val_split = einops.chunk(dataset, '[train val -> val] c h w', train=train_len)
And we can get the full dataset given train_split and val_split:
dataset = einops.reorder([train_split, val_split], '[train val -> train val] c h w', train=len(train_split), val=len(val_split))
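To make the proposed semantics concrete, here is what these calls would reduce to in plain numpy today (a sketch; reorder and chunk themselves are hypothetical):
import numpy as np
x = np.random.randn(4, 10)
y = np.random.randn(4, 15)
z = np.random.randn(4, 20)
# proposed reorder([x, y, z], 'batch [x y z -> x y z]', x=10, y=15, z=20)
out = np.concatenate([x, y, z], axis=1)   # shape (4, 45)
# the proposed chunk calls are then plain slices along the same axis
x2 = out[:, :10]     # chunk(out, 'batch [x yz -> x]', x=10)
y2 = out[:, 10:25]   # chunk(out, 'batch [x y z -> y]', x=10, y=15)
z2 = out[:, 25:]     # chunk(out, 'batch [xy z -> z]', z=20)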
It would be nice to have independent separate guides (much better if kept separately), but it's better to start from one particular guide.
Hey! Loving einops, so much that now I feel a bit sad about standard einsum not being able to use descriptive names for dimensions. It would be amazing if einops implemented einsum with the same conveniences.
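For reference, a sketch of how such an API could look; this roughly matches the einops.einsum that appeared in later releases (tensors first, pattern last), though details may differ:
import torch
from einops import einsum  # available in recent einops versions
q = torch.randn(8, 16, 64)  # batch, query, channel
k = torch.randn(8, 32, 64)  # batch, key, channel
# descriptive axis names instead of single letters
attn = einsum(q, k, 'batch query channel, batch key channel -> batch query key')
print(attn.shape)  # torch.Size([8, 16, 32])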
Sometimes, I find myself working with lists of tensors in which one tensor has a shape (b, c) (for c classes) and another tensor has shape (b,) (for a single class). My current approach is to pad the tensors that have only one class with an additional channel dimension, use rearrange on the list, and then squeeze the dimensions that need to be squeezed.
A great alternative to this would be supporting optional channels. Perhaps you could notate them with a question mark: rearrange(x, "b c? -> (b c?)").
When all axes are known, it works nicely:
import tensorflow as tf
from einops import rearrange
x = tf.placeholder(tf.float32, shape=(2, 5))
print(x.shape)
y = rearrange(x, 'a b -> b a')
print(y.shape)
yields:
(2, 5)
(5, 2)
If some axes are not known (e.g. variable batch size, variable sequence length, ...), reshape does not preserve the shape information:
import tensorflow as tf
from einops import rearrange
x = tf.placeholder(tf.float32, shape=(None, 5))
print(x.shape)
y = rearrange(x, 'a b -> b a')
print(y.shape)
yields
(?, 5)
(?, ?)
Originated from patch #31
A new experimental layer WeightedEinsum was added recently (PR #70). Users are welcome to give it a try.
This issue is for collecting feedback on the API and possible issues with the current implementation.
WeightedEinsum resembles the usual einsum with two arguments:
output = einsum('<input_part>,<weight_part> -> <output_part>', input, layer.weight)
Corresponds to a layer
layer = WeightedEinsum('<input_part> -> <output_part>', weight_shape='<weight_part>')
weight_shape is passed as an additional argument to stress the difference between input and weight.
Note: all dimensions of the weight shape / bias shape should be specified in the parameters.
Simple linear layer with bias term. You have one like that in your framework (prefer framework built-in where possible)
WeightedEinsum('t b cin -> t b cout', weight_shape='cin cout', bias_shape='cout', cin=10, cout=20)
Linear layer applied to a different axis. Identical to Conv1x1
WeightedEinsum('b cin h w -> b cout h w', weight_shape='cin cout', bias_shape='cout', cin=10, cout=20)
Channel-wise multiplication (like one used in normalizations)
WeightedEinsum('t b c -> t b c', weight_shape='c', c=128)
Separate dense layer within each head, no connection between different heads
WeightedEinsum('t b head cin -> t b head cout', weight_shape='head cin cout', head=8, cin=128, cout=128)
Collapsing several axes into one is frequently followed by a linear layer. This should be one explicit step; also, all arithmetic is now done by the layer, not the user.
WeightedEinsum('b h w c_in -> b c', weight_shape='h w c_in c', h=6, w=6, c_in=64, c=256)
Composition and decomposition should be possible for input and output, as in einops.rearrange (to be implemented)
WeightedEinsum('t b (head cin) -> t b (head cout)', weight_shape='head cin cout', head=8, cin=128, cout=128)
Uniform He initialization is applied to the weight tensor.
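A usage sketch for the torch variant (hedged: the layer is experimental and was later renamed EinMix, so treat the import and class name as indicative rather than exact):
import torch
from einops.layers.torch import WeightedEinsum  # later renamed EinMix
# dense layer over the last axis, applied independently at every (t, b) position
layer = WeightedEinsum('t b cin -> t b cout', weight_shape='cin cout', bias_shape='cout', cin=10, cout=20)
x = torch.randn(5, 3, 10)
y = layer(x)
print(y.shape)  # torch.Size([5, 3, 20])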