Hi! It's me again.
I'm creating an annotated version of the UNet in lesson #7 (diffusion models). I'm adding more comments + assertions for the shapes of all inputs/outputs/weights/intermediate steps.
While doing this, I noticed what might be a mistake in one of the comments.
Here's the code that runs the UNet on dummy data (from the lesson):
```python
# A dummy batch of 10 3-channel 32px images
x = torch.randn(10, 3, 32, 32)
# 't' - what timestep are we on
t = torch.tensor([50], dtype=torch.long)
# Define the unet model
unet = UNet()
# The forward pass (takes both x and t)
model_output = unet(x, t)
```
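To make the mismatch concrete, here's a quick check I ran on just the shapes (this only reproduces the dummy inputs from the lesson, not the UNet itself; `t_per_image` is my own name for what I'd expect the input to look like):

```python
import torch

# The dummy inputs from the lesson's test code
x = torch.randn(10, 3, 32, 32)            # batch of 10 images
t = torch.tensor([50], dtype=torch.long)  # a single time-step

batch_size = x.shape[0]
print(t.shape)          # torch.Size([1])
assert t.shape[0] == 1  # i.e. NOT equal to batch_size (10)

# What I'd expect instead: one time-step per image, e.g.
t_per_image = torch.randint(0, 1000, (batch_size,), dtype=torch.long)
assert t_per_image.shape[0] == batch_size
```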
Inside the actual UNet, this is the forward pass:

```python
def forward(self, x: torch.Tensor, t: torch.Tensor):
    """
    * `x` has shape `[batch_size, in_channels, height, width]`
    * `t` has shape `[batch_size]`
    """
    # Get time-step embeddings
    t = self.time_emb(t)
```
The docstring says that the shape of `t` is `[batch_size]`. But the shape of `t` is actually `[1]`, which is to be expected if we look at the code that tests the UNet.
Specifically, this assertion:

```python
batch_size = x.shape[0]
print(t.shape)
assert t.shape[0] == batch_size
```

fails.
I'm not sure exactly what's going on here. My hypothesis is as follows: the UNet is being trained on a batch of images, and each image in the batch should be accompanied by its own time-step. However, it looks like only a single time-step is being passed into the UNet.
Somewhere along the line, this time-step is being accidentally broadcast by PyTorch to fit the batch dimension, so it ends up being used as the time-step for all images.
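Here's a minimal sketch of the silent broadcasting I suspect (the names `emb` and `feats` are mine, and the shapes are simplified stand-ins for the UNet's internals, not the actual lesson code):

```python
import torch

batch_size, d = 10, 64

# Time embedding computed from a single time-step: shape [1, d]
emb = torch.randn(1, d)
# Per-image features: shape [batch_size, d]
feats = torch.randn(batch_size, d)

# PyTorch silently broadcasts the size-1 batch dimension, so this "works"...
out = feats + emb
assert out.shape == (batch_size, d)

# ...but every image receives the exact same time embedding:
assert torch.allclose(out[0] - feats[0], out[5] - feats[5])
```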
Does that sound correct to you?