Giter Site home page Giter Site logo

datawhalechina / d2l-ai-solutions-manual Goto Github PK

View Code? Open in Web Editor NEW
257.0 257.0 52.0 809.34 MB

《动手学深度学习》习题解答,在线阅读地址如下:

Home Page: https://datawhalechina.github.io/d2l-ai-solutions-manual/

License: Other

Jupyter Notebook 99.59% Python 0.41%
d2l-ai deep-learning pytorch

d2l-ai-solutions-manual's People

Contributors

0-yy-0 avatar 2951121599 avatar aizealotwu avatar axyzdong avatar baimuchu avatar catcooc avatar ethan-chen-plus avatar gudehhh666 avatar hongru0306 avatar jjyaoao avatar kailigithub avatar kedreamix avatar kmno4-zx avatar linchentang avatar liuxu-manifold avatar logan-zou avatar northbluem avatar relph1119 avatar rookie-kai avatar tsumugii24 avatar xyw1 avatar yapwh1208 avatar yikunhan42 avatar yinhailong124 avatar yinhan-zhang avatar yxyibb avatar zarjun avatar zzd2001 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

d2l-ai-solutions-manual's Issues

参数绑定的含义讨论

image
这里似乎不是进行参数绑定吧,我看 d2l 书上参数绑定的意思似乎是参数共享的意思,你这里的答案似乎就是直接用其他的网络层的参数替换掉了 net 中的第一层的参数吧

实验与解答不符

image

上图是仓库中的关于这题的回答,但是经我实验,如下图,我用的 Pytorch,用全零初始化 w ,最终也是能拟合的:
image
image

我不知道是我弄错了还是这里的答案写错了呢

8.6.3节中采用RNN实现自回归模型似乎有问题

输入的x的size为[batch_size, seq_len],通过 rnn_out, _ = self.rnn(input.view(len(input), 1, -1))中的变换后送入rnn中的含义为[batch_size,1,seq_len],这里似乎是有问题的,原因在于rnn的默认的第一个维度为时间序列,不是batch_size。

4.6.7感觉也不太对

放在激活函数前后应该只影响前向传播,反向传播的话,那些被置为0的神经元对应的梯度应该都是不影响的,都是0

第九章

求第九章的答案 跪谢🙇‍♂️

公式推导问题

image
请问这里是不是写反了呢,感觉应该是 b - x_i 才对啊

3.1. 线性回归答案的一些想法

3.1.1
第一问的证明办法有点麻烦,直接对b求导,我们可以得到
$\sum 2(b-x_i)=0$展开后可以直接得到最后的结果
然后第二问我感觉描述有点混乱,我的建议改成:
对于一个正态分布的总体,取n次样本 $x_1, \dots x_n $,此时得到的样本平均数为 $\bar x$,根据大数定理,在取较多的样本,也就是n比较大的时候, $b=\bar x$会趋近于正态分布的期望(或者中心)
3.1.2
“多个局部最小值”的说法不成立,算一下Hessian可以知道解析解应该是global minimum($X^TX$是半正定矩阵),

练习 7.1.4

练习 7.1.4这里应该占用显存和计算量大的都是后面的全连接层
来自gpt3.5的答案:
在AlexNet中,主要占用显存的部分是最后两个隐藏层,它们分别需要计算大小为64004096和40964096的矩阵,这对应于164 MB的内存占用。这两个隐藏层的计算量较大,需要进行81 MFLOPs的计算,这也是计算上的主要开销。

在计算性能方面,最后两个隐藏层需要更多的计算资源,因为它们的参数数量庞大,分别有超过4000万个参数。这导致了81 MFLOPs的计算开销,相对较高。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.