da-southampton / read_bert_code Goto Github PK

View Code? Open in Web Editor NEW

606.0 10.0 147.0 6.27 MB

Bert源码阅读与讲解(Pytorch版本)-以BERT文本分类代码为例子

Python 99.70% Shell 0.30%

read_bert_code's People

Stargazers

Watchers

Forkers

nanqiai lemonqc tjuyanghw zhenpingli rotcx sunshuoying anshiquanshu66 leeyongchao zcoder233 wjq310 snaildm yucongo 15810856129 dustone-mu ccer-2019 acproject kirtoasua qiudaoyumcg aojugg wusanshou2017 houpanpan yidiwang97 kk19990709 seeker1943 dtxwhzw zhulongt carolmky hungrysharkkk whuuni lx-nlp jackkaikai sui6662012 wwlaoxi malyang tommywhy dutxt cagaha deeplearning-machinelearning mirawy hhucs 6104cb mjx990307 kii-chan-iine yk135915 williams-hao yubin-qin supercell532 big-data-ai lairongxuan rk19016 tonchen3 hjhgjghhg gosundy omomomomo tulpen ydjiao starphantom666 zhaojiangmiao chaowujidhl dongzhihui harryingit3 feng1201 epic327 gaokaizhi sijunx enjoytoshare cc1019054695 zcf131016 allenzhang-oops carlzhang-hust edwinlzw isalinameng haofeng0705 huangyanhui allenshow rsdljm byting820 zhuimengxuebao yuming-l rocket82 spyairsg fang98525 jianyuanding zurichrain lisongquan95 loumoumo guoxinyi911 uccme af-74413592 chimo3333 kayson666 wjzhang121 cyx-cmd gshan4056 aicaicaicai chshychen pppddp hyh012356789 daisy9977525 zhouning6000

read_bert_code's Issues

转换模型后无法读取

Traceback (most recent call last):
File "run_classifier.py", line 522, in
main()
File "run_classifier.py", line 460, in main
config = config_class.from_pretrained(args.config_name if args.config_name else args.model_name_or_path, num_labels=num_labels, finetuning_task=args.task_name)
File "/home/cirlab1/userdir/liujin/NLP/Read_Bert_Code/bert_read_step_to_step/transformers/configuration_utils.py", line 154, in from_pretrained
config = cls.from_json_file(resolved_config_file)
File "/home/cirlab1/userdir/liujin/NLP/Read_Bert_Code/bert_read_step_to_step/transformers/configuration_utils.py", line 189, in from_json_file
return cls.from_dict(json.loads(text))
File "/home/cirlab1/anaconda3/lib/python3.6/json/init.py", line 349, in loads
s = s.decode(detect_encoding(s), 'surrogatepass')
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 0: invalid start byte

py文件

convert_tf_checkpoint_to_pytorch，这个貌似需要指定一个转换的模型

convert_tf_checkpoint_to_pytorch.py 没有找到

这个很棒

读文件

bert-base-chinese的文件读取不下来，解码有问题，请问要怎么解决呀

question

什么时候安排一下bert源码解读的视频呀大佬

如何理解BertSelfAttention中transpose_for_scores函数的逻辑？

你好，请问在BertSelfAttention中，hidden_states经过Q、K、V三个矩阵后分别得到mixed_query_layer，mixed_key_layer，mixed_value_layer三个结果，问题是：这三个结果为什么都要经过transpose_for_scores函数处理？特别是transpose_for_scores函数中的new_x_shape = x.size()[:-1] + (self.num_attention_heads, self.attention_head_size)该如何理解？

或者换个问法：为什么通过new_x_shape = x.size()[:-1] + (self.num_attention_heads, self.attention_head_size)就可以实现多头？

bert_read_step_to_step\idea\ PyCharm配置文件
bert_read_step_to_step\prev_trained_model\bert-base-chinese\ 预训练的数据集文件

大佬辛苦了！

da-southampton / read_bert_code Goto Github PK

read_bert_code's People

Stargazers

Watchers

Forkers

read_bert_code's Issues

转换模型后无法读取

py文件

convert_tf_checkpoint_to_pytorch.py 没有找到

读文件

question

如何理解BertSelfAttention中transpose_for_scores函数的逻辑？

能做一期视频教学吗

Unknown command: --model_type=bert

请求添加一下.gitignore文件，让大伙儿追更起来更加方便

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent