
pretrained-models's People

Contributors

zhuiyitechnology

pretrained-models's Issues

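Training process for SimBERT small and tiny

SimBERT is known to be obtained by training bert-base further on a similar-sentence corpus. How were the small and tiny versions trained? For example:
Method 1: train bert-small/tiny directly on the similar-sentence corpus
Method 2: distill simbert-small/tiny using the trained SimBERT as the teacher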

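Problem loading SimBERT in PyTorch

Hello Su, I have a question about loading SimBERT in PyTorch.

Loading the weights directly with the transformers package gives a dimension mismatch.

It looks like the TF weights have an extra linear layer in the embedding part that the torch version does not have [bert/encoder/embedding_hidden_mapping_in/].

Why do the released weights include this extra part? Would it be enough to add this linear layer to the embedding part of the torch model and then load the weights?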

key_size?

What is the key_size at the very bottom?

About the file name

Why is the extracted file named bert_model.ckpt.data-00000-of-00001? It cannot be read when passed as checkpoint_path.
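
A possible explanation, not confirmed by the repository authors: TensorFlow checkpoints are normally stored as several files that share a prefix (an .index file plus one or more .data-XXXXX-of-XXXXX shards), and loaders usually expect the shared prefix (here bert_model.ckpt) rather than the name of an individual shard. The helper below is a minimal sketch of deriving that prefix from a file name; the function name checkpoint_prefix is hypothetical and is not part of this repository or of TensorFlow.

```python
from pathlib import Path


def checkpoint_prefix(path: str) -> str:
    """Return the shared checkpoint prefix for a TensorFlow checkpoint file.

    Hypothetical helper for illustration: strips the ".index", ".meta", or
    ".data-XXXXX-of-XXXXX" suffix so the result can be tried as checkpoint_path.
    Paths without such a suffix are returned unchanged.
    """
    p = Path(path)
    name = p.name
    if name.endswith(".index") or name.endswith(".meta"):
        # Drop only the final extension, e.g. "bert_model.ckpt.index".
        name = name.rsplit(".", 1)[0]
    elif ".data-" in name:
        # Drop the shard suffix, e.g. ".data-00000-of-00001".
        name = name.split(".data-", 1)[0]
    return str(p.with_name(name))


prefix = checkpoint_prefix("bert_model.ckpt.data-00000-of-00001")
print(prefix)
```

Under this assumption, the value to try as checkpoint_path would be the directory of the extracted files joined with bert_model.ckpt, provided the matching .index file sits in the same directory.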

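Changing max_length has no effect

Can the length of the generated similar sentences be controlled? I have already changed max-length to 200, but the generated sentences are still very short, roughly 50 characters.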
SENTENCE_MAX_LENTH = 200

config = {
    'model_path': '/data/public/wanghao/code/nlpcda/model/chinese_simbert_L-12_H-768_A-12',
    'device': 'cuda',
    'max_len': SENTENCE_MAX_LENTH,
    'seed': 1
    }

Example (the first line is the input sentence; the last column appears to be the similarity score):
这个这个钱打扮可能会有这种疑问吧,呃,那如果说我们这边呢,就到时候人安详离去了,或者说因为一些意外啊,到时候人不在了,我们是延续了一个这个保险金的事,保险金到时候呢,但是返还所交保费去到时候交了多少, 1 1
这个钱打扮可能会有这种疑问吧,那如果说我们这边的人安详离去了,或者说因为一些意外离去了,保险金到时候 1 0.9049341678619385
我这里的钱可能会有这种疑问吧,如果我们这边的人安排我们去了,我们人安详的,我会怎么做,保险金交了多少,那么 1 0.8423924446105957
在这个保险公司上班,这个钱打扮可能会有这种疑问吧,如果说我们这边的人安详离去了,我会有这样的事吗 1 0.8116059899330139
我这边有点不在,这个钱打扮可能有这种疑问吧,如果我在这边的,那我会不会是我们这边的人安详离去了,我 1 0.7801448106765747
