Giter Site home page Giter Site logo

Comments (9)

ymcui avatar ymcui commented on June 18, 2024

凭直觉看应该是在预测DRCD_test.json的时候网络没有被initialize。
init_checkpoint在单独做predict的时候是不起作用的。
接下来:

  1. 你需要查看一下${MODEL_DIR}/model_ckpt目录下是否有xlnet的model文件。
  2. 查看log中是否包含类似Restoring parameters from这样的信息,如果没有就说明model没有被正确加载进去。

from chinese-xlnet.

allenyummy avatar allenyummy commented on June 18, 2024

是因為這個緣故嗎?! 感謝您的回覆!
image

from chinese-xlnet.

allenyummy avatar allenyummy commented on June 18, 2024

目前 ${MODEL_DIR}/model_ckpt目錄底下的文件
image

請問該如何解決model没有被正确加載進去的問題..

from chinese-xlnet.

ymcui avatar ymcui commented on June 18, 2024

上面那个Estimator的问题可以忽略。你需要再看一下log中是否包含Restoring parameters from这样的信息。

from chinese-xlnet.

allenyummy avatar allenyummy commented on June 18, 2024

有出現這個訊息
image

from chinese-xlnet.

ymcui avatar ymcui commented on June 18, 2024

看似weights应该是读进去了,方便的话可以把log和prediction.json发给我看看, [email protected]

from chinese-xlnet.

allenyummy avatar allenyummy commented on June 18, 2024

已寄出,十分感謝協助!!

from chinese-xlnet.

ymcui avatar ymcui commented on June 18, 2024

由于命名原因,dev和test生成的tf_records/features.pkl都会命名成XLNet/proc_data/spiece.model.slen-256.qlen-64.eval.features.pkl
你能否改变一下--output_dir=到其他的目录,重新预测一遍DRCD_test.json,这样可以重新生成features和tf_records,看看会有什么样的结果。

from chinese-xlnet.

allenyummy avatar allenyummy commented on June 18, 2024

成功了!
test set
EM: 84.42
F1: 92.85
十分感謝!!

from chinese-xlnet.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.