hillzhang1999 / ctc-report
CTC2021: SOTA solution and online demo for the Chinese Text Correction competition
License: Apache License 2.0
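Thanks for open-sourcing this. A question about model ensembling: are the probabilities averaged during training, or are they averaged at inference time, after each model has been trained?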
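To make the second option in the question concrete, here is a minimal sketch of inference-time ensembling: each model is trained independently, and only their per-position output distributions are averaged before decoding. The thread does not state which variant the authors used, and the function names `average_probs` and `decode`, along with the toy distributions, are hypothetical.

```python
def average_probs(model_outputs):
    """Average per-position distributions from several already-trained models.

    model_outputs: list (one entry per model) of lists (one entry per position)
    of dicts mapping candidate token -> probability. All models are assumed
    to share the same vocabulary at each position.
    """
    n_models = len(model_outputs)
    n_positions = len(model_outputs[0])
    averaged = []
    for pos in range(n_positions):
        vocab = model_outputs[0][pos].keys()
        averaged.append(
            {tok: sum(m[pos][tok] for m in model_outputs) / n_models for tok in vocab}
        )
    return averaged


def decode(averaged):
    """Pick the highest-probability token at each position."""
    return [max(dist, key=dist.get) for dist in averaged]


# Toy outputs from two hypothetical models for a one-character sentence.
model_a = [{"在": 0.6, "再": 0.4}]
model_b = [{"在": 0.3, "再": 0.7}]
prediction = decode(average_probs([model_a, model_b]))
```

Averaging at inference leaves each model's training untouched, so models can be added to or dropped from the ensemble without retraining.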
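Hello, I read your team's report on the method used in the CTC2021 competition and would like to learn from it. Could you share the source code for the full correction pipeline you implemented? It would be used for study only. My email is [email protected]; if you are able to share the source code, please contact me. Thank you.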
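"For each position in the sentence, select the 10 candidate characters with the highest probability;"
This step is a top-k selection problem. Does it have a significant impact on performance?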
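As a rough illustration of the cost in question, the sketch below selects the top-k candidates at each position with a bounded heap, which takes O(V log k) per position for a vocabulary of size V instead of sorting the whole vocabulary. It is not the authors' implementation; `top_k_candidates` and the toy distributions are hypothetical, and a real system would more likely run a batched top-k over tensors on the GPU.

```python
import heapq


def top_k_candidates(position_probs, k=10):
    """Return the k highest-probability (char, prob) pairs for each position.

    position_probs: list of dicts, one per sentence position, mapping
    candidate character -> probability.
    """
    results = []
    for probs in position_probs:
        # heapq.nlargest keeps only k items in its heap, so the full
        # vocabulary is never sorted; results come back in descending order.
        results.append(heapq.nlargest(k, probs.items(), key=lambda item: item[1]))
    return results


# Toy example: two positions, three candidate characters each, k = 2.
toy = [
    {"他": 0.1, "她": 0.7, "它": 0.2},
    {"他": 0.5, "她": 0.2, "它": 0.3},
]
top2 = top_k_candidates(toy, k=2)
```

With k fixed at 10, the per-position cost grows linearly with the vocabulary size, so whether it matters in practice depends on how it compares with the model's forward pass.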
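How exactly is the automatic extraction of semantic templates done? Is it based on the Modern Chinese content-word collocation dictionary (现代汉语实词搭配词典)?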
Thanks for publishing the solution. How can the Wiki-Edits data be obtained?
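Looking at the examples, I still cannot work out how this is done. Intuitively, for parallel texts with many edit points, the extracted labels would not be very accurate, would they? Also, eagerly awaiting the open-source release...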
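Is the spelling correction model similar to macbert4csc in the open-source pycorrector project? Also, is the decoding trick for spelling correction applied at prediction time? Does it need to be applied during training as well?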
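Hello, sorry to bother you. When training a Chinese GECToR model, I found that training on a large amount of pseudo data gives about the same results as fine-tuning directly on the task data. Two questions: 1) In your CTC competition solution, what performance does pseudo-data pre-training alone (without task fine-tuning) reach? (Or have you tested the baseline model provided by CTC?) 2) Is your GECToR training code adapted from the code provided by the CTC competition? Thanks.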