mootezsaad / bugbert Goto Github PK
View Code? Open in Web Editor NEWLicense: Apache License 2.0
License: Apache License 2.0
What fields will be stored in the index?
What fields to return from the index?
Transform MongoDB documents into the JSON Structure, which was discussed yesterday.
Go through the DeCLUTR repository and adapt their replication package to our needs.
To reiterate, during our last meeting we agreed not to follow DeCLUTR's input representation that is based on anchor and positive spans (within the same document) and negative spans (within the other documents). We agreed that we will use the vanilla BERT input structure: [CLS] BR_DESC [SEP] DUP_BR_DESC [SEP]
. The inclusion of the title would depend on the results of the EDA.
In addition, we agreed that we will pretrain the encoder in a supervised manner since we have the labels. As of writing this issue, we may want to use InfoNCE as a potential loss function.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.