Comments (3)
Exactly, the evaluation on the small human-annotated dataset can only be performed via the leaderboard. This is because the dataset is relatively small, so it would be easy to overfit to it.
Having said that, you can use the documentation<->code pairs as a proxy task, along with any additional data you'd like (e.g. other data that you think could be useful during training/validation).
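To make the proxy task concrete, here is a minimal sketch of an offline evaluation, assuming each record carries a `docstring` and a `code` field. Each docstring is treated as a query, its own code as the only positive, and every other snippet in the pool as a negative. The names `proxy_mrr` and `token_overlap_score` are hypothetical, and the token-overlap scorer is only a toy stand-in for a trained model, not the method used by the leaderboard.

```python
from typing import Callable, Dict, List


def token_overlap_score(query: str, code: str) -> float:
    """Toy stand-in for a learned model: Jaccard overlap of whitespace tokens."""
    q, c = set(query.lower().split()), set(code.lower().split())
    return len(q & c) / len(q | c) if q | c else 0.0


def proxy_mrr(records: List[Dict[str, str]],
              score: Callable[[str, str], float]) -> float:
    """Mean reciprocal rank on documentation<->code pairs.

    For each docstring, its own code is the only positive; all other
    snippets in `records` act as negatives (distractors).
    """
    if not records:
        raise ValueError("records must not be empty")
    reciprocal_ranks = []
    for i, rec in enumerate(records):
        true_score = score(rec["docstring"], rec["code"])
        # Rank = 1 + number of distractors scoring strictly higher.
        rank = 1 + sum(
            score(rec["docstring"], other["code"]) > true_score
            for j, other in enumerate(records) if j != i
        )
        reciprocal_ranks.append(1.0 / rank)
    return sum(reciprocal_ranks) / len(reciprocal_ranks)


# Hypothetical records, for illustration only.
records = [
    {"docstring": "read a file and return its lines",
     "code": "def read_lines(path): return open(path).read().splitlines()"},
    {"docstring": "add two numbers",
     "code": "def add(a, b): return a + b"},
]
print(proxy_mrr(records, token_overlap_score))
```

Swapping `token_overlap_score` for a model's similarity function gives an offline validation signal without touching the human-annotated queries.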
from codesearchnet.
Hi @Sewens,
Does the response here answer your question?
I see. You mean that every query<-->code snippet relevance label can be seen as a relationship (pair) between code documentation<-->main code body.
You mean that for offline model training/testing we can only use the code you provided. The true label and negative samples need... We need to train a model that can select the true code that matches the documentation. The annotations will NOT be published, so we can only evaluate the model via the WandB platform.