stucco / auto-labeled-corpus
Corpus of auto-labeled text for the cyber security domain
It is unclear whether the corpus (full_corpus.json) can be distributed and/or modified, and if so, to whom to give credit.
If it is OK to distribute or modify this corpus, please include some kind of CC license.
Where can I get the nvdcve-2.0-xxxx.graphson file? It seems that the nvdcve-2.0-xxx.graphson file in the folder is used directly when running the code.
In full_corpus.json there are many cases of:
I am new to Git LFS. I found that the graphson file contains only this information:
version https://git-lfs.github.com/spec/v1
oid sha256:9df76c00cceeb0ef8bd7d4b89754882e5efd160c6fb2636ebc8d7ba84d458b92
size 56683610
Please help me. Thanks in advance.
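The three lines quoted above (version, oid, size) have the shape of a Git LFS pointer file, which suggests the clone holds a small text placeholder rather than the roughly 56 MB graphson file itself. A minimal sketch for checking this locally follows; the function name `parse_lfs_pointer` is hypothetical and is not part of this repository.

```python
# Sketch only: detects whether a file on disk is a Git LFS pointer
# (a small text stub) instead of the real data file.
LFS_SPEC_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def parse_lfs_pointer(path):
    """Return the pointer fields as a dict if `path` is an LFS pointer, else None."""
    with open(path, "rb") as fh:
        head = fh.read(1024)  # pointer files are tiny; 1 KiB is plenty
    if not head.startswith(LFS_SPEC_PREFIX):
        return None
    fields = {}
    for line in head.decode("utf-8", errors="replace").splitlines():
        # Each pointer line is "<key> <value>", e.g. "size 56683610".
        key, _, value = line.partition(" ")
        if key:
            fields[key] = value
    return fields
```

If the function returns a dict for the graphson file, the real content has not been downloaded yet. With the Git LFS client installed, running `git lfs install` and then `git lfs pull` inside the clone is the usual way to fetch it, assuming the LFS objects are still hosted for this repository.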
For the 1274 NaNs that appear in place of a word, the corresponding tag is one of {'B-update', 'O', 'B-version', NaN}. The corpus may need more cleaning.
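One way to check and clean up this kind of entry is sketched below. It assumes the tokens have already been loaded into a flat list of (word, tag) pairs. That layout and the function names are assumptions for illustration, not the documented structure of full_corpus.json.

```python
import math
from collections import Counter


def is_missing(value):
    """True for None or a float NaN (what a loader may produce for an empty cell)."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def summarize_missing_words(tokens):
    """Count, per tag, how many tokens have a missing word."""
    counts = Counter()
    for word, tag in tokens:
        if is_missing(word):
            # NaN != NaN, so a missing tag is folded into one readable key.
            counts["<missing>" if is_missing(tag) else tag] += 1
    return counts


def drop_missing_words(tokens):
    """Return the tokens whose word is present, preserving order."""
    return [(word, tag) for word, tag in tokens if not is_missing(word)]
```

Running `summarize_missing_words` first shows which tags are affected before anything is dropped. Removing tokens that carry a B- tag also removes the start of an entity span, so it may be worth inspecting those cases by hand.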
It looks like I can only find the *_tagging.py, Perceptron.py, and *Functions.py scripts. Thanks!