Comments (2)
from et-bert.
您好,我使用您的代码去预处理本地的VPN-nonVPN数据集进行ISCX-VPN-Service实验,我的packet级的效果是94.7,和您98.9相差四个点,不知道是哪里出了问题,我的数据集形况如下: ./non-vpn: Chat Email File Transfer P2P Streaming VoIP
./non-vpn/Chat: AIMchat1.pcapng aim_chat_3a.pcap facebookchat1.pcapng facebookchat3.pcapng facebook_chat_4b.pcap hangouts_chat_4a.pcap ICQchat2.pcapng icq_chat_3b.pcap skype_chat1b.pcap AIMchat2.pcapng aim_chat_3b.pcap facebookchat2.pcapng facebook_chat_4a.pcap hangout_chat_4b.pcap ICQchat1.pcapng icq_chat_3a.pcap skype_chat1a.pcap
./non-vpn/Email: email1a.pcap email1b.pcap email2a.pcap email2b.pcap gmailchat1.pcapng gmailchat2.pcapng gmailchat3.pcapng
./non-vpn/File Transfer: skype_file1.pcap skype_file2.pcap skype_file3.pcap skype_file4.pcap skype_file5.pcap skype_file6.pcap skype_file7.pcap skype_file8.pcap
./non-vpn/P2P: Torrent01.pcap
./non-vpn/Streaming: netflix1.pcap netflix3.pcap spotify1.pcap spotify3.pcap vimeo1.pcap vimeo3.pcap youtube1.pcap youtube3.pcap youtube5.pcap youtubeHTML5_1.pcap netflix2.pcap netflix4.pcap spotify2.pcap spotify4.pcap vimeo2.pcap vimeo4.pcap youtube2.pcap youtube4.pcap youtube6.pcap
./non-vpn/VoIP: hangouts_audio1a.pcap hangouts_audio1b.pcap hangouts_audio2a.pcap hangouts_audio2b.pcap hangouts_audio3.pcap hangouts_audio4.pcap
./vpn: Chat Email File Transfer P2P Streaming VoIP
./vpn/Chat: vpn_aim_chat1a.pcap vpn_chat.pcap vpn_facebook_chat1b.pcap vpn_hangouts_chat1b.pcap vpn_icq_chat1b.pcap vpn_skype_chat1b.pcap vpn_aim_chat1b.pcap vpn_facebook_chat1a.pcap vpn_hangouts_chat1a.pcap vpn_icq_chat1a.pcap vpn_skype_chat1a.pcap
./vpn/Email: vpn_email2a.pcap vpn_email2b.pcap
./vpn/File Transfer: vpn_ftps_A.pcap vpn_ftps_B.pcap vpn_sftp_A.pcap vpn_sftp_B.pcap vpn_skype_files1a.pcap vpn_skype_files1b.pcap
./vpn/P2P: vpn_bittorrent.pcap
./vpn/Streaming: vpn_netflix_A.pcap vpn_spotify_A.pcap vpn_vimeo_A.pcap vpn_vimeo_B.pcap vpn_youtube_A.pcap
./vpn/VoIP: vpn_facebook_audio2.pcap vpn_hangouts_audio1.pcap vpn_hangouts_audio2.pcap vpn_skype_audio1.pcap vpn_skype_audio2.pcap vpn_voipbuster1a.pcap vpn_voipbuster1b.pcap
你可以检查一下样本中是否存在一些明显的噪声数据,即非实际vpn或non-vpn应用的通信流量。
from et-bert.
Related Issues (20)
- VRAM needed for finetuning HOT 2
- how long to train? HOT 1
- Data labeling? HOT 2
- 关于微调后模型泛化能力的问题
- CrossPlatForm数据集的问题 HOT 1
- 关于直接下载您处理好的cstnet-tls1.3数据集的疑问 HOT 4
- 关于vocab_process的问题
- Have you removed bidirectional IP and port information and protocol information to reduce the impact of packet headers? (e.g. remove 5-tuples) HOT 1
- 为什么采用bi-gram的形式,而不用tri-gram的形式 HOT 1
- How to generate .tsv files HOT 2
- 关于用于预训练的语料问题? HOT 2
- dataprogress HOT 1
- 有关从pcap生成tsv文件遇到的问题 HOT 1
- 请问ET-BERT对于纯数据流能进行识别和分类吗
- vocab_process/main.py 中缺少变量的全局定义
- uer utils中的misc.py问题
- Error in data processing in VPN dataset
- bugfix in main/data_process/dataset_cleaning.py
- ET-BERT corpora lose
- 语料库生成问题
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from et-bert.