ai-se / bellwether_community
Bellwether Community detection with JS projects using r2c
License: GNU General Public License v3.0
For every node:
Bubble algorithm:
def bubble(projects):
    project_attrs = get_attr(projects)
    birch_tree = birch(project_attrs)
    max_level = birch_tree.max_level
    bellwethers = {}  # cluster id -> bellwether found for that cluster
    # Walk the BIRCH tree bottom-up, from the deepest level to level 1.
    for level in range(max_level, 0, -1):
        level_cluster_ids = birch_tree.get_cluster_ids(level)
        if level == max_level:
            # Deepest level: search for a bellwether inside every cluster.
            # Assumed here: get_bellwether returns {cluster_id: bellwether}.
            bellwethers = get_bellwether(level_cluster_ids)
        else:
            # Best (score, bellwether) promoted into each parent cluster.
            bellwether_score = {}
            for bellwether in bellwethers.values():
                parent_cluster = bellwether.parent_cluster_id
                s_project = bellwether.project
                d_projects = birch_tree.cluster[parent_cluster].projects
                score = apply_bellwether(s_project, d_projects)
                if (parent_cluster not in bellwether_score
                        or score > bellwether_score[parent_cluster][0]):
                    bellwether_score[parent_cluster] = (score, bellwether)
            bellwethers = {cid: best for cid, (_, best) in bellwether_score.items()}
            # Clusters at this level that received no promoted bellwether
            # get a fresh search of their own.
            remaining_cluster = [cid for cid in level_cluster_ids
                                 if cid not in bellwethers]
            bellwethers.update(get_bellwether(remaining_cluster))
    return bellwethers
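The bubble procedure above can be tried end to end on toy data. The following is a minimal sketch, not the repository's implementation: the `Cluster` class, `pick_bellwether`, `toy_score`, the `FEATURE` values, and the cluster names are all hypothetical stand-ins for the BIRCH tree, `get_bellwether`, and `apply_bellwether`. It assumes a higher score means a better bellwether.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

Score = Callable[[str, List[str]], float]


@dataclass
class Cluster:
    cluster_id: str
    level: int                 # 1 = top level, larger numbers are deeper
    parent_id: Optional[str]   # None for top-level clusters
    projects: List[str]


def pick_bellwether(projects: List[str], score: Score) -> str:
    """Exhaustive search: the project that scores best against its own cluster."""
    return max(projects, key=lambda p: score(p, projects))


def bubble(clusters: Dict[str, Cluster], score: Score) -> Dict[str, str]:
    """Return {cluster_id: bellwether project}, computed bottom-up."""
    by_level: Dict[int, List[Cluster]] = {}
    for c in clusters.values():
        by_level.setdefault(c.level, []).append(c)
    max_level = max(by_level)

    bellwethers: Dict[str, str] = {}
    for level in range(max_level, 0, -1):
        # Try each child's bellwether on its parent; keep the best per parent.
        best: Dict[str, Tuple[float, str]] = {}
        for child in by_level.get(level + 1, []):
            candidate = bellwethers[child.cluster_id]
            parent = clusters[child.parent_id]
            s = score(candidate, parent.projects)
            if parent.cluster_id not in best or s > best[parent.cluster_id][0]:
                best[parent.cluster_id] = (s, candidate)
        # Clusters without a promoted candidate get a full search instead.
        for c in by_level.get(level, []):
            if c.cluster_id in best:
                bellwethers[c.cluster_id] = best[c.cluster_id][1]
            else:
                bellwethers[c.cluster_id] = pick_bellwether(c.projects, score)
    return bellwethers


# Hypothetical one-number "feature" per project, for illustration only.
FEATURE = {"a": 1, "b": 2, "c": 4, "d": 8, "e": 9, "f": 20,
           "g": 30, "h": 31, "i": 35}


def toy_score(source: str, targets: List[str]) -> float:
    """Toy stand-in for apply_bellwether: closer to the targets is better."""
    others = [t for t in targets if t != source]
    if not others:
        return 0.0
    return -sum(abs(FEATURE[source] - FEATURE[t]) for t in others) / len(others)


clusters = {
    "top": Cluster("top", 1, None, ["a", "b", "c", "d", "e", "f"]),
    "solo": Cluster("solo", 1, None, ["g", "h", "i"]),
    "left": Cluster("left", 2, "top", ["a", "b", "c"]),
    "right": Cluster("right", 2, "top", ["d", "e", "f"]),
}

result = bubble(clusters, toy_score)
print(result)
```

In this toy tree, `top` only compares the two bellwethers promoted from `left` and `right` rather than searching all six of its projects, while `solo` has no children and falls back to a full search. That saving is the point of bubbling candidates upward.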
The expectation from results:
For 10 data sets, show me classification results before and after CFS
Read the project README files and manually cluster them
Use feature selection
Not all commit IDs show up in _commit.pkl and commited_file.pkl
   Unnamed: 0                                         E001  E002  E003  E004  ...  W144  W145  W146  W147  W148
0  /analysis/inputs/public/source-code/index.js          0     0     0     0  ...     0     0     0     0     0
1  /analysis/inputs/public/source-code/test/main.js      0     0     0     0  ...     0     0     0     0     0
2  /analysis/inputs/public/source-code/test/submo...     0     0     0     0  ...     0     0     0     0     0
3  /analysis/inputs/public/source-code/test/simpl...     0     0     0     0  ...     0     0     0     0     0

   Unnamed: 0           E001  E002  E003  E004  E005  E006  ...  W142  W143  W144  W145  W146  W147  W148
0  /index.js               0     0     0     0     0     0  ...     0     0     0     0     0     0     0
1  /test/main.js           0     0     0     0     0     0  ...     0     0     0     0     0     0     0
2  /test/submodule.js      0     0     0     0     0     0  ...     0     0     0     0     0     0     0
3  /test/simpleTask.js     0     0     0     0     0     0  ...     0     0     0     0     0     0     0
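The second table appears to be the first one with the `/analysis/inputs/public/source-code` prefix removed from the file-path column. A minimal sketch of that normalization in plain Python follows; `strip_prefix` is a hypothetical helper, not a function from this repository, and paths without the prefix are left unchanged.

```python
from typing import List

# Prefix seen in the first table's file-path column.
PREFIX = "/analysis/inputs/public/source-code"


def strip_prefix(paths: List[str], prefix: str = PREFIX) -> List[str]:
    """Drop the analysis prefix so paths are relative to the project root."""
    return [p[len(prefix):] if p.startswith(prefix) else p for p in paths]


paths = [
    "/analysis/inputs/public/source-code/index.js",
    "/analysis/inputs/public/source-code/test/main.js",
]
relative = strip_prefix(paths)
print(relative)
```

With the prefix gone, file paths from the analysis output can be matched against repository-relative paths, such as those recorded per commit.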
Expectation from results: