niyaton / kenja Goto Github PK
View Code? Open in Web Editor NEWKenja repository
License: MIT License
Kenja repository
License: MIT License
例えば以下のようなコミットをうまく変換できない.
rename convertor.py to converter.py
Thanks for the awesome tool!
I tried to install kenja with PC behind proxy and faced following problem.
$ python setup.py install
kenja/lib/java/java-parser.jar does not exist. Do you want to download it?[y/n]
y
md5 hash of kenja/lib/java/java-parser.jar is incorrect! remove it and tryagain.
This is because download_parser(self) method invokes urllib.urlretrieve() that does not consider proxy configuration.
def download_parser(self):
urllib.urlretrieve(self.parser_location, self.parser_path)
This can be modified by changing code as follows, since urllib2.urlopen() works based on http_proxy and https_proxy environment variables.
def download_parser(self):
with open(self.parser_path, 'wb') as f:
f.write(urllib2.urlopen(self.parser_location).read())
f.close()
Also I faced the similar problem on executing copy_csharp_parser()
method. In this case I commented out the line and avoid the problem, since I only targets java source code in this time.
submodule of git should be initialized when parent git repository is cloned from a remote.
Maximum length of the file name depends on file system.
However, Kenja make a file which have over 255 characters name in Historage(git).
We should consider how to avoid this problem.
When kenja converts historage, kenja makes syntax trees files into syntax_trees dir.
However, these files are temporal files.
These files are never used for mining Historage.
I think the feature storing syntax trees should be an option feature (for DEBUG).
According to my friend, he cannot convert the history of ArgoUML to the Historage.
He got the git repository of the ArgoUML by using "git svn clone" command.
The location of ArgoUML SVN repository is following URL.
http://argouml.tigris.org/svn/argouml/
According to him, the name of a branch of the imported git repository contains @ and it raise some exception in a function of GitPython library.
jEditの履歴において,
抽出元メソッド:FtpVFS._createInputStream(View,Buffer,String,boolean)
抽出後のメソッド:getFtpClient
類似度:0.323943662
というメソッド抽出リファクタリングの候補が検出されるが,
_createInputStream(View, Buffer, String, boolean)というメソッドは
検出したコミット間のbefore側には存在せず,
_createInputStream(View,String,boolean)というメソッドのみが存在する.
これは,kenjaが勝手にリネームを含めたメソッド抽出の検出を試みているためであり,
また,抽出元のメソッド名をafter側から取得しているのが原因である.
メソッド抽出リファクタリングの検出時にリネームも考慮するのかどうかを検討した上でBug Fixを実施する.
現状,--non-parallelオプションを付加しないと正常に動作しないのに,このオプションがあるのはいかがなものか.
Currently, The historage(not bare repo) dir built by kenja doesn't contain any files without .git dir.
今のところ,分析対象のリポジトリを一度にHistorageに変換しているが.
増分的にHistorageを作成できるようにしたほうが,汎用性が広がる.
コミットとコミットのリストを与えるとリファクタリング検出をするように実装すれば実験とかがしやすくなる.
Gitリポジトリは同一ツリーに同じ名前のblobを二つ作ってはいけないので対応する.
というかこれはそもそもkenja-java-parserのバグ.
I think this behavior is confusing to the users.
very dirty.
意味不明なコマンド名なので直す
In README.md, Chapter "Convert your git repository to historage",
the text is different from the stdout of "kenja.historage.convert --help"
少なくとも,0.3以降,jEditからExtract Methodを検出できなくなっているので対応する.
なお,sigss_kagawaまたは0.2ではExtract Methodを問題無く検出可能.
'git notes' command does not work when user does not set her/his email and name to git.
Extract Methodの検出で引数の個数の対応が取れてないことがたまにある.
引数は
It's a problem of GitPython.
please refer following issue.
gitpython-developers/GitPython#57
Git 1.7系統だとconvert時にgitがabortするという報告を受けている.
現状,コンストラクタに関するメソッド抽出リファクタリングは検出していないので対応する.
git init時にbareオプションを付ければいいだけだが,kenja.convertの引数に対応する引数を追加する必要がある.
installの際に毎回ハッシュ値を確認して,ハッシュ値が違う場合
「すでに存在しますが,上書きしますか? y/n」
的なメッセージを出して,yesなら上書きするように変更する.
According to specification of the Git, we need to sort the order of blobs and trees in a tree.
However, current implementation of the kenja doesn't follow this specification.
In order to fix this bug, we need to modify kenja/git/util.py.
なおしたい.
現在,bin/kenja-paser.jarはシンボリックリンクで,mvnが作成するparserのパスを指定している.
WindowsのGitはシンボリックリンクをうまく扱わないようなので,このままだとWindowsユーザが使用するのに問題あり.
Making test module which confrim equivalence of two historages is usefull for development.
Currently, kenja cannot execute C# parser on Windows because it calls mono command.
Rename function add_main_command → add_main_argument (also add_option_command)
Not critical but it might be much easier to understand.
一応問題無く動いているように見えるが,Gitにおいては空ディレクトリを作成しないほうがいいようなので対応する.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.