Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In Proceedings of The 46th IEEE/ACM International Conference on Software Engineering (ICSE 2024), Lisbon, Portugal, April 2024
Hi, I ran into some problems cleaning the generated code. I would appreciate it if you could help me out.
After using clean_generations.py to clean the translations generated by StarCoder, I found that the cleaning quality was very poor, and the cleaned code cannot reproduce the RQ1 results in the artifact. Are there any other post-processing techniques I should apply before testing translation performance?