ubc-mds / taracyc_ocean_virus_analysis Goto Github PK
View Code? Open in Web Editor NEWDSCI 522 Data Science Workflows Project
DSCI 522 Data Science Workflows Project
Hi,
Follow the following steps in order to work with fork-pull request.
Clone the forked repository locally
Follow the steps here to set an upstream as your master branch
https://help.github.com/articles/configuring-a-remote-for-a-fork/
Now when you makes changes to your "forked repository", push them, make a pull request, the other team member will critique it and accept it
For updating your forked repository, so that it's up to date with the master branch, follow the following steps:
https://help.github.com/articles/syncing-a-fork/
@HarjyotKaur and @heathervant , you have chosen an interesting question and I look forward to reading more on your analysis. This analysis will be very useful for me personally as I have such little knowledge on biology topics and I want to change that :) Here are some improvement points and minor suggestions for your project.
Reasoning
Mechanics
Your names should not be a part of the project repository name. The name should be more descriptive of the project. For example, simply "taracyc_analysis" can be a great name for your project. I would also recommend taking "DSCI_522" out of the name of your project. But if you like it this way, no problem. You should also update your readme title based on the changes you make in the repository title.
Always include a link when you point your readers to a specific file and/or folder in your repository. For example, include the link to the file within the sentence "For further information, the steps followed for downloading this data have been outlined in the same repository."
The screenshot of the data import is useful, but I would suggest using a code chunk for the code and printing output as a table. I believe it might look more professional that way. You can do this easily by creating the README file as an R markdown and knitting as github_document.
I see that you have found a solution within Github to store the large data set needed for your analysis. The data repository is well-organized and explains the data extraction steps quite well. Just don't forget to add a link to your analysis repository in the readme of the data repository.
Minor Suggestions:
It is great you mention the authors in the readme. However, my recommendation is to include your github.com profile names and links instead of your UBC cwl profiles. These repositories are public, so your public github profiles would be more meaningful to those looking at your project.
My recommendation is to store only raw data or metadata files in the data/
folder. Your explanations and instructions about the data source and the data, data_load.md
file, can be in doc/
folder rather than the data/
folder. However, in the end, it is up to you.
Your commit messages are meaningful in general. Although I spot some commit messages that are version statements. Try to eliminate such messages and instead state the change you made. I would recommend checking out the post here to improve your committing habits.
I hope this feedback is helpful in improving your project. Please let me know if you have any questions. Good luck!
@heathervant and @HarjyotKaur great job on Milestone 2! There is not much I can add. I will only suggest taking a look at http://swcarpentry.github.io/make-novice/reference. This would help make your Make code more practical. For example, you can use automatic variables and avoid some repetition.
@heathervant @HarjyotKaur
Your readme, final report and scripts are very informative and organized. You have done a great job!
Here are some remaining issues:
Suggestions:
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.