Complete Twitter sentiment analysis that involves collecting data from the Twitter API and computing sentiment or "mood" scores from the tweets.
Note that because of the upload mechanism of how grader works, the code in the solution is not modulized.
- Practice typical Database operation using sqllite3.
- Matrix multiplication using SQL.
- Keyword search uses term document matrix.
I have used mapreduce before, thus I finished assignment three rather quickly and does not include them.
As for assignment 4, I missed the notification of this assignment and does not make it.
working with data from the SeaFlow environmental flow cytometry instrument.
A flow cytometer delivers a flow of particles through capilliary. By shining lasers of different wavelengths and measuring the absorption and refraction patterns, you can determine how large the particle is and some information about its color and other properties, allowing you to detect it.
- Use R to do basic statistical analysis.
- Use ggplot to visualize data and detect abnormal in the data.
- Use packages, a.k.a. rpart(decision tree), randomForest and e1071(svm) to do particle classification.
Use Tableau to create a series of visualizations and use them to explore a dataset about the collision of airplane and animals.
Results can be viewed here.