joeldan192 / predictive-model-for-the-voting-system-in-se-network
Semester project Spring 2016 as part of Master's program at EPFL. Supervisor: Dr. Elisa Celis. Lab: LCA-3.
In the Kranks(2) effects model there is evidence that the model is quite relevant at 0.03. However, the current R^2 is computed on the training data. Plot RSS or R^2 on test data instead.
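A minimal sketch of the held-out evaluation mentioned above, assuming predictions for a test split are already available. The function names and the toy numbers are illustrative only and do not come from the project; note that this version measures R^2 against the mean of the test set, which is one of several possible conventions.

```python
def rss(y_true, y_pred):
    """Residual sum of squares between observed and predicted values."""
    return sum((y - p) ** 2 for y, p in zip(y_true, y_pred))


def r_squared(y_true, y_pred):
    """R^2 relative to the mean of y_true (here: the held-out data)."""
    mean_y = sum(y_true) / len(y_true)
    tss = sum((y - mean_y) ** 2 for y in y_true)
    return 1.0 - rss(y_true, y_pred) / tss


# Toy held-out data (hypothetical values, for illustration only).
test_votes = [0, 1, 3, 2]
test_pred = [0.5, 1.0, 2.5, 2.0]

test_rss = rss(test_votes, test_pred)
test_r2 = r_squared(test_votes, test_pred)
print(test_rss, test_r2)
```

Computing both numbers on data the model never saw gives a fairer picture than the training-set R^2.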
Whether an Answer is voted on or not might be correlated with:
Pre-processing step to fit the Random effects model.
rank: accepted -> score -> random
Votes are sampled per day and per answer.
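The rank rule above can be read as a sort precedence: accepted answers first, then higher score, then a random tie-break. The sketch below follows that reading, which is an assumption; the field names (`ans_id`, `accepted`, `score`) and the seeded generator are hypothetical choices made for the example.

```python
import random


def rank_answers(answers, seed=0):
    """Return (position, ans_id) pairs, position starting at 1.

    Ordering (assumed reading of "accepted -> score -> random"):
    accepted answers first, then descending score, then a random tie-break.
    """
    rng = random.Random(seed)  # seeded so the tie-break is reproducible
    keyed = [(not a["accepted"], -a["score"], rng.random(), a) for a in answers]
    keyed.sort(key=lambda t: t[:3])
    return [(pos, t[3]["ans_id"]) for pos, t in enumerate(keyed, start=1)]


# Hypothetical snapshot of one question's answers on a given day.
answers = [
    {"ans_id": 1, "accepted": False, "score": 5},
    {"ans_id": 2, "accepted": True, "score": 1},
    {"ans_id": 3, "accepted": False, "score": 7},
]
ranking = rank_answers(answers)
print(ranking)
```

Recomputing this ranking for every (day, answer) sample would give the position feature used later in the models.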
Goals:
a)
Votes ~ score + age + nb_comments + nb_ans + position
TODO: nb_ans, position
b) For now the actual model is:
Votes ~ Poisson(e^(quality + position_bias + age + social_influence(score + TODO in goal a)))
quality parameter is by post (answer)
position_bias by post and by_observation (changes with time)
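To make model b) concrete, here is a small sketch of the Poisson rate and the log-likelihood of one observation. It assumes every term has already been reduced to a single number on the log scale; the parameter values are made up, and `age_effect` and `social_influence` stand in for whatever functional form the fitted model uses.

```python
import math


def vote_rate(quality, position_bias, age_effect, social_influence):
    """Poisson rate: the exponential of the sum of the linear terms."""
    return math.exp(quality + position_bias + age_effect + social_influence)


def poisson_log_likelihood(votes, rate):
    """Log-probability of observing `votes` under Poisson(rate)."""
    return votes * math.log(rate) - rate - math.lgamma(votes + 1)


# Hypothetical parameter values for one (answer, day) observation.
rate = vote_rate(quality=0.2, position_bias=-0.5, age_effect=-0.1,
                 social_influence=0.4)
ll = poisson_log_likelihood(votes=2, rate=rate)
print(rate, ll)
```

Summing this log-likelihood over all (answer, day) observations gives the objective that a fitting routine would maximize.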
Reproduce and assess some results in the first week's papers.
Features that entail cognitive bias:
Suggested features (from context):
To try:
Fixed effects (because of plm implementation).
The multinomial model assumes independence across individual Q-t pairs, and so does
the 'arellano' (vcovHC) correction. Introduce fixed effects for Qid.
Same for AnsId, as an answer moves through time: Qi-tk is assumed independent of Qi-tj, so we
could account for that dependence.
Warning: the number of parameters would then be very large.
This can be done only with random effects, using the lme4 package, but no covariance estimator
corrections work there.
Namely, in nb Answer_votes, after merging with comments, inconsistent duplicates are generated for
AnsId-Age pairs.
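One way to locate such duplicates is to group the merged rows by (AnsId, Age) and flag the keys whose remaining columns disagree. This is a sketch under assumptions: rows are represented as plain dictionaries, and the `Votes` and `nb_comments` columns in the toy data are only a guess at the merged layout.

```python
from collections import defaultdict


def inconsistent_duplicates(rows, key_fields=("AnsId", "Age")):
    """Return the keys that occur with more than one distinct set of values."""
    variants = defaultdict(set)
    for row in rows:
        key = tuple(row[f] for f in key_fields)
        rest = tuple(sorted((k, v) for k, v in row.items() if k not in key_fields))
        variants[key].add(rest)
    # Exact duplicates collapse in the set; only conflicting rows remain > 1.
    return sorted(key for key, seen in variants.items() if len(seen) > 1)


# Hypothetical merged rows (values invented for illustration).
merged = [
    {"AnsId": 10, "Age": 1, "Votes": 2, "nb_comments": 0},
    {"AnsId": 10, "Age": 1, "Votes": 2, "nb_comments": 3},  # conflicts with row above
    {"AnsId": 10, "Age": 2, "Votes": 0, "nb_comments": 3},
    {"AnsId": 11, "Age": 1, "Votes": 1, "nb_comments": 0},
    {"AnsId": 11, "Age": 1, "Votes": 1, "nb_comments": 0},  # exact duplicate, harmless
]
bad_keys = inconsistent_duplicates(merged)
print(bad_keys)
```

Exact duplicates can simply be dropped, while the flagged keys point to rows where the merge itself needs to be revisited.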
ReScore has very good performance. Test if this result is robust enough with a few different regularizers.
"Stable" time intervals: intervals during which the number of answers doesn't change. (Sub-divide further so that rank doesn't change either?)
Such intervals would permit a nested mixed effects model:
time intervals > nb_ans > ans_id > rank (TODO: evaluate whether the time-interval level is nested or crossed)
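The "stable" intervals described above can be extracted by splitting a question's timeline into maximal runs with a constant number of answers. The sketch assumes daily observations given as `(day, nb_ans)` tuples sorted by day; that input layout is an assumption for the example.

```python
from itertools import groupby


def stable_intervals(observations):
    """Split (day, nb_ans) observations into runs with constant nb_ans.

    Returns a list of (start_day, end_day, nb_ans) tuples.
    """
    intervals = []
    for nb_ans, run in groupby(observations, key=lambda obs: obs[1]):
        days = [day for day, _ in run]
        intervals.append((days[0], days[-1], nb_ans))
    return intervals


# Hypothetical timeline for one question.
timeline = [(0, 1), (1, 1), (2, 2), (3, 2), (4, 2), (5, 3)]
intervals = stable_intervals(timeline)
print(intervals)
```

The same grouping could be keyed on the full rank ordering instead of `nb_ans` to obtain the finer sub-division.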
Users might optimize k-ranks. (Consecutive groups of answers from first to last).
This implies one group parameter per answer in voting data.
Mean values for ReScore(s) and Pbias(s) are too low.
Might consider another normalization strategy, perhaps by classes (Ans_count).
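One possible reading of normalization by classes is to rescale each value by the mean of its Ans_count class. The sketch below implements that reading only; it is an assumption, not the project's method, and the row layout and the `ReScore` values are invented for illustration.

```python
from collections import defaultdict


def normalize_by_class(rows, value_field, class_field="Ans_count"):
    """Divide each value by the mean of its class (assumed strategy)."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for row in rows:
        sums[row[class_field]] += row[value_field]
        counts[row[class_field]] += 1
    means = {c: sums[c] / counts[c] for c in sums}
    # Classes with a zero mean are left at 0.0 to avoid division by zero.
    return [row[value_field] / means[row[class_field]]
            if means[row[class_field]] else 0.0
            for row in rows]


# Hypothetical rows: two questions' worth of answers.
rows = [
    {"Ans_count": 2, "ReScore": 0.1},
    {"Ans_count": 2, "ReScore": 0.3},
    {"Ans_count": 5, "ReScore": 0.02},
    {"Ans_count": 5, "ReScore": 0.02},
]
normalized = normalize_by_class(rows, "ReScore")
print(normalized)
```

After this step every class has mean 1, so values from questions with many answers are no longer systematically smaller.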