Since we can't compare the new language-detection module to the old one, we should at least show that the new one works properly (and thus account for the minor changes we see in the figures; see wehlutyk/brainscopypaste@2b8dcb8 and wehlutyk/brainscopypaste@9163bdf).
That is, if we're using H0 to look at WN results, the destination word should be drawn from the WN pool; if it's for FA results, it should be drawn from the FA pool.
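A minimal sketch of this pool-matching rule (the pool contents and the h0_destination helper are made-up illustrations, not the actual analysis code):

```python
import random

# Toy candidate pools standing in for the real WordNet (WN) and Free
# Association (FA) vocabularies; the contents are purely illustrative.
POOLS = {
    'WN': ['happy', 'glad', 'joyful', 'content'],
    'FA': ['happy', 'smile', 'birthday', 'face'],
}

def h0_destination(feature_source, rng=None):
    """Draw a null-hypothesis destination word from the pool matching the
    feature's source: WN-coded features draw from the WN pool, FA-coded
    features from the FA pool."""
    rng = rng or random.Random()
    return rng.choice(POOLS[feature_source])
```

The point is only that the null model's candidate pool must match the feature family being tested, so the H0 comparison is apples-to-apples.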
Changing the language-detection module gave us a few more quotes. My first reproduction of the analysis gives "Stored 1188 of 6586 mined substitutions" (instead of 1051 out of 6172).
Semantic similarity: how do substitutions affect quotes, informally, and with Lauf et al.'s typification?
Introduce H00: we take similarity into account by looking at synonyms
We also looked at finer-grained similarity measures. "Semantic similarity" gives scores that are hard to interpret on their own. It does show that substitutions go to lower-similarity words rather than higher-similarity ones. Distance travelled on the FA network shows that substitutions are not between immediate neighbours. We haven't compared this to an H0 (to look for a bias), but it does rule out predicting the exact destination word from similarity alone.
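What the FA-distance measure boils down to can be sketched as a breadth-first search over a toy association graph (the graph data and the fa_distance helper are illustrative, not the real distance.ipynb code):

```python
from collections import deque

# Toy free-association graph: cue -> set of responses (made-up data).
FA = {
    'dog': {'cat', 'bone'},
    'cat': {'mouse'},
    'mouse': {'cheese'},
}

def fa_distance(source, target, graph=FA):
    """Breadth-first number of association hops from source to target;
    None if target is unreachable."""
    if source == target:
        return 0
    seen, frontier = {source}, deque([(source, 0)])
    while frontier:
        word, depth = frontier.popleft()
        for nxt in graph.get(word, ()):
            if nxt == target:
                return depth + 1
            if nxt not in seen:
                seen.add(nxt)
                frontier.append((nxt, depth + 1))
    return None
```

On this toy graph, dog → cheese takes 3 hops, so a destination word at that distance is clearly not an immediate neighbour of the source.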
Feature selection: we did better feature selection, and good news: we can keep most of our features. Explain this and merge into the domain-knowledge feature-selection part. In particular, no need for predictive model.
Rewrite the flow of the argumentation. See storytelling for that.
Exclude POS analysis, mentioning we remove stopwords in the analysis (which preempts the closed/open class analysis)
Other questions
Effect of context: if a sentence contains only high-feature words, are they replaced with words closer to those values?
Rethink susceptibility to substitution (there's a real problem: if low-frequency words have a 1/3 probability of being substituted, how do you interpret that value when a sentence contains 4 low-frequency words? Also, analyse the degenerate cases of the estimators).
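The degeneracy can be made concrete with a bit of arithmetic (a sketch; h0_share is a hypothetical helper, not the paper's estimator):

```python
from fractions import Fraction

# Each sentence contributes exactly one substitution. Reading a per-word
# "substitution probability" of 1/3 for low-frequency words is therefore
# inconsistent: a sentence with 4 such words would total 4/3 > 1.
naive_total = 4 * Fraction(1, 3)

def h0_share(n_feature_words, sentence_length):
    """Expected share of substitutions hitting the feature's words when
    the target is picked uniformly among the sentence's words (H0)."""
    return Fraction(n_feature_words, sentence_length)
```

A raw 1/3 rate is then only meaningful relative to such an H0 baseline (e.g. 4 low-frequency words out of 12 already give 1/3 under uniform picking), not as a per-word probability.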
Effect of POS (categorical feature) on variation and on substitution rate
Cross-feature effects and prediction
For susceptibility
Regressing which word is substituted (with POS, and after feature selection, showing the role of each feature). It does not work well, probably because of the one-substitution-per-sentence constraint, which can't be factored into a simple model and leads accuracy to plummet when recall goes up, and vice versa. We could try to predict in which bin (or quantile) of a sentence a substitution falls (an unconstrained problem), but that would be a prediction based on sentence features, not word features (otherwise, which words?), which is outside the scope of our paper.
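The precision/recall trade-off here can be illustrated with plain threshold classification over made-up per-word scores (all data and names below are hypothetical, not the actual regression):

```python
def precision_recall(scores, labels, threshold):
    """Precision and recall of 'this word gets substituted' predictions
    obtained by thresholding per-word scores."""
    preds = [s >= threshold for s in scores]
    tp = sum(p and l for p, l in zip(preds, labels))
    fp = sum(p and not l for p, l in zip(preds, labels))
    fn = sum(not p and l for p, l in zip(preds, labels))
    precision = tp / (tp + fp) if tp + fp else 1.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

# Made-up per-word scores; exactly one positive label per sentence makes
# positives rare, so lowering the threshold to raise recall floods the
# predictions with false positives.
scores = [0.9, 0.4, 0.3, 0.8, 0.2, 0.1, 0.7, 0.6]
labels = [True, False, False, False, False, False, True, False]
strict = precision_recall(scores, labels, 0.85)  # high precision, low recall
loose = precision_recall(scores, labels, 0.35)   # high recall, low precision
```

Without the one-per-sentence constraint in the model, there is no threshold that keeps both quantities high at once.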
We compute PCA on the substituted words, and it mostly catches the correlations between the features.
For variation
Are substitutions mainly among synonyms or neighbours in the FA graph? It seems not (see distance.ipynb), which is consistent with the fact that this situation is much more complex than what happens in random or chosen lists of words. So we really don't try to predict the appearing word among the synonyms of the source word.
Regressing the new word's features (again after feature selection, showing the role of each feature): predict the value of one feature based on the source features
We compute PCA on the variations, and show the evolution of the meta-features, but the number of words included is greatly reduced (because it's only the words that have all features defined), which weakens the case for PCA.
Add a reference "This work has also been partially supported by the French National Agency of Research (ANR) through the grant Algopol (ANR-12-CORD-0018)"
Find a better title (Gureckis: "if we copied/pasted this study couldn't exist!")
Fix first sentence of the introduction. I can't find a better formulation.
Rewrite abstract
Check the whole text/flow/definitions for clarity against Gureckis' edited pdf version
Don't use in vivo / in vitro, or properly define it
Rename "orthographical" to "orthographic" in all figures and notebooks
Cite software colophon used, at the end: Python, numpy/scipy, pandas, statsmodels, jupyter, etc.
Decide whether or not to have Supplementary Information. No.
The review from Cognitive Science is synthesized into 6 main points in the Cog. Sci. Review wiki page. That page also contains links to the issues tracking each of the 6 points, and reviewer-by-reviewer syntheses with more details.
Things that have changed
Here is the list of things that have changed, for use in the cover letter.
A number of values in the paper have changed (the tracking issue for that is #12)
Clustering coefficient values, since FA link weights are now taken into account in their computation (so it's computed on the undirected weighted graph)
An update in the language detection module has changed the cluster filtering a little, giving us a few more clusters and quotes than before. As a result of this, the number of words coded by Word Frequency has also changed (since frequency of words is computed on the filtered data set). (Details in #12.)
The discovery of three bugs, and the improvement of substitution filtering, led us to gain many more substitutions than previously (again, details in #12). All in all, the code is now much more reliable, as it has unit tests covering nearly all of it. Language, cluster, and substitution filtering are also controlled by precision/recall analyses.
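For illustration, the weighted clustering-coefficient computation looks like this in networkx (toy graph, made-up weights; passing weight='weight' selects the weighted coefficient, which is what changed the reported values):

```python
import networkx as nx

# Toy undirected weighted graph standing in for the FA network.
g = nx.Graph()
g.add_weighted_edges_from([('a', 'b', 0.5), ('b', 'c', 0.2),
                           ('a', 'c', 0.9), ('c', 'd', 0.1)])

# Unweighted clustering only counts triangles; the weighted variant
# discounts triangles with weak links, so the values shift.
unweighted = nx.clustering(g)
weighted = nx.clustering(g, weight='weight')
```

Node 'a' sits in one triangle, so its unweighted coefficient is 1.0, while its weighted coefficient drops below that because the b–c link is weak.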
Introduction has been rewritten from scratch to better explain our goals (→ synthesis point 1). It should be clearer and easier to follow thanks to more examples.
Related work has also been mostly rewritten to incorporate the literature we had missed in the first submission (→ synthesis point 2). In particular, work in psycholinguistics on lists of words has been thoroughly reviewed, and work on iterated learning experiments (Kirby) has been integrated into the whole discussion.
The overall writing and phrasing in the whole paper has received a lot of attention (→ synthesis point 1, and criticisms of bad writing)
The initial parts of Methods have been expanded to better explain some choices which seemed arbitrary (→ synthesis points 4 and 6)
The set of features used has been expanded (with orthographic and phonological neighbourhood densities, and number of letters), and the way features are selected has been greatly improved and rationalized (→ synthesis point 3)
The demand for word-word metrics (rev. 2) is partially met by the addition (further down) of H00, and a short discussion of distances travelled by substitutions
The section on Substitution model has been expanded to better explain the work done, and show the robustness of results (→ synthesis point 4)
The possible bias from focusing on single-substitutions has been addressed by extending substitution models to the two-substitution case. The results are unchanged (and available in the code repository). (→ synthesis point 6)
Susceptibility has been much better defined, with respect to a null hypothesis. It is indeed not a probability of substitution (as was questioned by rev. 1), and now reflects a bias with respect to random picking of targets. As a result, our conclusions for that measure have also been updated. A section analysing POS susceptibilities was also added (→ synthesis point 3)
Variation is now compared to an additional null hypothesis, H00, based on random selection in synonyms of the disappearing word. The interaction between features is also (partly) addressed with an all-feature regression. (→ synthesis point 3)
Both susceptibility and variation have been extended to analyse Sentence context (→ synthesis point 3)
A whole Discussion section has been added to recontextualise the results (→ synthesis points 1 and 2)
The (indeed exaggerated) claims about "convergence" have been revised (→ synthesis point 5)
Things we did not do
Cross-feature interactions are combinatorially explosive, and not the goal of our work. We explored many directions to little avail, and what works is shown in the paper. In particular:
PCA (with or without reconstruction of missing values) gives hard-to-interpret results
ANOVA explodes combinatorially (between global feature values, sentence-relative feature values, and all their interactions), and there is no guiding question to reduce the dimensionality
Regression of susceptibility gives very unreliable results (because the constraints of the problem don't fit in the model)
Regression of variation does give some insight, and is what we show in the paper
We didn't try to do word-based exact predictions (i.e. without features). This could have been (a) predicting which word is substituted, or (b) predicting which word appears instead. (a) follows from the association strength between the words of the initial sentence and the word predicted by (b), but (b) is a research program in itself:
Our data set is not well suited to computing LSA/LDA, because it contains groups of very similar documents: the associations extracted will most likely reflect this, i.e. they will be between words of the same quotation families. That's not informative for substitutions (we want associations from other families to inform the family we look at).
Even in controlled settings and on lists of random words (i.e. lists not designed to trigger intrusions as in the Deese-Roediger-McDermott paradigm, but still with no syntax involved), the state of the art does not predict the new word (Zaromb et al. 2006); instead, it predicts the list from which the new word comes. Now (b) means predicting the new word in real-world sentences, so it's two big jumps from what exists.
The data is again badly structured for prediction, since there are only a few measurements on many varied cases (each case, i.e. each source sentence, has one prediction, and there are only a few measurements per source sentence), instead of many measurements on a few cases; this makes prediction prone to errors. This is explained in the paper.