Giter Site home page Giter Site logo

austin's Issues

Problem in installing austin

Hi. I am trying to install "austin" package to R 3.4.2, but I receive the following message:

Warning in install.packages :
package ‘austin’ is not available (for R version 3.4.2)

Is the package available only on very old versions of R?
I think it is not given the recent postings on this git hub.
Can you please help me with this problem shooting?
Thank you!

Run out of memory

When importing a documents of 40'000 row, the system crash when running wordfish.

Error: vector memory exhausted (limit reached?)

svd error when trying to run wordfish

I have a 2x900 csv with doc_id and text and try to run wordfish on it to see if I can identify some specific term.

# Load data
data = read.csv("all.mbox.csv", stringsAsFactors = FALSE)
meta = read.csv("all.mbox.meta.csv", stringsAsFactors = FALSE)

mycorpus = Corpus(DataframeSource(data))

# Prepare data
mycorpus = tm_map(mycorpus, content_transformer(tolower))
mycorpus = tm_map(mycorpus, removeWords, stopwords("english"))
mycorpus = tm_map(mycorpus, stemDocument)
mycorpus = tm_map(mycorpus, removePunctuation)
mycorpus = tm_map(mycorpus, removeNumbers)

# Make a document term matrix
dtm = DocumentTermMatrix(mycorpus)
# remove rare words
dtm = removeSparseTerms(dtm, sparse = 0.4)
inspect(dtm)

# Run word fish
dtm$dimnames$Docs
wfm = as.wfm(dtm)
wf = wordfish(wfm, dir=c(2, 34), control = list(tol = .00003))

I run in this error with the last line:

Error in svd(ystarm, nu = 1) : infinite or missing values in 'x'

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.