Giter Site home page Giter Site logo

Comments (3)

taspinar avatar taspinar commented on July 17, 2024

Hi @patrickdundas
The scraper stops scraping when 'at least' the no_limit number of tweets have been scraped.
The infinite scroll of Twitter updates the page with batches of 20 tweets at a time.

So when you set the limit to 30, twitterscraper will stop scraping after two batches of 20 tweets, hence resulting in 40 scraped tweets.

If it a big issue, I can adapt the filewriter so that only the first no_limit number of tweets are written to file and the remaining tweets are disregarded.

from twitterscraper.

patrickdundas avatar patrickdundas commented on July 17, 2024

It would definitely be nice for there to be a way to just grab a certain amount of tweets, rather than a certain amount of batches. Since that's the case, wouldn't it make more sense for there to be a "true limit" flag which would just give a specific amount of tweets and a "batch limit" flag, which would use the current batch behavior of --limit? Such as --batches and --limit, or --limit (as it as now) and --tweets or --truelimit (which would download x amount of tweets)? I think that the way the documentation is worded right now makes people assume that --limit actually just gives you that number of tweets, rather than an unpredictable number of batches. Thanks very much for the clarification, but I think either the documentation on --limit needs to be reworded, or there needs to be a better way to grab a specific amount of tweets.

from twitterscraper.

taspinar avatar taspinar commented on July 17, 2024

Hi @patrickdundas ,
Is the current explanation in the readme sufficient?

from twitterscraper.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.