Giter Site home page Giter Site logo

Comments (20)

mfb avatar mfb commented on July 29, 2024

There is now firefox headless https://mykzilla.org/2017/08/30/headless-firefox-in-node-js-with-selenium-webdriver/ or I guess more popularly, chrome headless.

from phantom-of-the-capitol.

j-ro avatar j-ro commented on July 29, 2024

from phantom-of-the-capitol.

wioux avatar wioux commented on July 29, 2024

Do we still need to support webkit/waitir? REQUIRES_WAITIR is empty and all the bioguide ids from REQUIRES_WEBKIT are house members so we can clear that out, but I'm not sure what the need for the alternative drivers was originally and whether it might come up again. We could really simplify parts of the app if we removed support for those drivers.

from phantom-of-the-capitol.

j-ro avatar j-ro commented on July 29, 2024

I think that's probably fine over here, yeah...

from phantom-of-the-capitol.

 avatar commented on July 29, 2024

https://github.com/GoogleChrome/puppeteer

from phantom-of-the-capitol.

j-ro avatar j-ro commented on July 29, 2024

has anyone started work on this?

from phantom-of-the-capitol.

wioux avatar wioux commented on July 29, 2024

Not yet @j-ro.

from phantom-of-the-capitol.

j-ro avatar j-ro commented on July 29, 2024

Thanks @wioux, us either, though it's starting to become more important for us. I'll let you know if it lands on my roadmap. Can you do the same, so we don't duplicate work?

from phantom-of-the-capitol.

wioux avatar wioux commented on July 29, 2024

Definitely, I'll let you know.

from phantom-of-the-capitol.

j-ro avatar j-ro commented on July 29, 2024

We're actually doing a bit of initial investigation work on this today, maybe tomorrow too. We'll let you know how it works. There may be just a drop-in replacement that works with capybera, if so, will be fairly easy....

from phantom-of-the-capitol.

j-ro avatar j-ro commented on July 29, 2024

Update here -- we have chromedriver running, but it's probably not quite ready for prime time. It works, but seeing some hard to debug timeout errors, and it's missing some features like blacklists. We're going to run it as an optional switch for certain yamls since it helps in some cases, but we're not going to entirely switch. If there's large appetitive for the code we can put together a PR, but it's very much a WIP.

from phantom-of-the-capitol.

k-stewart avatar k-stewart commented on July 29, 2024

Hey @j-ro, this is becoming more important for us. Have you found a solution you like?

from phantom-of-the-capitol.

j-ro avatar j-ro commented on July 29, 2024

No, we're still with phantom. Chromedriver works but not as consistently, and it doesn't have many hooks and options to debug and tune. We haven't looked at it since January, maybe that's changed, but we're not planning a switch.

from phantom-of-the-capitol.

k-stewart avatar k-stewart commented on July 29, 2024

Ok, thanks for the insight. I'll see if anything's changed since then.

from phantom-of-the-capitol.

j-ro avatar j-ro commented on July 29, 2024

Worth a shot -- it didn't really take us very long at all to drop in Chromedriver -- the hard part was getting it to work reliably.

from phantom-of-the-capitol.

 avatar commented on July 29, 2024

I'll chime in with my experience as I have worked with puppeteer, and phantomjs, and various selenium webdriver implementations like chromedriver and geckodriver.
Puppeteer provides a high level API that is quite easy to work with for basic scraping. They publish extensive documentation as well. If needing to get something done quick, I think this is a strong contender. It is a JavaScript only API as far as I know.
Selenium webdriver implementations give you more flexibility with the browser you run the automation in but require more programming and configuration to get working. The API is also implemented in different programming languages. Firefox's headless documentation also recommends using selenium webdriver for testing automation.

from phantom-of-the-capitol.

 avatar commented on July 29, 2024

Just discovered @k-stewart 's work in #141 as well.

from phantom-of-the-capitol.

wioux avatar wioux commented on July 29, 2024

Hi @efx. Our contact-congress work has moved over to EFForg/congress_forms_api to fix this and other issues. Sorry we didn't properly archive this repo -- I'm going to do that now.

from phantom-of-the-capitol.

 avatar commented on July 29, 2024

Thanks @wioux. I had found this repository from EFF's homepage, so we should probably update those link(s) as well.

from phantom-of-the-capitol.

danielmroberts avatar danielmroberts commented on July 29, 2024

Hi @efx. Our contact-congress work has moved over to EFForg/congress_forms_api to fix this and other issues. Sorry we didn't properly archive this repo -- I'm going to do that now.

This repo is still not archived. We were about to roll out a system we have been working on for a while based on phantom of the capitol before noticing your comment :(

from phantom-of-the-capitol.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.