Giter Site home page Giter Site logo

yusuzech / r-web-scraping-cheat-sheet Goto Github PK

View Code? Open in Web Editor NEW
385.0 22.0 104.0 2.96 MB

Guide, reference and cheatsheet on web scraping using rvest, httr and Rselenium.

License: MIT License

R 100.00%
rvest rselenium httr scrape-websites webscraping r web-scraping cheatsheet

r-web-scraping-cheat-sheet's People

Contributors

sun-wenjun avatar victorcaquilpan avatar yusuzech avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

r-web-scraping-cheat-sheet's Issues

Some notes on using rsDriver

It seems that most folks out there are interfacing rSelenium with a third-party program called "Docker" and then using the remoteDriver() function rather than the rsDriver() function. rsDriver therefore has less documentation and support. I ran into some problems getting started with the rsDriver method that I eventually resolved, so I thought I'd share my fixes here.

  1. I had to install the wdman library. This is a version control package that installs all the latest rSelenium dependencies.
  2. The rsDriver() function defaults to using the latest version of Google Chrome, which is a beta version. Simply installing the beta version of Chrome on my computer unfortunately didn't get the job done, because the function uses the file path for the stable version, not for the beta version. I had to specify my Chrome version using the chromever argument.
  3. When troubleshooting rsDriver(), it sometimes executed only halfway. It opened the server port, but it didn't fully initialize the server. If I tried to run the function again, I would get an error message saying the port was already in use. This can be resolved by restarting RStudio.
  4. When I attempted to use driver <- rD$client, I got an error message saying I was trying to run a non-function. I resolved this by using driver <- rD[["client"]] instead. To stop the server, use rD[["server"]]$stop() instead of rD$server$stop().

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.