words-sdsc / coursera Goto Github PK
View Code? Open in Web Editor NEWData sets and scripts for Coursera Big Data Specialization.
Home Page: https://www.coursera.org/specializations/bigdata
Data sets and scripts for Coursera Big Data Specialization.
Home Page: https://www.coursera.org/specializations/bigdata
The path for the csv files is hardcoded.
I'm opening this issue but I already have the fix for this. I'm checking how can I generate a pull request for it
after clicking the download button, web browser displays: Unable to connect
Firefox cannot establish a connection to the server at raw.githubsercontent.com
besides, coursera-master.zip can be downloaded in wiondows operation system, but how to share the downloaded file into VBox?
The following error is reported when running big-data-4/setup.sh:
ERROR: certificate common name "ssl468914.cloudflaressl.com" doesn't match requested host name "repo.continuum.io". To connect to repo.continuum.io insecurely use '--no-check-certificates'. bash : Anaconda3-4.0.0-Linux-x86_64.sh: No Such file or directory Run 'source ~/.bashrc' to complete setup.
It is showing the zip file exists but is "not directory". It also shows that it is unable to process it.
Hello,
I have the following error while trying to execute lines.count() :
Python in worker has different version 2.6 than that in driver 3.5, PySpark cannot run with different minor versions
Do you have a solution to fix it ?
Working on cloudera VM for windows, course 3, week1, trying to set up spark with jupyter notebooks.
After installing scripts and supposedly anaconda with jupyter support (./setup.sh) and after sourcing bashrc, while trying to run pyspark
I am getting error:
env: jupyter: No such file or directory
I tried to install some components manually but it didn't help. Does anyone know how to go through this step?
My main machine is running Ubuntu 17.10, I could not run the VM described in course, so I had a few alternatives:
thanks
$ pyspark
jupyter: '/home/cloudera/anaconda3/bin/find_spark_home.py` is not a Jupyter command
'/home/cloudera/anaconda3/bin/pyspark: line 24: /bin/load-spark-env.sh: no such file or directory
'/home/cloudera/anaconda3/bin/pyspark: line 77: /bin/spark-submit: No such file or dir
'/home/cloudera/anaconda3/bin/pyspark: line 77: exec: /bin/spark-submit: cannot execute: No such ...
While trying to run through the Coursera course Machine Learning with Big Data
I am working around this by just properly setting up a seperate environment.. but thought I should warn you, the VM as is doesn't work for PySpark. Also, it is a security risk to be using such an old VM.. I recommend performing a $ sudo yum update
in the instructions OR recommend a newer VM image.
Coursera
Hi,
For those of us having these errors, it's impossible to move forward with the installation, at least not following Coursera's deprecated instructions.
I've done the following in the same VM where I downloaded the big-data-3 datasets, and it will allow us to install Anaconda (and therefore Junyper) and actually do the requested tasks using Junyper.
Steps:
Download the Official Anaconda Installer https://repo.anaconda.com/archive/Anaconda3-2019.10-Linux-x86_64.sh
go to the directory where it's located, in my case:
cd Downloads
install it:
./Anaconda3-2019.10-Linux-x86_64.sh
NOTE: You will notice that you can see the text, prompts and notifications you were supposed to see during a normal installation.
close current terminal, open a new one and execute pyspark as requested by the exercise's instructions.
Now you should be able to enter in Juypyter Notebook and do tasks on http://localhost:8889/
That's it, I hope it helps.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.