A list of resources and tools for data science and workstation setup
- fish shell
- Spacefish: the fish shell prompt for astronauts
- Starship: successor to Spacefish
- Node version manager wrapper for Fish shell
- Learning SQL
- Automate the Boring Stuff
- Effective Python
- Modern Pandas
- Learn Apache Spark
- Spark: The Definitive Guide. A follow-up to Learning Spark.
- Advanced Analytics with Spark. A great Spark book focusing on data science.
- Learning PySpark. A great book for getting started with Apache Spark.
- Official Spark Documentation
- scikit-learn Documentation. scikit-learn is a general-purpose machine learning library for Python, built on top of NumPy. It provides many utilities for pre- and post-processing data and is used to build traditional (non-deep-learning) models.
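As a rough illustration of the pre-processing and traditional-model workflow that the scikit-learn entry describes, here is a minimal sketch. The dataset (the bundled iris data), the choice of `StandardScaler` and `LogisticRegression`, and the split parameters are all assumptions picked for the example, not recommendations from the resources above.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Load a small dataset that ships with scikit-learn (assumed for illustration).
X, y = load_iris(return_X_y=True)

# Hold out a quarter of the rows for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Chain a pre-processing step (feature scaling) with a traditional model.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)

# Mean accuracy on the held-out rows.
accuracy = model.score(X_test, y_test)
print(f"Test accuracy: {accuracy:.2f}")
```

Wrapping the scaler and the classifier in one pipeline means the scaling statistics are learned from the training rows only, so the held-out score is not contaminated by the test data.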
TensorFlow is one of the most widely used deep learning libraries in production. It offers automatic differentiation, which makes backpropagation straightforward and lets you build almost any machine learning model. Keras is a high-level API built on TensorFlow. It is user-friendly and lets you build and test simple or complex neural networks quickly, with minimal code. It is also modular: everything in Keras can be represented as a module.
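To make the two ideas above concrete, the following sketch shows TensorFlow's automatic differentiation on a one-variable function, then a small Keras model assembled from layer modules. The synthetic data, layer sizes, and training settings are assumptions chosen only to keep the example short; it assumes TensorFlow 2.x is installed.

```python
import numpy as np
import tensorflow as tf
from tensorflow import keras

# Automatic differentiation: record operations on a tape, then ask for the gradient.
w = tf.Variable(3.0)
with tf.GradientTape() as tape:
    loss = w ** 2
grad = tape.gradient(loss, w)  # d(w^2)/dw = 2w

# Synthetic binary classification data (hypothetical, for illustration only).
rng = np.random.default_rng(0)
X = rng.normal(size=(256, 4)).astype("float32")
y = (X.sum(axis=1) > 0).astype("float32")

# Keras models are built by stacking modular layers.
model = keras.Sequential([
    keras.Input(shape=(4,)),
    keras.layers.Dense(8, activation="relu"),
    keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# fit() runs backpropagation under the hood using the same autodiff machinery.
history = model.fit(X, y, epochs=5, batch_size=32, verbose=0)
print("Gradient at w=3:", float(grad.numpy()))
print("Final training loss:", history.history["loss"][-1])
```

The first half is what Keras hides from you: `model.fit` computes gradients of the loss with respect to every layer weight in the same way the tape computes the gradient with respect to `w`.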