Giter Site home page Giter Site logo

training's Introduction

Training resources

Training materials related to data science, artificial intelligence and bioinformatics. Resources which are not available for free are marked ($). You can find links to organizations which provide physical courses (in physicalcourses.md) and links to data sources (in datasources.md). Distance courses by Swedish universities which require official registration are listed in SwedishUniDistanceCourses.md.

Here is a suggested learning path for getting started in data science. Resources are below:

  1. Install Anaconda and get familiar with its main functions and jupyter notebooks. Alternatively, if your own computer is limited, get familiar with Google colab.

  2. Learn Python basics

  3. Get familiar with the main functions of python tools needed for data processing and scientific computing: regular expressions, numpy, pandas

  4. Get familiar with the basics of data visualization: matplotlib

  5. Get a conceptual understanding of the core principles of machine learning and deep learning

  6. Get a basic understanding of the main machine learning libraries: pytorch, keras

  7. Familiarize yourself with the concepts and tools of data science reproducibility: git, FAIR principles

  8. Familiarize yourself with the main concepts and tools in your main area of interest, e.g. image analysis, nlp

  9. Try solving specific tasks you are interested in, e.g. from your research project or daily life, using machine learning, and just continue learning the things that are required to solve these tasks.

General data science and programming basics

Anaconda installation

https://www.datacamp.com/community/tutorials/installing-anaconda-windows

Jupyter notebooks

https://www.datacamp.com/community/tutorials/tutorial-jupyter-notebook

Markdown basics

https://guides.github.com/features/mastering-markdown/

Google Colab

https://web.eecs.umich.edu/~justincj/teaching/eecs498/FA2020/colab.html

Software engineering best practices

https://www.pythonlikeyoumeanit.com/Module5_OddsAndEnds/Writing_Good_Code.html

https://scikit-learn.org/stable/developers/contributing.html

Hardware recommendations

https://blog.slavv.com/picking-a-gpu-for-deep-learning-3d4795c273b9

https://timdettmers.com/2019/04/03/which-gpu-for-deep-learning/

Books

How to think like a data scientist

https://runestone.academy/runestone/books/published/httlads/index.html

An Introduction to Statistical Learning by James, Witten, Hastie, Tibshirani

http://statlearning.com/

Book collections

Scientific book collection by Springer, many machine learning books included

https://towardsdatascience.com/springer-has-released-65-machine-learning-and-data-books-for-free-961f8181f189

Runestone Interactive

https://runestoneinteractive.org/pages/library.html

Courses

Data8 The Foundations of Data Science course

http://data8.org/

CS109A: Introduction to Data Science

https://harvard-iacs.github.io/2018-CS109A/

CS109B: Advanced Topics in Data Science from Harvard

https://harvard-iacs.github.io/2018-CS109B/

Blogs

https://jvns.ca/

https://www.analyticsvidhya.com/blog/

http://jakevdp.github.io/

Podcasts

https://dataskeptic.com/

Other resources

Stackoverflow forum

https://stackoverflow.com/

FAIR principles and reproducibility

General resources

https://nbis-reproducible-research.readthedocs.io/en/latest/

https://github.com/IFB-ElixirFr/IFB-FAIR-bioinfo-training

https://the-turing-way.netlify.app/welcome.html

Version control and Git

https://try.github.io/

https://the-turing-way.netlify.app/reproducible-research/vcs.html#rr-vcs

https://swcarpentry.github.io/git-novice/

Python

Official Python documentation

https://docs.python.org/3.7/

PEP8 python style guide

https://www.python.org/dev/peps/pep-0008/#tabs-or-spaces

Google Python style guide

https://google.github.io/styleguide/pyguide.html

ipython

http://ipython.org/

scipy

https://scipy.org/

numpy

http://www.numpy.org/

pandas

http://pandas.pydata.org/

matplotlib

https://matplotlib.org/

scikit-learn

https://scikit-learn.org/

scikit-image

https://scikit-image.org/

Courses

Python courses by University of Michigan on coursera or edx

https://www.coursera.org/specializations/python

https://www.edx.org/bio/charles-severance

Codecademy Python course

https://www.codecademy.com/learn/learn-python

Analytics Vidhya Python course

https://courses.analyticsvidhya.com/courses/introduction-to-data-science

Google's Python class

https://developers.google.com/edu/python/

Google's Python Crash Course on Course

https://www.coursera.org/learn/python-crash-course

Corey Schaefer's Python Programming Beginner Tutorials

https://www.youtube.com/playlist?list=PL-osiE80TeTskrapNbzXhwoFUiLCjGgY7

Dataquest Data Analyst path (some free, some $)

https://www.dataquest.io/path/data-analyst/

Books

Python for Everybody: Exploring Data In Python 3 by Charles Severance

https://www.py4e.com/book

Learn Python the Hard Way by Zed Shaw

https://learnpythonthehardway.org/python3/

Programming Python, 4th Edition by Mark Lutz ($)

http://shop.oreilly.com/product/9780596158118.do

Learning Python, 5th Edition by Mark Lutz ($)

http://shop.oreilly.com/product/0636920028154.do

A Whirlwind tour of Python by Jake VanderPlas

for people familiar with programming

https://github.com/jakevdp/WhirlwindTourOfPython

Python Data Science Hanbook by Jake VanderPlas

https://github.com/jakevdp/PythonDataScienceHandbook

Scientific Computing with Python 3 by Claus Führer, Jan Erik Solem, Olivier Verdier ($)

https://www.oreilly.com/library/view/scientific-computing-with/9781786463517/

How to think like a computer scientist

https://runestone.academy/runestone/books/published/thinkcspy/index.html

Foundations of Python Programming

https://runestone.academy/runestone/books/published/fopp/index.html

Exercises

CS109 Homework 1. Exploratory Data Analysis

https://nbviewer.jupyter.org/github/cs109/2014/blob/master/homework/HW1.ipynb

Other resources

List of Python learning resources

https://forums.fast.ai/t/recommended-python-learning-resources/26888

Python NumPy tutorial

http://cs231n.github.io/python-numpy-tutorial/

Scipy tutorial

https://docs.scipy.org/doc/scipy/reference/tutorial/

Matplotlib tutorial

https://nbviewer.jupyter.org/github/jrjohansson/scientific-python-lectures/blob/master/Lecture-4-Matplotlib.ipynb

Pandas tutorials

https://pandas.pydata.org/pandas-docs/stable/user_guide/10min.html

http://www.gregreda.com/2013/10/26/intro-to-pandas-data-structures/

https://www.analyticsvidhya.com/blog/2014/08/baby-steps-python-performing-exploratory-analysis-python/

https://www.analyticsvidhya.com/blog/2014/09/data-munging-python-using-pandas-baby-steps-python/

https://pandas.pydata.org/pandas-docs/stable/getting_started/tutorials.html

Lectures notes on Python

https://github.com/jrjohansson/scientific-python-lectures/tree/master/

https://github.com/NBISweden/workshop-python/tree/ht18

https://github.com/mgalardini/2016_python_course/blob/master/notebooks/%5B0%5D-Introduction_to_Jupyter_Notebook.ipynb

Peter Norvig's python training examples

https://github.com/norvig/pytudes#pytudes-index-of-jupyter-ipython-notebooks

julia

https://julialang.org/

R

Courses

https://www.codecademy.com/learn/learn-r

Regular expressions

https://docs.python.org/3.6/library/re.html

https://docs.python.org/3/howto/regex.html

https://www.youtube.com/watch?v=DRR9fOXkfRE&feature=youtu.be

https://regexr.com/

https://regexone.com/

https://www.analyticsvidhya.com/blog/2015/06/regular-expression-python/

https://developers.google.com/edu/python/regular-expressions

https://www.debuggex.com/cheatsheet/regex/python

AI & Machine Learning & Deep Learning

Courses

CS229 Machine learning course from Stanford

https://www.youtube.com/watch?v=PPLop4L2eGk&list=PLLssT5z_DsK-h9vYZkQkYNWcItqhlRJLN

http://cs229.stanford.edu/syllabus.html

CS221 Artificial Intelligence course from Stanford

https://stanford-cs221.github.io/autumn2019/

CS230 Deep Learning course from Stanford

https://cs230.stanford.edu/

CS188 Introduction to Artificial Intelligence from Berkeley

https://inst.eecs.berkeley.edu/~cs188/fa20/

https://inst.eecs.berkeley.edu/~cs188/fa18/

CS294-158-SP20 Deep Unsupervised Learning from Berkeley

https://sites.google.com/view/berkeley-cs294-158-sp20/home

CSC321 Neural Networks and Machine Learning from University of Toronto

https://www.cs.toronto.edu/~lczhang/321/index.html

Machine Learning course from VU University in Amsterdam

https://mlvu.github.io/

https://www.youtube.com/watch?v=-pve3oIvxa8&index=1&list=PLCof9EqayQgupldnTvqNy_BThTcME5r93

Fast.ai courses

www.fast.ai

Material from Andreas Mueller's courses

https://github.com/amueller

MIT Deep Learning and Artificial Intelligence Lectures

https://deeplearning.mit.edu/

Deep RL Bootcamp (2017)

https://sites.google.com/view/deep-rl-bootcamp/lectures

Full Stack Deep Learning Bootcamp

https://course.fullstackdeeplearning.com/

Less technical courses

https://www.elementsofai.com/

https://app.ai-cursus.nl/home

Tutorials for major libraries

Official Pytorch tutorial

https://pytorch.org/tutorials/beginner/nn_tutorial.html

Books

Machine learning book by Hal Daumé III

http://ciml.info/

Deep Learning by Ian Goodfellow, Yoshua Bengio, and Aaron Courville

https://www.deeplearningbook.org/

Neural Networks and Deep Learning by Michael A. Nielsen

http://neuralnetworksanddeeplearning.com/

Introduction to Deep Learning by Eugene Charniak ($)

https://mitpress.mit.edu/books/introduction-deep-learning

Deep Learning with Python by François Chollet

https://www.manning.com/books/deep-learning-with-python

Artificial Intelligence: A Modern Approach by Stuart Russell and Peter Norvig ($)

http://aima.cs.berkeley.edu/

Hands-on Machine Learning with Scikit-Learn, Keras and TensorFlow by Aurélien Géron ($)

https://www.oreilly.com/library/view/hands-on-machine-learning/9781492032632/

Notebooks for book exercises: https://github.com/ageron/handson-ml2

Reinforcement Learning, An Introduction by R. Sutton & A.G. Barto

http://incompleteideas.net/sutton/book/the-book-2nd.html (draft)

Artificial Intelligence: Foundations of Computational Agents (2nd Edition) by David L. Poole and Alan K. Mackworth

https://artint.info/2e/html/ArtInt2e.html

Machine Learning Yearning: Technical Strategy for AI Engineers, In the Era of Deep Learning by Andrew Ng

https://www.deeplearning.ai/machine-learning-yearning/

Blogs

colah's blog

http://colah.github.io/

Andrej Karpathy's blog

Towards Data Science

https://towardsdatascience.com/

https://towardsdatascience.com/intuitively-understanding-variational-autoencoders-1bfe67eb5daf

Overview over activation functions:

https://www.analyticsvidhya.com/blog/2020/01/fundamentals-deep-learning-activation-functions-when-to-use-them/

https://medium.com/@snaily16/what-why-and-which-activation-functions-b2bf748c0441

Other resources

NIPS 2016 Tutorial: Generative Adversarial Networks by Ian Goodfellow

https://arxiv.org/abs/1701.00160

https://www.youtube.com/watch?v=AJVyzd0rqdc

AI Lund tv: videos from seminars and workshops @ Lund University

http://ai.lu.se/tv/

Pytorch tutorial by Jeremy Howard

https://pytorch.org/tutorials/beginner/nn_tutorial.html

Reports on business and societal impact of AI by McKinsey

https://www.mckinsey.com/featured-insights/artificial-intelligence

Reports on business and societal impact of AI by PWC

https://www.pwc.com/gx/en/issues/data-and-analytics/artificial-intelligence.html

Grad-Cam tutorial

https://www.pyimagesearch.com/2020/03/09/grad-cam-visualize-class-activation-maps-with-keras-tensorflow-and-deep-learning/

Key articles

Backpropagation https://www.nature.com/articles/323533a0

A Fast Learning Algorithm for Deep Belief Nets https://doi.org/10.1162/neco.2006.18.7.1527

Greedy layer-wise training of deep networks http://papers.nips.cc/paper/3048-greedy-layer-wise-training-of-deep-networks.pdf

Computer vision

Courses

Computer vision course from Stanford

http://cs231n.stanford.edu/

https://www.youtube.com/playlist?list=PL3FW7Lu3i5JvHM8ljYj-zLfQRF3EO8sYv

http://cs231n.github.io/

EECS 498-007 / 598-005: Deep Learning for Computer Vision from University of Michigan

https://web.eecs.umich.edu/~justincj/teaching/eecs498/FA2020/

https://www.youtube.com/playlist?list=PL5-TkQAfAZFbzxjBHtzdVCWE0Zbhomg7r

Books

Computer Vision: Algorithms and Applications by Richard Szeliski

http://szeliski.org/Book/

Computer Vision - A Modern Approach by David A. Forsyth and Jean Ponce ($)

Other resources

https://github.com/jbhuang0604/awesome-computer-vision

https://distill.pub/2017/feature-visualization/

https://distill.pub/2018/building-blocks/

https://ai.googleblog.com/2015/06/inceptionism-going-deeper-into-neural.html

https://github.com/jcjohnson/neural-style

Key articles

Backpropagation Applied to Handwritten Zip Code Recognition (LeNet) https://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=&ved=2ahUKEwibzejJ2_7rAhUKyoUKHfrkBqIQFjABegQIAhAB&url=http%3A%2F%2Fyann.lecun.com%2Fexdb%2Fpublis%2Fpdf%2Flecun-89e.pdf&usg=AOvVaw1V9weNdZgg_6oEcKcWmdXk

AlexNet https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf

VGG https://arxiv.org/pdf/1409.1556.pdf

GoogLeNet https://storage.googleapis.com/pub-tools-public-publication-data/pdf/43022.pdf

ResNet https://arxiv.org/pdf/1512.03385.pdf

NLP

Courses

CS224n NLP course from Stanford

http://web.stanford.edu/class/cs224n/

https://www.youtube.com/playlist?list=PLoROMvodv4rOhcuXMZkNm7j3fVwBBY42z

Fast.ai NLP course

https://github.com/fastai/course-nlp

https://www.youtube.com/playlist?list=PLtmWHNX-gukKocXQOkQjuVxglSDYWsSh9

Natural Language Processing from Coursera

https://www.coursera.org/learn/language-processing

Natural Language Processing from Berkeley

https://people.ischool.berkeley.edu/~dbamman/nlp20.html

Applied Natural Language Processing from Berkeley

https://people.ischool.berkeley.edu/~dbamman/info256.html

Applied Text Mining in Python from Univ. of Michigan/Coursera

https://www.coursera.org/learn/python-text-mining/home/welcome

Spacy course

https://course.spacy.io/

AllenNLP tutorials

https://allennlp.org/tutorials

Books

Speech and Language Processing by Dan Jurafsky and James H. Martin

https://web.stanford.edu/~jurafsky/slp3/

Coreference chapter: https://web.stanford.edu/~jurafsky/slp3/22.pdf

Natural Language Processing by Jacob Eisenstein

https://github.com/jacobeisenstein/gt-nlp-class/blob/master/notes/eisenstein-nlp-notes.pdf

A Primer on Neural Network Models for Natural Language Processing by Yoav Goldberg

u.cs.biu.ac.il/~yogo/nnlp.pdf

Introduction to Information Retrieval by Christopher D. Manning, Prabhakar Raghavan & Hinrich Schütze

https://nlp.stanford.edu/IR-book/html/htmledition/irbook.html

Natural Language Processing with PyTorch by Brian McMahan, Delip Rao ($)

https://www.oreilly.com/library/view/natural-language-processing/9781491978221/

Data and Text Processing for Health and Life Sciences by Francisco M. Couto

http://labs.rd.ciencias.ulisboa.pt/book/

Blogs

Introduction to Natural Language Processing for Text https://towardsdatascience.com/introduction-to-natural-language-processing-for-text-df845750fb63

https://ruder.io/

https://jalammar.github.io/

Peter Bloem Transformers from Scratch http://peterbloem.nl/blog/transformers

http://www.abigailsee.com/

https://medium.com/@joycex99

https://smerity.com/articles/articles.html

https://towardsdatascience.com/evaluating-text-output-in-nlp-bleu-at-your-own-risk-e8609665a213

Steps for effective text data cleaning (with case study using Python) https://www.analyticsvidhya.com/blog/2014/11/text-data-cleaning-steps-python/

Other resources

SciSpacy

https://github.com/allenai/scispacy

Python regular expressions documentation

https://docs.python.org/3/library/re.html

Tutorials about text cleaning

https://www.analyticsvidhya.com/blog/2014/11/text-data-cleaning-steps-python/

http://ieva.rocks/2016/08/07/cleaning-text-for-nlp/

https://chrisalbon.com/python/basics/cleaning_text/

http://rjweiss.github.io/text-iriss2013/

Tutorial about coreference resolution with neuralcoref

https://medium.com/huggingface/state-of-the-art-neural-coreference-resolution-for-chatbots-3302365dcf30

Tutorial for spacy

https://colab.research.google.com/github/DerwenAI/spaCy_tuTorial/blob/master/spaCy_tuTorial.ipynb#scrollTo=vGVwIzTJJjDR

Tutorial for Huggingface Tokenization

https://colab.research.google.com/github/huggingface/notebooks/blob/master/examples/token_classification.ipynb#scrollTo=vc0BSBLIIrJQ

Lars Juhl Jensen slideshare

https://www.slideshare.net/larsjuhljensen

Key articles

LSTM http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.676.4320&rep=rep1&type=pdf

SQL

Courses

Codecademy SQL course https://www.codecademy.com/learn/learn-sql

Bioinformatics

Elixir training

https://tess.elixir-europe.org/

NBIS course on single-cell RNASeq

https://nbisweden.github.io/workshop-scRNAseq/

Mathematics

List of statistics resources

https://jvns.ca/blog/2017/04/17/statistics-for-programmers/

Immersive Maths (interactive linear algebra book)

http://immersivemath.com

Courses

Computational Linear Algebra for Coders by Fast.ai

https://github.com/fastai/numerical-linear-algebra/

Books

Mathematics for Machine Learning

https://mml-book.github.io/

Applied Math and Machine Learning Basics chapter in Deep Learning book

https://www.deeplearningbook.org/contents/part_basics.html

Mathematical Methods for Physics and Engineering by Riley, Hobson, Bence

https://www.cambridge.org/se/academic/subjects/physics/mathematical-methods/mathematical-methods-physics-and-engineering-comprehensive-guide-3rd-edition?format=PB&isbn=9780521679718

Practice datasets

https://scikit-learn.org/stable/datasets/

Ethics

EU Ethics Guidelines for Trustworthy AI

https://ec.europa.eu/futurium/en/ai-alliance-consultation/guidelines#Top

Interesting talks

Multi-Task Learning in the Wilderness, Andrej Karpathy, Jun 15, 2019, ICML

https://slideslive.com/38917690/multitask-learning-in-the-wilderness

Trustworthy Human-Centric AI, Fredrik Heintz, 2020, Lund University

http://ai.lu.se/tv/trustworthy-human-centric-ai/

A conversation about AI risk and AI ethics in the age of covid-19, Jaan Tallinn and Olle Häggström

https://www.chalmers.se/en/centres/chair/news/Pages/webinar-19May2020.aspx

training's People

Contributors

sonjaaits avatar rafsanahmed avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.