The Python ecosystem for data
science
1.
Scipy Lecture Notes: http://www.scipy-lectures.org/
2.
Python scientific
computing ecosystem: http://www.scipy-lectures.org/intro/intro.html
3.
Fully-featured Scientific Python distributions:
a.
Anaconda: https://www.anaconda.com/download/
b.
EPD: https://store.enthought.com/downloads/
c.
WinPython: https://winpython.github.io/
4.
IDE for
Python: Spyder https://pythonhosted.org/spyder/
5.
Numpy: numerical computing with powerful numerical
arrays objects, and routines to manipulate
them. http://www.numpy.org/
Reference Guide: https://docs.scipy.org/doc/numpy/reference/index.html
6.
Scipy: high-level numerical routines. Optimization, regression, interpolation, etc http://www.scipy.org/
Reference Guide: https://docs.scipy.org/doc/scipy/reference/
7.
Matplotlib, Python 2D plotting library: https://matplotlib.org/
Pyplot: https://matplotlib.org/api/pyplot_summary.html
8.
Pandas, Python
Data Analysis Library (data input-output, basic statistics and graphics): http://pandas.pydata.org/
9.
Seaborn: statistical data visualization library based on matplotlib:
http://seaborn.pydata.org/
10. Scikit-learn, machine Learning in Python: http://scikit-learn.org/stable/
Manual: http://scikit-learn.org/stable/user_guide.html
Manual in pdf:
http://scikit-learn.org/0.20/_downloads/scikit-learn-docs.pdf
Tutorial: http://www.scipy-lectures.org/packages/scikit-learn/index.html#scikit-learn-chapter