The SBGrid Consortium and Piotr Sliz will host a special course on January 13th on Analysis and Visualization of Large Datasets with Python, led by former group member Ian Stokes-Rees, who currently works for Continuum Analytics. You can find Ian's profile at: https://www.linkedin.com/in/ijstokes.
This course has limited enrollment and a few spots are still available.
There is no fee to attend the course.
First come, first served.
Analysis and Visualization of Large Datasets with Python
Computational Scientist
Continuum Analytics
Course Details: For each topic we will cover at least the first half of sub-topics. The second half will only be covered if time permits.
Tools for working with Python effectively
- Anaconda and conda
- pip
- Jupyter Notebooks
- Spyder and PyCharm
- Dict
- List
- Tuple
- Class
- numpy.ndarray (multi-dimensional vectors)
- Record arrays
- pandas.dataframe
- CSV
- XLS
- JSON
- HDF5
- sqlite
- MySQL
- Mongo
- ORMs and SQLAlchemy
- Blaze ecosystem: dask, odo, distribute
- SciKit Learn (sklearn)
- statsmodels
- scipy
Core Python data structures
Fun with Pandas DataFrames
Fun with Bokeh for data visualization
Files on disk
Databases
Numerical methods on large datasets
Statistical & machine learning tools