Skip to content

Latest commit

 

History

History
148 lines (110 loc) · 5.8 KB

README.md

File metadata and controls

148 lines (110 loc) · 5.8 KB

Python Teaching Material

This repository contains per-session and per-session solution files used as learning material for the BSc Psychology "Big Data Analytics in Python" mini module.

All material is taught using the Jupyter Notebook web application and assumes access via the anaconda python distribution.

Files can be downloaded for local use using the clone or download toggle at the top of the repository.

All content has been produced and taught by Jodie Lord. With thanks to Becki Green and Sagar Jilka, King's College London.


Session 1

Session 1 folder contains:

  • Jupyter Notebook (Session1.ipynb) with learning material.
  • Image files which feed into the session1 .ipynb document.

Session 1 solutions folder contains:

  • Jupyter Notebook (session1_solutions.ipynb) with session 1 solutions included.
  • .hmtl file with content identical to .ipynb, but in read only format.

Session 1 content:

  • Accessing and understanding Python
  • Jupyter basics
  • Variable assignment and types
  • Basic arithmetic
  • Lists and tuples
  • Dictionaries

Session 1 learning objectives:

  1. Become familiar with the Jupyter environment
  2. Understand and use basic python syntax
  3. Grasp basic data structures


Session 2

Session 2 folder contains:

  • Jupyter Notebook (Session2.ipynb) with learning material.
  • Image files which feed into the session2 .ipynb document.
  • 2 .csv files for pandas related tasks within the session2 learning material.

Session 2 solutions folder contains:

  • Jupyter Notebook (Session2_Solutions.ipynb) with session 2 solutions included.
  • .hmtl file with content identical to .ipynb, but in read only format.

Session 2 content:

  • For loops
  • Python libraries
  • The pandas library, including:
    • Reading in files
    • Basic data descriptives
    • Filtering
    • Merging

Session 2 learning objectives:

  1. Grasp the concept of loops and how these can be used to iterate through data
  2. Understand what python libraries are and explore a key data science library: pandas
  3. Become familiar with some key techniques for working with data, e.g.:
  • Merging datasets
  • Working with missing data
  • Filtering and indexing
  • Aggregating data


Session 3

Session 3 folder contains:

  • Jupyter Notebook (Session3.ipynb) with learning material.
  • Image files which feed into the session3 .ipynb document.
  • 2 .csv files for pandas and seaborn related tasks.

Session 3 solutions folder contains:

  • Jupyter Notebook (Session3_Solutions.ipynb) with session 3 solutions included.
  • .hmtl file with content identical to .ipynb, but in read only format.

Session 3 content:

  • Python charting libraries
  • Tests of normality, including:
    • Skewness
    • Kurtosis
    • Sharipo-wilks
  • Basic statistics, including:
    • T-tests
    • ANOVA
    • Pearsons correlation

Session 3 learning objectives:

  1. Explore the world of charting in python via two key charting libraries:
  • MatPlotLib
  • seaborn
  1. Learn how to carry out some basic statistics to allow for data exploration.
  2. Combine learning from charting libraries and statistics to produce meaningful visualisations for statistics produced.


Session 4

Session 4 folder contains:

  • Jupyter Notebook (Session4.ipynb) with learning material.
  • Image folder which contains images which feed into the session4 .ipynb document.
  • 1 .csv file for charting and analytics revision tasks
  • 1 .xls file for machine learning tasks.

Session 4 solutions folder contains:

  • Jupyter Notebook (Session4_Solutions.ipynb) with session 3 solutions included.
  • .hmtl file with content identical to .ipynb, but in read only format.
  • Image folder which contains images which feed into the session4 .ipynb document.

Session 4 content:

  • Introduction to machine learning with application in:
    • Random Forest
    • K nearest neighbours
  • Recap of exercises and topics covered over the 4 weeks

Session 4 learning objectives:

  1. Grasp the basic concept of machine learning.
  2. Learn some (basic) machine learning techniques.
  3. Consolidate what we've learnt over the last 4 weeks.


EXTRA

Extra help documentation:

  • EXTRA_RESOURCES.pdf - contains:

    • Clarification with examples for FAQs / common confusion points expressed in f2f workshops
    • Extra resources to aid additional learning.
  • Revision_Exercises_2020 - contains:

    • A .ipynb file with revision exercises for topics covered across the 4 session materials.
    • Two data files and one image file which feed into tasks set within the revision exercises material.

[email protected]

jodielord.com