Data Science with Python

COM6018

Jon Barker

EO-OW-JT

Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

About me

  • Jon Barker (email address is j.p.barker@sheffield.ac.uk)
  • Professor in the Speech and Hearing Research Group
  • Research interests
    • Speech technology: speech recognition in real-world environments
    • Hearing science: hearing aids processing, speech perception, speech enhancement and speech intelligibility prediction
    • Music processing for hearing impairment
  • You can find out more on my webpage
Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

Data Science with Python

  • Why is this module called Data Science with Python
  • Data Science is a broad subject encompassing areas including statistics, machine learning, data visualisation, data mining, and data engineering (i.e., your entire degree!)
  • The module focuses on the with Python part of the title. We will be showing you how to use Python to do data science.
Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

COM6018 Course aims

The module has the following aims:

  • to provide an introduction to the Python programming language
  • to introduce a range of data analysis techniques using the latest Python tools and an understanding of how to apply the appropriate techniques for data analysis
  • to understand how to present data science clearly, rigorously and reproducibly

Note, the module focuses on practical data analysis and will be less theoretical than the 'Machine Learning and Adaptive Intelligence' module.

Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

Schedule

  • Lectures on Monday 15:00-17:00
    • Diamond Lecture Theatre 5
  • Labs on Tuesday 15:00-17:00
    • Diamond Computer Room 3
    • Bring Your Own Device

Teaching Weeks 1-5 and 7-11; Weeks 6 and 12 are 'Readings Weeks'

Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

Assessment

Two pieces of coursework:

  • Assessment 1 (40%)

    • Similar in style to the labs
    • Released start of Week 5
    • Due end of Week 6
  • Assessment 2 (60%)

    • A larger piece of work with a written report
    • Released start of Week 9
    • Due Wednesday of Week 12
Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

Feedback

  • Feedback during lab classes
  • Solutions will be provided for all lab classes
  • Feedback will be provided for both assignments
Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

Resources

Course content will include

  • slides (like these) in PDF format for each lecture,
  • tutorials,
  • lab classes,
  • lab class solutions in jupyter notebook format.
Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

Recommended reading

Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

Content Delivery

Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

Course structure

Part I

  • Week 1: Getting Started with Python
  • Week 2: Reading and Writing Data
  • Week 3: Numerical Computing with NumPy
  • Week 4: Processing Structured Data with Pandas
  • Week 5: Exploratory Data Analysis and Visualization

READING WEEK - Assignment 1

Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.

Course structure (continued)

Part II

  • Week 7: Introducing Scikit-Learn
  • Week 8: Classification with Scikit-Learn
  • Week 9: Classifier evaluation and statistical testing
  • Week 10: Regression with Scikit-Learn
  • Week 11: Unsupervised learning with Scikit-Learn

READING WEEK - Assignment 2

Copyright © 2023–2025 Jon Barker, University of Sheffield. All rights reserved.