Data Science Fundamentals
32 Hours
One Semester or equivalent
Hawthorn
Overview
This unit will give students a solid foundation in contemporary data science best practices using Python. It will cover a hands-on introduction to programming paradigms and fundamental data analysis techniques. Through examples involving real-world data, students will learn data cleaning and validation techniques, data transformation procedures, algorithm design, text analytics, and data visualisation techniques. Students will become familiar with important Python software modules such as Pandas, Matplotlib, and the Natural Language Toolkit (NLTK).
Requisites
Teaching periods
Location
Start and end dates
Last self-enrolment date
Census date
Last withdraw without fail date
Results released date
HE Block 5
Location
Hawthorn
Start and end dates
07-July-2025
17-August-2025
17-August-2025
Last self-enrolment date
07-July-2025
Census date
18-July-2025
Last withdraw without fail date
01-August-2025
Results released date
23-September-2025
Learning outcomes
Students who successfully complete this unit will be able to:
- Apply coherent and advanced knowledge of how to read, clean, and manipulate data sets.
- Critically evaluate existing toolkits, and learn how to construct custom algorithms when necessary.
- Identify research questions and create project outlines.
- Analyse data sets using basic statistics, visualisations, regression, and topic modelling.
Teaching methods
Hawthorn
Type | Hours per week | Number of weeks | Total (number of hours) |
---|---|---|---|
On-campus Class | 6.40 | 5 weeks | 32 |
Unspecified Activities Various | 9.83 | 12 weeks | 118 |
TOTAL | 150 |
Assessment
Type | Task | Weighting | ULO's |
---|---|---|---|
Assignment and Presentation 1 | Individual/Group | 30 - 50% | 1,2,3,4 |
Final Workbook | Individual | 10 - 20% | 1 |
Final Workbook | Individual | 10 - 20% | 1 |
Final Workbook | Individual | 10 - 20% | 1,2,4 |
Final Workbook | Individual | 10 - 20% | 1,3,4 |
Final Workbook | Individual | 10 - 20% | 4 |
Content
- Basic programming theory
- Data science best practices
- Data structures, access and usage
- Data cleaning and validation
- Data Visualisation
- How to validate results
- Working with Text data (Text Analysis)
- Data science tools
Study resources
Reading materials
A list of reading materials and/or required textbooks will be available in the Unit Outline on Canvas.