Unit rationale, description and aim

Data science is an inter-disciplinary area that employs scientific methods, algorithms, tools and systems for extracting insights, knowledge and value from data. Machine learning, as a core part of data science and data analytics, and a subfield of artificial intelligence, is the scientific study of algorithms and mathematical models that computer systems use to make decisions or predictions. Machine learning algorithms and models are widely used in human’s digital life such as email client, search engine, social media, virtual personal assistant and recommendation system, although machine bias is an important ethical concern of which many people are unaware. Python is one of the most popular programming languages with comprehensive libraries and tools for putting data science and machine learning into practice in an efficient manner.

This unit will cover fundamental concepts and theories of data science and machine learning with focus on their practical use and implementations. The issue of machine bias in machine learning and how it may have an adverse impact on the common good will be examined. The aim of the unit is to learn both theoretical and practical data science and machine learning techniques to build real-world data science and machine learning solutions.

2026 10

Campus offering

Find out more about study modes.

Unit offerings may be subject to minimum enrolment numbers.

Please select your preferred campus.

  • Term Mode
  • Semester 1Campus Attendance

Prerequisites

ITEC102 Python Fundamentals For Data Science

Learning outcomes

To successfully complete this unit you will be able to demonstrate you have achieved the learning outcomes (LO) detailed in the below table.

Each outcome is informed by a number of graduate capabilities (GC) to ensure your work in this, and every unit, is part of a larger goal of graduating from ACU with the attributes of insight, empathy, imagination and impact.

Explore the graduate capabilities.

Explain knowledge of data science libraries and to...

Learning Outcome 01

Explain knowledge of data science libraries and tools for data processing and analysis
Relevant Graduate Capabilities: GC1, GC10

Employ data science and machine learning preparat...

Learning Outcome 02

Employ data science and machine learning preparation skills, via key techniques learnt and the use of relevant tools
Relevant Graduate Capabilities: GC2, GC10

Implement a data science and machine learning appl...

Learning Outcome 03

Implement a data science and machine learning application with an appropriate choice of data science and machine learning techniques
Relevant Graduate Capabilities: GC2, GC8

Articulate the issue of machine bias and how it ma...

Learning Outcome 04

Articulate the issue of machine bias and how it may affect the common good
Relevant Graduate Capabilities: GC6, GC12

Content

Topics will include:

  • Overview of data science and its implementation life cycle and tools
  • Recap of data processing concepts and techniques
  • Exploratory data analysis in data science
  • Machine learning (ML) introduction
  • ML projects and basic linear algebra
  • Basic matrix analysis, dimensional reduction, and SVD, PCA
  • Basic classification and evaluation metrics
  • Regression (linear, polynomial), overfitting and regularization
  • Clustering: k-means and mixture of Gaussians
  • Better evaluation with k-fold cross validation and finetune model with grid search
  • Neural networks and deep learning
  • Machine bias in the real world and its impact on the common good

Assessment strategy and rationale

A range of assessment procedures will be used to meet the unit learning outcomes and develop graduate attributes consistent with University assessment requirements. The first assessment item consists of simple data and machine learning practical tasks. The purpose is to assess students’ practical data science and machine learning skills of Python data science and machine learning libraries and tools. The second assessment is a more specific image data exploration and machine learning preparation task that requires fundamental knowledge of data science and machine learning. The purpose is to assess students’ online engagement and understanding and practical skills in data preparation for machine learning algorithms and models. The final assessment is to conduct experiments with one machine learning algorithm e.g. classification. The purpose is to assess students’ machine learning practical skills and techniques with consideration of machine bias, building on the machine learning preparation task.

The assessments for this unit are designed to demonstrate the achievement of each learning outcome. To pass the unit, students must demonstrate achievement of every unit learning outcome, pass hurdle tasks, and obtain a minimum mark of 50% in graded units.

Overview of assessments

Assessment Task 1: Practical programming The firs...

Assessment Task 1: Practical programming

The first assessment consists of practicing simple Python data science and machine learning libraries. The assessment requires students to demonstrate their understanding and use of Python data science and machine learning libraries and tools.

Submission Type: Individual

Assessment Method: coding tasks

Artefact: Code

Weighting

30%

Learning Outcomes LO1, LO2
Graduate Capabilities GC1, GC2, GC10

Assessment Task 2: Image data exploration The se...

Assessment Task 2: Image data exploration

The second assessment consists of tasks to do online forum participation and image data exploration which requires fundamental knowledge of data science and machine learning. The purpose is to assess students’ online engagement and understanding and practical skills in the process of data preparation for machine learning models.

Submission Type: Individual

Assessment Method: Coding tasks + Online Viva

Weighting

30%

Learning Outcomes LO1, LO2, LO3
Graduate Capabilities GC1, GC2, GC8, GC10

Assessment Task 3: Machine learning assignment T...

Assessment Task 3: Machine learning assignment

The final assessment is a machine learning assignment focusing on classification. The assessment builds on the data prepared by the previous assessment and conducts experiments with machine learning models with consideration of machine bias. Students will use GenAI tools to evaluate and document their code. Online viva will be used to validate the antiunity of students’ learning.

Submission Type: Individual

Assessment Method: coding tasks + presentation (5 minutes)+ Online Viva

Weighting

40%

Learning Outcomes LO1, LO2, LO3, LO4
Graduate Capabilities GC1, GC2, GC6, GC8, GC10, GC12

Learning and teaching strategy and rationale

This unit is delivered through Attendance and Online modes using a single, integrated learning and teaching strategy designed to ensure equivalent learning outcomes and a comparable learning experience for all students, while supporting diverse learning needs and maximising access.

Across both modes, learning activities are intentionally aligned to the unit learning outcomes and assessment tasks, and are underpinned by active learning, guided engagement with disciplinary knowledge, opportunities for peer interaction, and regular, timely feedback. While the mode of delivery shapes how students participate, the pedagogical intent, expectations and standards remain consistent.

In Attendance mode, students engage in weekly face-to-face classes at designated locations, supported by preparatory activities prior to workshops and opportunities for consolidation following classes. Online learning platforms are used to complement face-to-face teaching through additional resources and learning activities.

In Online mode, students engage with the same core content and learning outcomes through a combination of synchronous and asynchronous activities, including structured discussions and applied learning tasks that support learning in professional contexts.

Across both delivery modes, students should plan to commit approximately 150 hours to this unit over the semester, including participation in learning activities, independent study, readings and assessment preparation.

Representative texts and references

Representative texts and references

Géron, A. (2022). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow (3rd ed.). O’Reilly Media.

Bruce, P., Bruce, A., & Gedeck, P. (2020). Practical statistics for data scientists: 50+ essential concepts using R and Python (2nd ed.). O’Reilly Media.

Jean, H. (2020). Essential math for data science. O’Reilly Media.

Ghassemi, M., Naumann, T., Schulam, P., Beam, A. L., Chen, I. Y., & Ranganath, R. (2020). A review of challenges and opportunities in machine learning for health. AMIA Joint Summits on Translational Science Proceedings, 2020, 191–200.

Verdonk, C., Verdonk, F., & Dreyfus, G. (2020). How machine learning could be used in clinical practice during an epidemic. Critical Care, 24(1), 265.

Strang, G. (2023). Introduction to linear algebra (6th ed.). Wellesley-Cambridge Press.

Yu, B., & Barter, R. L. (2024). Veridical data science: The practice of responsible data analysis and decision making. MIT Press.

Neuer, M. (2024). Machine learning for engineers: Introduction to physics-informed, explainable learning methods for AI in engineering applications. Springer.

Murphy, K. P. (2022). Probabilistic machine learning: An introduction. MIT Press.

Idrissi, A. (Ed.). (2024). Modern artificial intelligence and data science: Tools, techniques and systems. Springer.

Locations
Credit points
Year

Have a question?

We're available 9am–5pm AEDT,
Monday to Friday

If you’ve got a question, our AskACU team has you covered. You can search FAQs, text us, email, live chat, call – whatever works for you.

Live chat with us now

Chat to our team for real-time
answers to your questions.

Launch live chat

Visit our FAQs page

Find answers to some commonly
asked questions.

See our FAQs