Unit rationale, description and aim

Data science is an inter-disciplinary area that employs scientific methods, algorithms, tools and systems for extracting insights, knowledge and value from data. Machine learning, as a core part of data science and data analytics, and a subfield of artificial intelligence, is the scientific study of algorithms and mathematical models that computer systems use to make decisions or predictions. Machine learning algorithms and models are widely used in human’s digital life such as email client, search engine, social media, virtual personal assistant and recommendation system, although machine bias is an important ethical concern of which many people are unaware. Python is one of the most popular programming languages with comprehensive libraries and tools for putting data science and machine learning into practice in an efficient manner.

This unit will cover fundamental concepts and theories of data science and machine learning with focus on their practical use and implementations. The issue of machine bias in machine learning and how it may have an adverse impact on the common good will be examined. The aim of the unit is to learn both theoretical and practical data science and machine learning techniques to build real-world data science and machine learning solutions.

2026 10

Campus offering

Find out more about study modes.

Unit offerings may be subject to minimum enrolment numbers.

Please select your preferred campus.

  • Term Mode
  • Semester 1Campus Attendance

Prerequisites

ITEC102 Python Fundamentals For Data Science

Learning outcomes

To successfully complete this unit you will be able to demonstrate you have achieved the learning outcomes (LO) detailed in the below table.

Each outcome is informed by a number of graduate capabilities (GC) to ensure your work in this, and every unit, is part of a larger goal of graduating from ACU with the attributes of insight, empathy, imagination and impact.

Explore the graduate capabilities.

Demonstrate comprehensive knowledge with data scie...

Learning Outcome 01

Demonstrate comprehensive knowledge with data science libraries and tools for data processing and analysis
Relevant Graduate Capabilities: GC1, GC10

Demonstrate data science and machine learning prep...

Learning Outcome 02

Demonstrate data science and machine learning preparation skills, via key techniques learnt and the use of relevant tools
Relevant Graduate Capabilities: GC2, GC10

Implement a data science and machine learning appl...

Learning Outcome 03

Implement a data science and machine learning application with an appropriate choice of data science and machine learning techniques
Relevant Graduate Capabilities: GC2, GC8

Explain the issue of machine bias and how it may a...

Learning Outcome 04

Explain the issue of machine bias and how it may affect the common good
Relevant Graduate Capabilities: GC1, GC6

Content

Topics will include:

  • Overview of data science and its implementation life cycle and tools
  • Recap of data processing concepts and techniques
  • Exploratory data analysis in data science
  • Machine learning (ML) introduction
  • ML projects and basic linear algebra
  • Basic matrix analysis, dimensional reduction, and SVD, PCA
  • Basic classification and evaluation metrics
  • Regression (linear, polynomial), overfitting and regularization
  • Clustering: k-means and mixture of Gaussians
  • Better evaluation with k-fold cross validation and finetune model with grid search
  • Neural networks and deep learning
  • Machine bias in the real world and its impact on the common good

Assessment strategy and rationale

A range of assessment procedures will be used to meet the unit learning outcomes and develop graduate attributes consistent with University assessment requirements. The first assessment item consists of simple data and machine learning practical tasks. The purpose is to assess students’ practical data science and machine learning skills of Python data science and machine learning libraries and tools. The second assessment is a more specific image data exploration and machine learning preparation task that requires fundamental knowledge of data science and machine learning. The purpose is to assess students’ online engagement and understanding and practical skills in data preparation for machine learning algorithms and models. The final assessment is to conduct experiments with one machine learning algorithm e.g. classification. The purpose is to assess students’ machine learning practical skills and techniques with consideration of machine bias, building on the machine learning preparation task.

The assessments for this unit are designed to demonstrate the achievement of each learning outcome. To pass this unit, students are required to obtain an overall mark of at least 50%.

Overview of assessments

Assessment Task 1: Practical programming The firs...

Assessment Task 1: Practical programming

The first assessment consists of practicing simple Python data science and machine learning libraries. The assessment requires students to demonstrate their understanding and use of Python data science and machine learning libraries and tools.

Submission Type: Individual

Assessment Method: coding tasks

Artefact: Code

Weighting

30%

Learning Outcomes LO1, LO2
Graduate Capabilities GC1, GC2, GC10

Assessment Task 2: Image data exploration The se...

Assessment Task 2: Image data exploration

The second assessment consists of tasks to do online forum participation and image data exploration which requires fundamental knowledge of data science and machine learning. The purpose is to assess students’ online engagement and understanding and practical skills in the process of data preparation for machine learning models.

Submission Type: Individual

Assessment Method: Coding tasks + Online Viva

Weighting

30%

Learning Outcomes LO1, LO2, LO3
Graduate Capabilities GC1, GC2, GC8, GC10

Assessment Task 3: Machine learning assignment T...

Assessment Task 3: Machine learning assignment

The final assessment is a machine learning assignment focusing on classification. The assessment builds on the data prepared by the previous assessment and conducts experiments with machine learning models with consideration of machine bias. Students will use GenAI tools to evaluate and document their code. Online viva will be used to validate the antiunity of students’ learning.

Submission Type: Individual

Assessment Method: coding tasks + presentation (5 minutes)+ Online Viva

Weighting

40%

Learning Outcomes LO1, LO2, LO3, LO4
Graduate Capabilities GC1, GC2, GC6, GC8, GC10

Learning and teaching strategy and rationale

This unit is offered in two delivery modes—Attendance and Online—to support diverse learning needs and maximise access for isolated or marginalised groups.

Attendance Mode

Students attend weekly face-to-face classes at designated locations and engage directly with lecturers to support achievement of learning outcomes. The unit requires preparation before workshops (typically around one hour) and at least one hour of consolidation afterwards. Online learning platforms provide additional preparatory and practice activities to reinforce learning.

Online Mode

The online mode enables students to explore core disciplinary knowledge through both synchronous and asynchronous learning. Weekly discussion activities and active learning tasks encourage the application of theoretical concepts in professional contexts. Engagement with peers supports constructive learning, and students receive regular, timely feedback throughout the semester.

Students should anticipate undertaking 150 hours of study for this unit, including class attendance, readings, online engagement and forum participation and assessment preparation.

Representative texts and references

Representative texts and references

Géron, A. (2022). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow (3rd ed.). O’Reilly Media.

Bruce, P., Bruce, A., & Gedeck, P. (2020). Practical statistics for data scientists: 50+ essential concepts using R and Python (2nd ed.). O’Reilly Media.

Jean, H. (2020). Essential math for data science. O’Reilly Media.

Ghassemi, M., Naumann, T., Schulam, P., Beam, A. L., Chen, I. Y., & Ranganath, R. (2020). A review of challenges and opportunities in machine learning for health. AMIA Joint Summits on Translational Science Proceedings, 2020, 191–200.

Verdonk, C., Verdonk, F., & Dreyfus, G. (2020). How machine learning could be used in clinical practice during an epidemic. Critical Care, 24(1), 265.

Strang, G. (2023). Introduction to linear algebra (6th ed.). Wellesley-Cambridge Press.

Yu, B., & Barter, R. L. (2024). Veridical data science: The practice of responsible data analysis and decision making. MIT Press.

Neuer, M. (2024). Machine learning for engineers: Introduction to physics-informed, explainable learning methods for AI in engineering applications. Springer.

Murphy, K. P. (2022). Probabilistic machine learning: An introduction. MIT Press.

Idrissi, A. (Ed.). (2024). Modern artificial intelligence and data science: Tools, techniques and systems. Springer.

Locations
Credit points
Year

Have a question?

We're available 9am–5pm AEDT,
Monday to Friday

If you’ve got a question, our AskACU team has you covered. You can search FAQs, text us, email, live chat, call – whatever works for you.

Live chat with us now

Chat to our team for real-time
answers to your questions.

Launch live chat

Visit our FAQs page

Find answers to some commonly
asked questions.

See our FAQs