DATA 202 (2024) - Assignments

Assignment submission in DATA202 and DATA472

  • All assignments must be written in Rmarkdown.
  • All assignments must be submitted electronically using the DATA202 online submission system or the DATA472 online submission system.
  • For each assignment you should prepare an Rmarkdown file named 'assignmentX.Rmd' where X is the assignment number. This is the only file you will need to upload.
  • The YAML header of the Rmarkdown file should have the following structure:

---
title: "Assignment X"
author: "Your Name, 30XXXXXX"
date: "16 July 2024"
output: html_document
---

  • Make sure you put your name and Student ID number in the author: field.
  • Submission procedure:
    • Navigate to the correct assignment in the DATA202 online submission system or the DATA472 online submission system, or click the appropriate submission link in the table below. Make sure you submit to the right page, with the correct assignment number and the correct course (DATA202 or DATA472); otherwise your assignment won't be visible to the markers. (Note: the system will not show an error message if you submit to the wrong course.)
    • Click "Add Files" and select your assignment submission file "assignmentX.Rmd" (X=assignment number).
    • Only when the correct file is uploaded will you be able to click "Run Checks" - this is when your code is actually executed.
    • Make sure you do click "Run Checks": if you don't, there may be errors in your code that you're not aware of.
    • You may need to wait a few moments before the execution is complete.
    • You will see two files appear: "[courseid]-assignmentX-submission-report.txt", which may contain error messages, and "[username]-assignmentX.html" (or, for some assignments, a .pdf), which contains the Rmarkdown output.
    • Check both of these to see if there are any errors. If there are, you can resubmit as many times as you like until the submission deadline.
    • You cannot upload additional data files for your Rmd code to read. In assignments that need data files, these files will already be available within the submission system. (Make sure that you don’t refer to data file locations that are specific to your own machine.)
    • Assignment 0 is available to you at all times to test your code. You can submit to this assignment as many times as you want (or not at all) to check your code for errors. Here is an example assignment0.Rmd file for you to try out first. When you submit, the file titanic.csv will be available in the submission system for you to work with.
    • Your assignment will be marked online, and you'll get your mark and any feedback through the submission system: at the top right of the assignment submission page you'll see a "Marks" link that shows your assignment marks.
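
The advice above about not referring to data file locations specific to your own machine can be illustrated with a small R sketch. This is only an illustration under assumptions: the helper `is_machine_specific()` is hypothetical (it is not part of the submission system), and the sketch assumes that titanic.csv sits in the working directory when the Rmd file is executed.

```r
# Hypothetical helper, not part of the submission system: returns TRUE for
# paths that start with "/", "~", or a Windows drive letter such as "C:".
is_machine_specific <- function(path) {
  grepl("^(/|~|[A-Za-z]:)", path)
}

# Refer to the data file by name only, relative to the working directory
# (assumption: the submission system places the file there).
data_file <- "titanic.csv"
stopifnot(!is_machine_specific(data_file))

if (file.exists(data_file)) {
  titanic <- read.csv(data_file)
  str(titanic)  # quick look at the columns that were read
} else {
  message("titanic.csv was not found in the working directory")
}
```

A path such as "C:/Users/me/Downloads/titanic.csv" would be flagged by the helper, because that location is unlikely to exist anywhere other than on your own computer.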

Tests

The three tests will take place in person in the regular lecture time. Details of what you need to learn will be announced in due course in class and on this page. Each test should take approximately 30 minutes to complete.

Final test (DATA 202 only)

The final test for DATA 202 will be in person for 120 minutes, and will be held during the Assessment period. Further details, including the date and time of the test, will be announced in class after the mid-trimester break.

Project (DATA 472 only)

DATA 472 students do not sit the final test, but instead will submit a software project. See the Postgraduates page for more information.

Student Honour Code

When submitting my work I confirm:

  • I have completed all steps of the attached assessment on my own,
  • I have not used any unauthorised materials while completing this assessment, and
  • I have not given anyone else access to my assessment.

Note that all students are bound by the Student Conduct Statute - which means students must not work together on assessments, nor copy from other sources. Disciplinary penalties will be applied if students are found to be cheating.

The use of AI tools such as ChatGPT is strictly forbidden in all assessments.

View your marks here

| Assignment | Value | Due date | Additional Materials | DATA202 | DATA472 | Set by |
|---|---|---|---|---|---|---|
| Assignment 0 | -- | No due date | assignment0.Rmd; titanic.csv | Submit | Submit | LM |
| Assignment 1 Questions UPDATED | 10% | 6pm, Fri 22 Mar | Assignment 1 Solutions; Common mistakes in Assignment 1 (star_dataset_deepu_kaggle.csv) | Submit | Submit | LM |
| Test 1 | 15% | 2pm, Thu 28 Mar | Instructions and what you need to learn; Base R Cheat Sheet; Practice Test (Questions only); Common Mistakes | | | LM |
| Postgrad Proposal | 0% | 6pm, Thu 18 Apr | DATA472 only; More information here | | Submit | LM |
| Assignment 2 Questions | 10% | 6pm, Fri 10 May | movies500.csv; genres.csv; movies500_genres.csv; motor_vehicle_modified.csv; average_weekly_earnings.csv | Submit | Submit | BN |
| Test 2 | 10% | 11am, Fri 17 May | Instructions and what you need to learn; Cheat sheets for Test 2; Practice Test: see the exercises in Binh's last lecture (Week 9) | | | BN |
| Assignment 3 Questions | 10% | 6pm, Fri 24 May | Assignment 3 Template.Rmd | Submit | Submit | RAd |
| Test 3 | 5% | 11am, Fri 31 May | | | | RAd |
| DATA 202 Final Test | 40% | TBC | DATA 202 only, 120 min test | | | |
| Postgrad report | 40% | 6pm, Tue 11 Jun | DATA 472 only | | Submit | |
| Postgrad presentation | -- | 9am-12pm, Thu 13 Jun | DATA 472 only, Room CO431 | | | |
| Postgrad code | -- | 6pm, Thu 13 Jun | DATA 472 only | | Submit | |