INFS5730 - Social Media Analytics in Practice - T3 2024
SAS Hands-On Assignment - SAS Visual Text Analytics
In this hands-on assignment you are required to conduct a textual analysis using SAS Visual Text Analytics and submit a report on Moodle course site through Turnitin. The due date of this assignment is on Week 6, Friday 5:00pm 18th October 2024 (AEST).
Please note that this assignment is worth 25% of your overall course mark.
Requirements
The purpose of this assignment is to use SAS Visual Text Analytics to analyse a dataset called laptop_reviews available on Moodle as a CSV file. The dataset consists of a sample of 6,400 customer reviews of laptops purchased on Amazon Website from 8 major brands: Acer, Alienware, ASUS, Dell, HP, Lenovo, MSI, and Razer (800 customer reviews for each brand) .
The file laptop_reviews.csv, available on Moodle, includes the following fields:
• ProductId: Unique identifier for the product
• UserId: Unique identifier for the user
• Score: Rating between 1 and 5
• Review: Text of the review
• Brand: Brand of laptop (Acer, Alienware, ASUS, Dell, HP, Lenovo, MSI, or Razer)
This is a sample dataset derived from a larger dataset available at:
https://huggingface.co/datasets/naga-jay/amazon-laptop-reviews-enriched.
You are required to conduct a data analysis of the customer reviews provided in the dataset laptop_reviews.csv using SAS Visual Text Analytics in two parts. Part 1 consists of exploring predefined concepts and automatically generated topics to derive insights from the data. Part 2 consists of defining your own custom concepts and custom categories to answer specific research questions.
Deliverable
In this assignment you are required to submit a report (in Word format) including the following components:
• A standard cover page (available on Moodle).
• Part 1
o Predefined Concepts (worth 20% of the available marks) - up to 600 words An exploration of the dataset using TWO (2) relevant predefined concepts.
For each selected predefined concept, your answer must include the following:
- An explanation of why you think the selected predefined concept can be relevant to your data analysis.
- A discussion of the findings and the insights that you could unveil from these findings. Include relevant screenshots from SAS Visual Text Analytics.
- A discussion of the benefits and limitations of relying only on the selected predefined concepts.
o Auto-generated Topics (worth 20% of the available marks) - up to 600 words
- An exploration of the dataset using TWO (2) relevant topics among those automatically generated by SAS.
For each selected topic, your answer must include the following:
- An explanation of why you think the selected topic can be relevant to your data analysis.
- A discussion of the findings and the actionable insights you could derive from these findings. Include relevant screenshots from SAS Visual Text Analytics.
• Part 2
o Custom Concepts (worth 30% of the available marks) - up to 800 words Write TWO (2) custom concepts, each using a different concept rule type. For each custom concept, your answer must include the following:
- An explanation of the objectives of your analysis
- A justification of the reasons behind your choice of the concept rule type
- The custom concept rule to fulfil the objectives of your analysis
- A detailed explanation of the concept rule syntax
- A discussion of the findings and insights that you could derive from these findings. Include relevant screenshots from SAS Visual Text Analytics.
o Custom Categories (worth 30% of the available marks) - up to 800 words Write TWO (2) custom categories.
For each custom category, your answer must include the following:
- An explanation of the objectives of your analysis
- The custom category rule to fulfil the objective of your analysis
- A detailed explanation of the category rule syntax
- A discussion of the findings of your analysis and insights that you could unveil from these findings. Include relevant screenshots from SAS Visual Text Analytics.
Word Limit
Each section of the report has a word limit, as indicated in Deliverable. The distribution of word count proportionally reflects the complexity and significance of each section, totalling a maximum word length of 2,800 words. There is a (+10%) leeway in word limits for each section.
Please note that screenshots, custom concept and custom category syntax are excluded from the word count.
You should be mindful of the marks awarded to each section, as indicated in Deliverable, when allocating the number of words spend on each section.
Please note that material presented in excess of the word limit for each section will not be considered when grading the assignment.
Formatting
Formatting
Please ensure the following requirements:
• Arial 12-point font
• 1.5 spacing
• Page numbers on each page
• Cover page included
• All required sections included
Submission
Upload your report document (in Word format) on Moodle.
• You can only upload one report document.
• You are advised to keep a copy of your submission.
The originality of the submission will be checked using Turnitin. Please check the originality report generated by Turnitin during the submission process.