Problem 1:This dataset consists of a number of characteristics for ...

  1. Home
  2. Homework Library
  3. Exam Prep
  4. SAT - Math
  5. Problem 1:This dataset consists of a number of characteristics for ...


Problem 1:This dataset consists of a number of characteristics for a selection of animals, and the type of each variable (Boolean indicated a ‘true/false’ value, where 1 = true and 0 = false): Boolean
2.feathers: Boolean
3.eggs: Boolean
4.milk: Boolean
5.airborne: Boolean
6.aquatic: Boolean
7.predator: Boolean
8.toothed: Boolean
9.backbone: Boolean
10.breathes: Boolean
11.venomous: Boolean
12.fins: Boolean
13.legs: Numeric (set of values: {0,2,4,5,6,8})
14.tail: Boolean 15.domestic: Boolean
a.Perform cluster analysis on this dataset and produce a tree diagram, using the animal name as the identifying variable.
b.Which is the most unique animal in this dataset?Is this an appropriate conclusion? Explain.
c.Which animal(s) is/are most related to a penguin? Is this an appropriate conclusion? Explain.

Problem 2:This dataset is from a hypothetical course, and you utilized this dataset previously as a regression analysis.Two columns have been added: student # and course average.
a.Perform cluster analysis and produce a tree diagram, not including the course average, and using the student # as the identifying variable. Which students’ performance was the most dissimilar from the rest of the class?
b.How many clusters is optimal? Explain your decision based on all 3 of the criteria discussed in class.Are they in agreement?
c.Perform cluster analysis, not including course average, and using the course average as the identifying variable. If you had to assign grades of A, B, and C to the students in this class (not including a + or -), what range for the course average (based on the cluster analysis) would you assign for A, B, and C. The ranges do not need to be continuous, but, should just be the highest/lowest grades in that range based on the course average.
You will now utilize only the data for Exam 1 and Exam 2 for the rest of this problem. d.Utilizing SAS, you will produce a scatterplot of the clusters. Utilize the code provided in the slides and modify it to produce 4 clusters. Also run the cluster analysis and produce a tree diagram. How do the two compare; are they consistent?

Solution PreviewSolution Preview

These solutions may offer step-by-step problem-solving explanations or good writing examples that include modern styles of formatting and construction of bibliographies out of text citations and references. Students may use these solutions for personal skill-building and practice. Unethical use is strictly forbidden.

    By purchasing this solution you'll be able to access the following files:
    Solution.xlsx, Solution.docx and

    for this solution

    PayPal, G Pay, ApplePay, Amazon Pay, and all major credit cards accepted.

    Find A Tutor

    View available SAT - Math Tutors

    Get College Homework Help.

    Are you sure you don't want to upload any files?

    Fast tutor response requires as much info as possible.

    Upload a file
    Continue without uploading

    We couldn't find that subject.
    Please select the best match from the list below.

    We'll send you an email right away. If it's not in your inbox, check your spam folder.

    • 1
    • 2
    • 3
    Live Chats