Introductory Statistics

 

IS: R Assignment 19

R Assignment #19

 

General Purpose

General Purpose of these Assignments (the usual): The purpose of these R Assignments is to give you some pointed, direct practice in using R. As such, these are designed to be quick and to the point (less than 10 minutes each). They are also designed to give you a place to return if you forget how to perform some analysis in the future.

Please supply your results in the form below. Clicking on “Click to Check Your Answers” will allow you to see which are correct and which are not. When all are correct (and you can try as many times as you wish), you will be allowed to send your answers to me for credit by clicking on “Click to Email Your Results.” You only receive credit when this is submitted (with all answers correct).

This assignment is due at the start of the class period on

Wednesday, February 14, 2024.

With that being said, if this R Assignment is still available, which it could be until approximately 11:59 pm (CST), then you are able to work on it.

As expected, these are graded according to the syllabus (all or nothing). Please review the appropriate section in the syllabus for more information. Also, if this is not submitted before it is due, then it counts as a zero.


Specific Purpose: Here, I check that you can calculate confidence intervals using R. Remember that these assignments are not here to test your understanding (usually). They are only here to check that you can perform the analysis.


Slidedeck Support: The following slidedecks may be helpful for you in completing this R Assignment:


The Problems

Note that we are assuming this particular triathlon is representative of all triathlons. From what I know of triathlons, this does not seem to be an unreasonable assumption. This particular race attracts those who are dedicated to the sport, along with a few who are entering only because it is in Corvallis, OR, the county seat of Benton County.

First, run the following code. Then, answer the questions that follow. These commands load a particular data set and attach it. In other words, they allow you easy access to a common data set.

source("http://rfs.kvasaheim.com/stat200.R")
dt = read.csv("http://rfs.kvasaheim.com/data/HeartOfTheValleyTriathalon.csv")
attach(dt)
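
As a small illustration of what attach() does, here is a minimal sketch using a made-up data frame. The name toy and its values are hypothetical; they are not the triathlon data. Once a data frame is attached, a column can be referred to by its bare name, which is why the later code can use TOTALTIME rather than dt$TOTALTIME.

```r
# Hypothetical toy data frame; these values are invented for illustration
toy = data.frame(TOTALTIME = c(5000, 5200, 4800),
                 GENDER    = c("F", "M", "F"))

mean(toy$TOTALTIME)   # without attach(): name the data frame explicitly

attach(toy)           # put the columns of toy on the search path
mean(TOTALTIME)       # with attach(): the bare column name works
detach(toy)           # undo the attach when finished
```

Both calls to mean() return the same value. Calling detach() at the end keeps the toy columns from lingering on the search path.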

Note that I want you to only use the parametric procedure. Do not use, for example, the bootstrap or the Wilcoxon procedure on these.

  1. What is the lower endpoint of a 95% confidence interval for the average time to complete a triathlon (TOTALTIME)?
  2. What is the lower endpoint of a 95% confidence interval for the variance of the time to complete a triathlon (TOTALTIME)?
  3. What is the lower endpoint of a 95% confidence interval for the proportion of triathlon entrants who identify as female (GENDER)?

Finally, to receive credit for this assignment, please provide your full Knox College email address:

then click on the button here.

The Answers

Since this assignment is now past due, I can give you the code and the answers:

t.test(TOTALTIME)
onevar.test(TOTALTIME)
binom.test(x=41,n=97)

The answers are

4930.214
456601.6
0.3229983
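
For future reference, the same lower endpoints can be pulled out or computed directly. The following is only a sketch. The vector x is simulated (made up), so its results will not match the first two answers above. The variance calculation assumes that onevar.test, which is not part of base R, uses the usual chi-square interval for a variance. The counts 41 and 97 are the ones from the binom.test line above.

```r
# Simulated stand-in for TOTALTIME; values are made up for illustration
set.seed(19)
x = rnorm(97, mean = 5000, sd = 800)
n = length(x)

# Mean: t interval, lower endpoint is xbar - t * s / sqrt(n)
lowerMean = mean(x) - qt(0.975, df = n - 1) * sd(x) / sqrt(n)
lowerMean
t.test(x)$conf.int[1]          # same value, extracted from t.test

# Variance: chi-square interval (assumed to be what onevar.test reports);
# the LOWER endpoint divides by the UPPER chi-square quantile
lowerVar = (n - 1) * var(x) / qchisq(0.975, df = n - 1)
lowerVar

# Proportion: binom.test reports the exact (Clopper-Pearson) interval,
# whose lower endpoint is a beta quantile
successes = 41
trials    = 97
lowerProp = qbeta(0.025, successes, trials - successes + 1)
lowerProp
binom.test(x = successes, n = trials)$conf.int[1]   # same value from binom.test
```

The $conf.int[1] idiom is handy whenever only one endpoint is needed: the first element is the lower endpoint and the second is the upper.
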
This page was last modified on 2 January 2024.
All rights reserved by Ole J. Forsberg, PhD, ©2008–2024. No reproduction of any of this material is allowed without explicit written permission of the copyright holder.