Introductory Statistics
The Syllabus for the Spring 2022 Class
The purpose of this course is to introduce you to analyzing data and to performing statistical analyses in your own research. Briefly, this five-step process is framing the research, collecting the data, analyzing the data, interpreting the results, and presenting the results. To support these steps, we will also cover some probability distributions. The probability topics allow one to better understand the inherent randomness in life (and in our research).
This scope corresponds to most of Chapters 1–12 in the R for Starters (Forsberg) text and Chapters 1–12 in Warren, Denley, and Atchley (Hawkes) text.
This is a typical introductory statistics course for undergraduate students. It is a study of the acquisition, analysis, interpretation and presentation of data.
Topics include: descriptive statistics and statistical graphics, experiments vs. observational studies, elementary probability, random variables and distributions, sampling distributions of statistics, confidence intervals, hypothesis testing for means and proportions, correlation, linear regression, and an introduction to ANOVA.
The following are the materials, both required and not, used in this course.
This is currently the primary textbook for the course. It is the least-expensive option that includes an online homework component. However, this course makes the online homework optional in most cases (see below).
Carolyn Warren, Kimberly Denley, and Emily Atchley. Beginning Statistics. Third ed. Mount Pleasant, SC: Hawkes Learning System, 2020.
The Hawkes Learning System (HLS) handles all of the online chapter assignments. The sign-up URL is linked here. Make sure you access this soon; it will make your first week much easier. Also, note that there is an option for a couple of weeks of free usage. That will be useful for you, especially as you read through this syllabus.
If you are not using the primary textbook, then you should be using this. This text is designed for students who are more interested in how statistics is used than in what the formulas are.
Ole J. Forsberg. R for Starters, v55721. Galesburg, IL. 2021.
This text gives better insight into randomness — and its effects — than does the primary text. This also explores resampling and simulation as methods for understanding sample statistics and the population parameter of interest.
As this is an applied statistics course, you will also need to use a computer to perform important statistical calculations. For many reasons, including flexibility, cost, ubiquity, and pedagogy, we will use the R Statistical Environment.
On the first Friday of the term, we will do Statistical Computing Activity #0 during class. In addition to installing R
on your computer, this acitivity emphasizes the importance of structure (order) on your analyses and projects. In my experience, the #1 cause of problems in research is not being orderly. Please learn how to be orderly; it will make your life easier.
By the end of this course, you should
If your question is “What will it take to succeed in this course?” then the answer is “Being a good student.” In my experience, your ability as a student is the greatest predictor of success in courses such as this. Being a good student means that you
You are responsible for all material covered during the class period and all material in the readings and activities. Feel pressured to ask questions during the class regarding the material, since the material covered during the classes may or may not cover everything that is in the text. Asking questions signals to me that you are engaged. Those who do not ask questions tend to be lost.
As with most courses at Knox, you should be prepared to spend 15 hours per week on the coursework for this class. That includes time in class and the time outside class. Since you spend 4 hours and 40 minutes in class, you should be willing to spend 10 hours and 20 minutes outside class in preparing for class, working on homework, reviewing notes, and anything else associated with the course.
Knox College is committed to the maintenance of the highest standards of academic integrity, including ethical conduct, of its citizens. Personally, I strive to maintain this level of ethical behavior and integrity in this course because it encourages you to understand how to advance humanity at the local level.
Should I discover that you participated in a behavior that violates academic integrity (e.g., unauthorized collaboration, plagiarism, cheating on examinations, fabricating information (lying), helping another person cheat, unauthorized advance access to examinations, altering or destroying the work of others, or fraudulently altering academic records), I will sanction you because your actions undercut the value of Knox College and what it offers to its students (you), its alumni (future you), and the world (all of you). Please read through the Knox College Honor System to familiarize yourself with what constitutes a violation. When in doubt, please check with me. I am here to help.
Ultimately, this all has little to do with cheating. It has everything to do with understanding what it actually means to learn and to represent yourself academically. In college, your personal value has not increased because of your grades; it has increased because of your learning. To cheat is to claim two things about yourself and your skills: I am not able to do this; and I am only focused on today and not the future.
Again, please, if you are looking to take a shortcut, then chat with me. This desire often arises from what I call “life crash,” where things are just happening faster than you planned. Again, let’s chat to see how to fix this before you cheat. I’ve taught for 30 years. I have some experience.
Specifically, to be clear, for this class:
R
assignments, you should work together to understand the material, but you must write up your submissions alone.Knox College abides by Section 504 of the Rehabilitation Act of 1973 which stipulates that no student shall be denied the benefits of an education “solely by reason of a handicap.” Disabilities covered by law include, but are not limited to, learning disabilities, psychological disabilities, health impairments, hearing, and sight or mobility impairments.
If you have a disability that may have some impact on your work in class and for which you may require accommodations, please contact the awesome Stephanie Grimes at the Office of Disability Support Services (ODSS information; office: SMC E-115; email: disabilityservices@knox.edu) so that such accommodations may be arranged. Note that the accommodations are not retroactive; I cannot make adjustments until ODSS contacts me to let me know what I can do. Furthermore, accommodations need to be reinstated at the start of each term.
Needing help in college is a positive sign that you are engaging your learning and coursework. I really like to hear questions! By the way: The Center for Teaching and Learning (CTL) wants to help you! The CTL is responsible for the academic support needs of all Knox students. Here is some information provided about the CTL tutors this term:
Writing Tutors. If you want help on a paper, no matter the course, you can find writing tutors in Red Room, Seymour Library, Monday thru Friday, from 10:30 to 4:00pm, and again in Red Room on Tuesday, Wednesday, and Thursday nights, from 7:00 to 9:00pm. You can also schedule appointments to meet with a writing tutor either online or in person (should you both so desire) by visiting the Center for Teaching and Learning website.
Subject Tutors. Got a kink in your coding or want to test out a solution to that vexing homework problem? You are ready for the Red Room study tables! The Red Room study tables are great resources for students wanting help with virtually any subject, including STAT 200.
There are two Red Room Study Tables: “Red Room” and “Red Room SMC.” Red Room SMC is intended to support those classes taught in SMC and is located in SMC A-201, while Red Room is for most other courses and is located in the Red Room on the second floor of Seymour Library. Both study areas are free and open on a walk-in basis from 7:00pm to 9:00pm, on Tuesday, Wednesday, and Thursday nights.
You can also get help from a subject tutor by scheduling an appointment to meet one-on-one with a tutor. These tutors can be reached by visiting the Center for Teaching and Learning website. When scheduling appointments, please be sure to sign up for them no later than 10:00pm the night before you want to get help. In this way, our tutors will have time to adjust their work and schedules and can plan to meet with you.
Specifically, for this class:
As with all courses, your grade for this course depends on how well you meet the requirements set forth in this syllabus. The following section provides information about the various grade inputs.
To ensure a breadth in using statistics, I require that you do two of the following three groups: Hawkes, Labs, and Practicums. The following provides explanations of these three groups, giving you some insight into selecting the options that are best for you.
These assignments are located in the Hawkes Learning System part of the course. Reaching the Mastery Level each chapter is worth 10 points towards your final grade, with the two lowest chapter scores dropped (100 points total). These are due at 11:59pm on the due dates provided in the schedule. I transfer the Hawkes grade to Moodle after that. Late Hawkes Assignments are worth 50%. None are accepted after May 26, 2022.
Key to Success: Practice helps one learn the material better. Practice is very important. The textbook offers several examples of each type of analysis. Project Scarlet also offers practice in many of the more difficult calculations. Make good use of everything to help you better analyze data. This means YouTube will help.
Hawkes is just the start. Think of it as a key, allowing you to open the door to the discussions we have in class. It should not be the goal for you. It should be the first step on your journey to understanding statistics and statistical analysis… the first step, not the last.
The Hawkes Learning System online assignments are designed for those who need a further introduction to the statistical concepts covered in class. These problems tend to aim towards the lower levels of Bloom’s Taxonomy (see section below).
Laboratory activities concern important statistical concepts like the Central Limit Theorem, confidence intervals, and the p-value. These SLAs consist of a “lab” (the activity) and a “post-lab” (submitted assignment). You are to learn from this activity. The post-labs are questions that extend the lab, checking that you were able to learn what I wanted you to learn about the material. It is this post-lab that must be printed off and submitted to the professor at the start of class.
The Statistical Laboratory Assignments are designed for those who want to go beyond the rote steps of elementary statistical analysis. These are designed to give a deeper understanding of statistics, statistical analyses, and statistical concepts. If you are looking to continue in statistics, data science, computer science, or mathematics, these are for you. Furthermore, if you are majoring in a science, you should do these laboratory activities to ensure that you have a deeper understanding of the statistics you will be using in your studies.
The Hawkes Learning System is excellent in giving you practice in learning and performing the basic calculations. The R for Starters text takes it to the next level by showing you what to do when the basic assumptions of Hawkes are not met in genuine analyses. These form the foundation of your statistical knowledge. Be aware that neither Hawkes nor RfS are the goal; they are merely the first step on your path to mastery. The next step consists of the applying what you have learned to practical problems. These “practicum” activities serve as checks on your ability to actually do statistics in a non-classroom setting.
To help make statistics even more applicable to you in your future research endeavors, I have created two “themes,” each dedicated to one of the disciplines that take STAT200. You should select the most applicable of the options for you. The data are provided, and the goals of each practicum assignment are the same across the disciplines.
The Practicum Assignments are designed for those who want to see how to use statistics. These assignments will give practice in doing the calculations on realistic data as well as give you practice in presenting results in a meaningful and compelling manner.
R
Assignments ( TRA )
The calendar frequently provides assignments related to R
. These are aimed at the lowest levels of Bloom’s Taxonomy so that you can obtain quick practice in using R
. These are three points each and need to be submitted (via the activity itself) by the start of the class period in which they are due.
Because I have programmed these, it may be that the code does not work as I wish. Perhaps the actual “due time” is later in the day. If the R
Assignment is still available, it is still open to you for full credit.
Frequently, I (or the calendar) will give an activity to do at home or during class. These “Statistical Computing Activities” serve one primary purpose: to give you hands-on practice of the topics covered in the course. Their value rests in this alone.
While these are not graded, you need to take responsibility for your education and do all of these. One cannot learn statistics without doing statistics. These give you an opportunity to “do statistics.”
Key to Success: Take these activities seriously. Make sure you do all of them. Practice always helps. Also, think about what the activity is teaching, not what it is having you do. Anyone can follow the steps. A good student will think about why those steps exist.
Towards the end of the term, I assign four in-class activities designed to test your ability to determine the correct statistical procedure, to perform that procedure, and to present your results. These only provide the research questions. You are tasked with determining the correct statistical procedure that will answer the research question.
These are done in groups. One person from each group will submit the solutions by the end of the day (11:59pm) of the class period in which we do these. One submission per group.
Infrequently, I will give a quiz at the start of the class. These quizzes are designed to encourage you to properly prepare for the class. This means you do the readings and read through the slide decks. Thus, I tend to have quizzes more frequently in the beginning of the term. This is the time you are forming habits for the term.
I would like to have these quizzes online through Moodle. However, that requires everyone to bring laptop computer to class daily. On the first day, I will poll the class to see if that is a reasonable expectation. If not, I will have quizzes on paper.
These quizzes are worth three points each.
There is a 50-point examination following the first chapter, a 100-point examination following the introduction to hypothesis testing, and a 100-point cumulative final examination during the finals period. The second examination is “soft-cumulative,” meaning it tests all previous material but focuses on the material since the previous examination.
Key to Success: You are prepared for an examination when you can do any problem without having to think through the process. An analogy is that you are ready to go to kindergarten when you can tie your shoes without thinking about it. Reflex indicates mastery. Attack the problem!
To that end, I encourage all of you to meet up and hold weekly study sessions. While past courses have frequetly formed these based on the major field of study, some of the more successful ones have been formed by those with an opening in their Wednesday-night schedule.
The last day to turn in any assignment is the end of classes, May 26, 2022. Except for the Hawkes Learning System assignments, I accept no late assignments without an excuse that I find acceptable. Such excuses include varsity sports, deaths, dismemberments, and zombie apocalypses (apocalypsi?).
Key to Success: To be early is to be on time. Plan ahead. As emergencies can (and do) come up, you should aim to have all major assignments completed at least 48 hours before they are due. Such a safety margin allows you to focus on the emergency when it happens, thus reducing the stress in your life.
Note that college is an easy on-ramp to profressional adulthood. Mistakes can be made here without you necessarily being fired (given an F). Missing two days of classes will hurt your grade, but you will still have a job. In your profession, deadlines are sacrosanct. Start treating them as such; it will make the transition much easier.
Like it or not, you are an adult. I expect you to attend every single class period. If I’m there, you should be there. If you are not there, I expect you to accept the consequences for that decision. To make things more complicated, if you are sick (cold, flu, etc.), then you are responsible for those who get sick because of you. If you are sick, I hope you stay home, get well,… and not make others sick. I am serious about this.
Key to Success: Do not wait to study for the examinations. You should be studying for them throughout the term. Every class night, review your notes from that day and check that they make sense. Every weekend, review your notes from that week and check that they all make sense. At the start of every class, look over your notes to see what I will be talking about in class. (Get off your phone!) In all cases, look for patterns and connections. If you do this, then there will be no need for cramming, because you will have actually learned the material.
So, those are all of the components to this course. Not all of it needs to count towards your grade, however. Here is a list of the assignments for the course. Please download it, print it off, add the information to your calendar, etc.
I calculate your percent in the course by adding all of the points you earned during the semester and dividing by the total number of points that you could have earned. This percentage is then used to determine your final letter grade for the course. I do not round.
A- | 90 – 93% | A | 93 and above | ||||
B- | 80 – 83% | B | 83 – 87% | B+ | 87 – 90% | ||
C- | 70 – 73% | C | 73 – 77% | C+ | 77 – 80% | ||
D- | 60 – 63% | D | 63 – 67% | D+ | 67 – 70% | ||
F | below 60% |
Should you ask me what your grade is, or what you need on the final to get a _________ (whichever the grade), I will suggest you calculate it for yourself. You have the mathematical skills to do so; they are a prerequisite for the course.
By the way: If you have the highest score in the course, and if your score is greater than 97%, then I will award you an A+.
Remember college is not high school. Strive for a deeper understanding (this is what you are paying for — an understanding of your interests). Bloom’s Taxonomy is helpful here. You may not appreciate it until you do poorly on an examination, but you should be very familiar with it.
Look through it to check the level you were expected to achieve in high school. In college, you need to hit all levels. The higher you get, the better you have mastered the material. For many of you, this is your only chance to master the elementary levels of statistics. Take advantage of me for your edification.
Humankind cannot gain anything without first giving something in return. To obtain, something of equal value must be lost. That is […] the first law of Equivalent Exchange.
Here is a list of the verbs associated with each level. Use them to ensure that you are operating at the college level.
Finally, there is an old adage that one never knows more than when teaching. That means you will never understand the material better than when you can teach it to others. You will want to achieve that level. It takes a lot of time, but the activities and labs push you to achieve the higher levels of mastery. Take advantage of them; you are paying for them.
Life tests you first, then gives you the lesson.
Here are three videos that will help you better understand what it means to be a successful adult (college student). They encourage you to think about thinking. “Blowing these off” shows either that you do not care about mastery or that you think I do not know what I am doing.
By the way… One of the most important things you can do to ensure your success in a course is to read — and understand — the entire syllabus. To help encourage you to gain proper habits in college, I will give five points of extra credit for those who email me a meme that makes me smile. Since I am easily amused, this should be an easy five points for you. I like cats and dogs and beavers and clever wordplay. This must be done by the end of the first Sunday of class (March 27, 2022).
The actual schedule can be downloaded from the calendar page.
Learning Module One: Obtaining Your Data
An introduction to statistics and statistical analysis: scientific method, variable types, sampling schemes, critiquing published papers, using R
.
Learning Module Two: Knowing Your Data
September 23 – September 30
A start at summarizing your data: graphics, summary statistics, frequency tables, center, spread (uncertainty), skew, using R
to do all of this.
Learning Module Three: Probability
October 1 – October 13
Continue learning about the data: probability distributions, pmf, pdf, cdf, expected value, variance, discrete vs. continuous distribution, Binomial, Poisson, Uniform, Exponential, Normal.
Learning Module Four: Introductory Inference
October 14 – October 29
The Central Limit Theorem, the theory of confidence intervals, calculating confidence intervals, assumptions underlying confidence intervals. The theory of hypothesis testing, types of hypotheses, the meaning of the p-value, matching the test to the hypothesis, test assumptions, testing the assumptions.
Learning Module Five: Advanced Inference
October 30 – November 16
Additional estimation and testing in statistics. This includes hypothesis testing for one-sample t-test, Chi-square test for variance, Binomial test, Wald test for proportions, proportions test, ANOVA, Chi-square goodness-of-fit test, Chi-square test of independence, correlation test, linear regression. It also covers estimation of effects and future values in regression. Assumption tests covered: Shapiro-Wilk test, Fligner-Killeen test.