Data Analysis Normal Distribution

Essay by Sher Maine • June 3, 2017 • Coursework • 3,023 Words (13 Pages) • 1,998 Views

Essay Preview: Data Analysis Normal Distribution

prev next

Page 1 of 13

Calculate at least 2 measures of central tendency and at least 2 measures of variation for your Continuous Variable sample data. Explain the results and meaning of these measures in your report. Include a summary of the data in your report using a graph or table. Use Excel for this task.

Central Tendency

Central tendency is a single value that describes the mean , median or mode in a set of data. Each of these measures have different calculation for different purposes. https://statistics.laerd.com/statistical-guides/measures-central-tendency-mean-mode-median.php

For example people want to compare their quiz scores to other students’ scores so the central tendency is important in comparing their score to the class distribution’s scores.

Mean

Also know as the average , this measure is the most widely used in central tendency for discrete and continuous variables. The mean value is obtained by dividing the sum of all the values in the data by the number of values in the data set.

However , the disadvantage in mean is that its values can be affected by outliers whose numerical values are either extremely small or large.

https://statistics.laerd.com/statistical-guides/measures-central-tendency-mean-mode-median.php

Mode

The mode is the most frequently occurring value. It is used to uncover the most common occurring digit or category. For example , in a histogram , the tallest category will be the mode. The mode is useful in categorical data , however , when used in continuous data such as interval and ratio scales , there may not have any data points of the same value due to decimal places so mode may not exist there.

http://www.quickmba.com/stats/centralten/

Median

The median is middle set of value obtained from an arrayed data set from lowest to highest values. The median is often used because it is not affected by skewness and outliers that could distort the mean . For example , when surveying the salary of a group of working adults , if the wages of two people are 5 times the mean , the unusually large salary will affect the mean salary making the values uneven and higher so in such a scenario , the median is a better representation of the group’s salary level. http://www.quickmba.com/stats/centralten/

At least 2 measures of variation for your Continuous Variable sample data. (Standard deviation included in normal distribution paragraph.)

Skewness

Skewness determines how symmetrical is the distribution. A positive skew has higher values with a long tail to the right while a negative skew has lower values with a long tail to the left. A symmetrical distribution has a skewness of 0 which is important because it is used to identify normal distribution.

https://www.graphpad.com/guides/prism/6/statistics/index.htm?stat_skewness_and_kurtosis.htm

Kurtosis

Kurtosis measures the peak and whether the data is light-tailed or heavy-tailed when comparing with a normal distribution. A normal distribution can be identified with a kurtosis value of 0.

Positive and negative kurtosis deviates from the value 0. A positive kurtosis like data from a t distribution has postive values with a heavier tail and sharper peak compared to normal distribution like data. A negative kurtosis has negative values with a flatter peak and lighter tails when compared with normal distribution.

http://support.minitab.com/en-us/minitab/17/topic-library/basic-statistics-and-graphs/summary-statistics/how-skewness-and-kurtosis-affect-your-distribution/

Range

The range is the value obtained by deducting the highest score from the lowest score in a data set to find out how spread out is the data values are. A measure of the spread indicates if the mean is accurate in representing the data. If the spread of data is large, the mean is not useful to represent the data because a large spread means there are large differences between individual values. Thus, range comes hand in hand with mean to determine which measure is best to accurately interpret the statistics calculation.

https://statistics.laerd.com/statistical-guides/measures-of-spread-range-quartiles.php

Continuous Random Variable (CRV) CRV (table 1&2)

CRV can take on infinitely any values in an interval. And the CRV values are normally measured not counted so it tends to have decimals and is used to model characteristics like time, weight and length. http://dept.stat.lsa.umich.edu/~ionides/425/notes/continuous_rvs.pdf

Our CRV will be the length of time in the sauna to get a rough estimate of how long does each person stay in the sauna.

Q2. Discrete Random Variable DRV (TABLE 3&4) - determine the number of usage between M/F

Definition: A discrete variable is a variable that gets its value through counting distinct values (whole numbers) like 0,1,2,3,4 . http://www.stat.yale.edu/Courses/1997-98/101/ranvar.htm

Our discrete random variables will be the number of males and females and we would be using binomial distribution as explained below.

Binomial

A binomial distribution is a discrete probability distribution.

http://www.r-tutor.com/elementary-statistics/probability-distributions/binomial-distribution

This is an experiment with a fixed number of independent trials that has two mutually exclusive outcomes normally either success or failure. The probability of each outcome from trial to trial remains constant because it is independent. https://people.richland.edu/james/lecture/m170/ch06-bin.html Independent means each trial has no effect on any of the rest. http://statistics.about.com/od/ProbHelpandTutorials/a/When-Do-You-Use-A-Binomial-Distribution.htm

Why use Binomial?

The binomial method is a specific probability distribution to estimate the percentage of success or failure in the outcome when counting our discrete variable. It is relevant in our experiment because we are determining the usage between males and females to calculate the success and failure rate of the saunas.

...

Download as: txt (19.8 Kb) pdf (481.5 Kb) docx (1 Mb)

Continue for 12 more pages »

Read Full Essay Save

Only available on OtherPapers.com

Similar Essays

Trend & Data Analysis

Trend and Data Analysis By: Yuleidis Ramos EDD 9200-M14 Trends and Issues Nova Southeastern University June 19, 2011 Introduction What does the future hold for

1,316 Words | 6 Pages
Steps Required to Understand Educational Research - Methodological Approach, the Research Design, the Sample, Data Collection Data Analysis, Ethical Issues, Dissemination and the Audience

Introduction In this essay, my aim is to examine the steps required to understand educational research. These steps consist of the methodological approach, the research

4,872 Words | 20 Pages
Critique of the Data Analysis: Kraut/kiesler

Critique of the Data Analysis: Kraut/Kiesler study. Though it has been many years since the Kraut/Kiesler study was conducted, the spirit of the study still

844 Words | 4 Pages
Financial Data Analysis Pfch

Financial Data Analysis Patton-Fuller Community Hospital (PFCH) was established in 19 to provide health care services to the members and residents of the Kelsey Community(PFCH

754 Words | 4 Pages
Real Data Analysis

A1: What were the results in terms of the mean? The coffee Group mean was 22.5 and the Placebo Group was 36.4. The mean difference

3,385 Words | 14 Pages
Patton Fuller Community Hospital - Financial Data Analysis

Financial Data Analysis Tim Anderson HCS/577 December 10 2013 Sharon Gomes-Sanders Financial Data Analysis This writer will analyze Patton Fuller Community Hospital, a for-profit organizations

736 Words | 3 Pages
Biostatistics - Data Analysis and Presentation

Data analysis and presentation Students were given two (2) sets of data to analyze. Both sets of data were answer sheets of students from high

512 Words | 3 Pages
Knn Model Data Analysis

1. Executive Summary After using a robust method to fill the blanks in the dataset, we got the opportunity to fully utilize the dataset that

1,340 Words | 6 Pages