Free Essay

Non Parametric Tests

In:

Submitted By raja9899
Words 1170
Pages 5
6 - NONPARAMETRIC TESTS FOR COMPARING TWO POPULATIONS

In situations where the normality of the population(s) is suspect or the sample sizes are so small that checking normality is not really feasible, it is sometimes preferable to use nonparametric tests to make inferences about “average” value.

Wilcoxon Rank Sum Test (a.k.a. Mann-Whitney U Test)
This test is an alternative to the two-sample t-test for comparing the “average” value of two populations where the samples from each population are taken independently.

The hypotheses tested can be stated as follows:

[pic] The distribution of population 1 and population 2 are identical. If the populations are symmetric (but not necessarily normal) the null hypothesis can be expressed in terms of the population medians as: [pic]

[pic] The distribution of population 1 and population 2 are different. (two-tailed) [pic]

or

[pic] The distribution of population 1 is shifted to the right of the distribution for population 2, i.e. the population 1 values are generally larger than the population 2 values. (right-tailed) [pic]

or

[pic] The distribution of population 1 is shifted to the left of the distribution for population 2, i.e. the population 1 values are generally smaller than the population 2 values. (left-tailed) [pic]

The tests statistic is based on the sum of the ranks assigned to the observed data from each population when the combined sample is ranked from smallest to largest. We will always assume that the sample size (m) for population 1 is less than or equal to the sample size (n) from population 2.

Example: Anticipated Length of Office Visit and Weight Status of Patients
Researchers wanted to compare the anticipated office visit time for patients whose BMI indicates normal weight vs. those whose BMI indicates the patient is overweight. It is hypothesized that doctors will report a shorter anticipated office visit time for patients who are classified as overweight. Stating this hypothesis in terms of medians the research hypothesis would be that median office visit time for normal weight patients is greater than that for overweight patients.

[pic] vs. [pic]

The data below are the anticipated office visit time (min) for these two groups of patients.

Normal: 20 25 30 35 40 45 50

Overweight: 5 10 15 15 20 30

The sum of the ranked appointment lengths for normal weight patients is: _________.

The sum of the ranked appointment length for overweight patients is: ____________.

The sum of the ranks for the overweight patients is smaller than the rank sum for the normal weight patients but this would be expected even if the null hypothesis were true. Why?

The test statistic,[pic], is the sum of the ranks for population O, the overweight patients. We can use the table on the following page to determine whether to reject the null or not. Intuitively we will reject the null hypothesis if the sum of the ranked appointment lengths for the overweight patients is “small”. The table tells what “small” is for a given significance level ([pic]).

For m = 6 and n = 7 we find the following from the table:

From Wilcoxon Rank Sum Table: 1-tail α ’ .025 α ’ .050 2-tail α ’ .050 α ’ .100 m n W d P W d P
6 7 27 57 7 .0175 29 55 9 .0367

The table says we will reject the null at the [pic]level if:
[pic] for [pic] (
[pic] > 55 for [pic]
[pic]< 27 or [pic] for [pic]

We have evidence to conclude that the anticipated office visit times are generally smaller than the anticipated office times for patients with BMI’s considered normal (p < .05).

WILCOXON RANK SUM TEST IN JMP
[pic]

Wilcoxon Signed Rank Test
This test is an alternative to the paired t-test which is used when we do not wish to assume that the population of paired differences is normally distributed. As with the Mann-Whitney U test, the Wilcoxon Signed-Rank Test use ranks based on the paired differences rather than the actual values.

Example: Effect of Togetherness on the Heart Rate of Rats

|Rat |Alone Rate |Together Rate |di = Ti – Ai |Sign || di | |Rank |di| |Signed Rank |
|1 |463 |523 |60 | | | | |
|2 |462 |494 |32 | | | | |
|3 |462 |461 |-1 | | | | |
|4 |456 |535 |79 | | | | |
|5 |450 |476 |26 | | | | |
|6 |426 |454 |28 | | | | |
|7 |418 |448 |30 | | | | |
|8 |415 |408 |-7 | | | | |
|9 |409 |470 |61 | | | | |
|10 |402 |437 |35 | | | | |

We then calculate [pic] = the sum of the positive signed ranks = _______________

and [pic] = the sum of the negative signed ranks = _______________

Are hypotheses can be stated in terms of the median of the paired differences. Listed below are the hypotheses along with the test statistic based on the signed rank sums used to test it.

Intuitively we will conclude there has been a heart rate increase if….

Details of the Wilcoxon Signed Rank Test

Let [pic]

[pic] vs. [pic] (two-tailed) Test statistic [pic] In practical terms the null says there is no change in the rats heart rate after the change in environment, the alternative says there is a shift up or down in their heart rate.

[pic] vs. [pic] (right-tailed) Test statistic [pic]
In practical terms the alternative says that there is an increase or shift up in their heart rate as the difference is defined to be Together – Alone.

[pic] vs. [pic] (left-tailed) Test statistic [pic]
In practical terms the alternative says that there is a decrease or shift down in their heart rate as the difference is defined to be Together – Alone.

For this example, if had originally hypothesized that the heart rate of a rat will increase when it is placed in a social environment then we have the right-tailed alternative and our test statistic W = _______.

The Wilcoxon Signed Rank Test Table (handed out) give p-values associated with an observed test statistic value w for a given sample size, i.e. number of pairs, n.
[pic]

Here our p-value = ____________, thus we reject the null hypothesis and conclude that the heart rate of a rat will generally increase when it is taken from a solitary confinement and placed in a social environment with other rats.

WILCOXON SIGNED RANK TEST IN JMP

We first use JMP to form the paired differences as we did for the paired t-test.
[pic]

Select Distribution > Test Mean > Enter 0 for the hypothesized value and check the nonparametric test box. [pic]

The results of the test are shown below.
[pic]
Conclusion:
TABLE FOR WILCOXON RANK SUM TEST (Page 1)
[pic]

TABLE FOR WILCOXON RANK SUM TEST (Page 2)
[pic]

TABLE FOR WILCOXON SIGNED RANK TEST (Page 1)
[pic]

TABLE FOR WILCOXON SIGNED RANK TEST (Page 2)
[pic]

-----------------------
Data Table
[pic]

Select Nonparametric > Wilcoxon Test
[pic]

The p-values for the upper-tailed t-Test and the Wilcoxon signed-rank test have been highlighted. The test statistic reported by JMP for Wilcoxon test =[pic]
Why? I don’t know, but we only need the p-value anyway.

Similar Documents

Premium Essay

M&a in Banking Sector

.................................................................................................. 5 2.1. 2.2. 2.3. Mergers and acquisitions activity in recent years .................................................. 5 Overview of efficient market hypothesis .............................................................. 7 Abnormal operating performance ......................................................................... 9 3. Event study approach ............................................................................................... 12 3.1. 3.2. Event study technique ........................................................................................ 12 Test statistics ..................................................................................................... 14 Parametric tests...

Words: 17054 - Pages: 69

Premium Essay

Nonparametric Hypothesis Testing

...research hypothesis from the same provided data sets (Wage and Wage Earners) using ratio or interval numerical data; however, this week we will use a nonparametric hypothesis test to find our answer. In the next following paragraphs, the team will clearly affirm a hypothesis statement that will provide the base for our survey, perform a five-step hypothesis test on information concerning our choice and apply the concepts of nonparametric testing learned in this course, and describe how the results of our findings answer our research question. Finally, we will conclude this study with a brief summary that will examine the main points, the purpose, and conclusions of this final third week’s study on nonparametric testing. Perform the five-step hypothesis test on the data Nonparametric tests are statistical tests that analyze data that does not require assumptions about the distribution of shape of the population from which that data is drawn. These statistical tests should be well thought-out when the data analysis cannot be assumed to have come from any particular distribution. Nonparametrics are often active data that consists of ranks and ordinal data. Usually, nonparametric statistical tests are easier to compute. This is because fewer assumptions need to be complete to use nonparametric tests. They...

Words: 1530 - Pages: 7

Premium Essay

Managment

...WEIGHTED AVERAGE SCORE Table showing weighted average score for elements of work ethics. |Factors |Strongly agree |Agrees |Neutral |Disagree |Strongly disagree| |Performance Appraisal |8 |55 |6 |20 |11 | |Performance Measurement |12 |65 |8 |10 |5 | |Mission statements |10 |33 |15 |40 |2 | |Work culture |35 |40 |5 |18 |2 | |Quality Concerns |5 |28 |1 |32 |34 | Table showing weighted average score for elements of work ethics. |Factors |Strongly agree |Agrees (4) |Neutral (3) |Disagree |Strongly disagree|Total |Rank | | |(5) | | |(2) |(1) | | | |Performance Appraisal |40 |220 |18 |40 |22 |340 = 22.6 |1 | | | | | | | |15 ...

Words: 1188 - Pages: 5

Premium Essay

Parametric and Non-Parametric Statistics Use in Research Methods

... Lilian Otieno, Resident Lecturer I am tasked to distinguish between parametric and non-parametric statistics and explain when to use each method in analysis of data. I shall first seek to define what parametric and non-parametric statistics mean and then compare and contrast them in the analysis of data. Parametric statistics is a branch of statistics that assumes that the data has come from a type of probability distribution and makes inferences about the parameters of the distribution. Most well-known elementary statistical methods are parametric. (According to Wikipedia, the online dictionary). In statistical analysis, parametric significance tests are only valid if certain assumptions are met. If they are not, nonparametric tests can be used. A parameter is a measure of an entire population, such as the mean height of every man in London. In statistical analysis, one practically never has measurements from a whole population and has to infer the characteristics of the population from a sample. Generally speaking parametric methods make more assumptions than non-parametric methods. If those extra assumptions are correct, parametric methods can produce more accurate and precise estimates. They are said to have more statistical power. However, if assumptions are incorrect, parametric methods can be very misleading. For that reason they are often not considered robust. On the other hand, parametric formulae are often simpler to write down and faster to compute. In some,...

Words: 3625 - Pages: 15

Premium Essay

Statistics for Nursing

...Practical 2a Comments on Z-tests and t-tests 1. You should have realized from the lectures that in practice, a z-test is seldom used, while the ‘default’ test for single sample or two-samples mean(s) is the ttest. This is because in most practical situations, the population variance is seldom known and therefore we need to estimate that by the sample variance, thus justifying a t-test rather than a z-test. It is always good to perform the standard exploratory data analysis before commencing any hypothesis testing involving t-tests. It is often useful to check through summary statistics (like the minimum and maximum of the data), as well as a quick plot of the data (box-plots), to check for any problematic data or outliers. The use of a t-test requires the assumption that the data is distributed like a normal distribution – essentially a bell-shaped curve for the histogram. Therefore it is extremely informative to look at the histogram of the data before commencing on testing, as this will indicate whether the use of the t-test is justified. Before commencing any testing, evaluate what are your hypotheses that you are interested in. If you are testing the mean for a single sample, are you testing the mean to be 0, or are you testing the mean against some non-zero value. If so, do remember to change the input in SPSS correspondingly. Similarly if you are testing the means for two samples, are you testing for the difference to be zero, or against a non-zero difference. Essentially...

Words: 755 - Pages: 4

Premium Essay

Demand Forecasting

...within our data and whether statistical significance exists. Inferential statistics help us make these determinations and allow us to generalize the results to a larger population. We provide background about parametric and nonparametric statistics and then show basic inferential statistics that examine associations among variables and tests of differences between groups. Parametric and Nonparametric Statistics In the world of statistics, distinctions are made in the types of analyses that can be used by the evaluator based on distribution assumptions and the levels of measurement data. For example, parametric statistics are based on the assumption of normal distribution and randomized sampling that results in interval or ratio data. The statistical tests usually determine significance of difference or relationships. These parametric statistical tests commonly include t-tests, Pearson product-moment correlations, and analyses of variance. Nonparametric statistics are known as distribution-free tests because they are not based on the assumptions of the normal probability curve. Nonparametric statistics do not specify conditions about parameters of the population but assume randomization and are usually applied to nominal and ordinal data. Several nonparametric tests do exist for interval data, however, when the sample size is small and the assumption of normal distribution would be violated. The most common forms of nonparametric...

Words: 1788 - Pages: 8

Free Essay

Philosophy

...HYPOTHESIS TESTS Lecture Notes Asst. Prof. Jay Kaiser S. Lariosa Non-Parametric Statistics 2 Spearman Rank Correlation Source: Elementary Number Theory 4th ed., by David M. Burton Prepared by Asst. Prof. Jay Kaiser S. Lariosa Pearson’s r 3 • The Pearson product-moment correlation coefficient is designed to measure the strength of the association between two quantitative variables. • The two variables being compared must be measured on either interval or ratio scale. Source: Elementary Number Theory 4th ed., by David M. Burton Prepared by Asst. Prof. Jay Kaiser S. Lariosa Spearman Rank Correlation Coefficient 4 • • Is a nonparametric counterpart of the Pearson’s r. This coefficient measures the extent of association between two variables each measured on an ordinal scale. Source: Elementary Number Theory 4th ed., by David M. Burton Prepared by Asst. Prof. Jay Kaiser S. Lariosa Spearman’s r 5 • The procedure for calculating the Spearman rank correlation coefficient consists of two sets of rankings on the same subjects. • The strength of association between these two rankings is measured by the coefficient. Source: Elementary Number Theory 4th ed., by David M. Burton Prepared by Asst. Prof. Jay Kaiser S. Lariosa 6 Spearman’s rs rs  1  • 6 d 2 n3  n where: • • • • rs = Spearman rank correlation coefficient d2 = squared differences between the two ranks n = number of objects being compared...

Words: 658 - Pages: 3

Free Essay

Event Studies on a Small Stock Exchange

...CAPM model to estimate the returns of stocks. The stock portfolio is composed of 10, 25 and 50 stocks. Because of thin trade stock motivation the trade to trade adjustment was made. The estimation is based on 250 trading days total and 247 of this period corresponds to parameter estimation and the rest, t-1, t, t+1 is used for event study. The parameter estimations were based on OLS with heteroskedasticity correction. The abnormality is detected by analyzing difference between market return and expected return from CAPM model. Returns for event days are the subjects of test statistics. The examined statistics were based on t-test with cross sectional independence, t-test with standardized abnormal return and t-test with adjusted standardized abnormal return. These tests are the parametric tests for abnormality, the authors also conducted non-parametric test such as rank test, sign test and generalized sign test. The event days are specified by simulation and uniform distribution is assumed. After event day specification the impact of 0.5% and 2% are added to abnormal return on the event day. The simulation is repeated 1000 times to achieve a distribution...

Words: 409 - Pages: 2

Premium Essay

Business

...1. 2. QBUS5001 3. QBUSS5002 topic1 - 12 EXCEL 4. word ----- 5. zh.lai@foxmail.com R-XIANG George Jackie HD QBUS5001 ~ Jack 2015/6/7 目录 ........................................................................................................................................................................................... 1 TOPIC 1 ....................................................................................................................................................................................... 4 TOPIC 2 PROBABILITY ........................................................................................................................................................ 4 2.1 EVENTS & PROBABILITIES 2.2 JOINT ............................................................................................................................................ 4 MARGINAL& CONDITIONAL PROBABILITIES 2.3 PROBABILITIES TREES ....................................................................................... 4 .......................................................................................................................................................... 4 2.4 BAYER’S THEOREM ......................................................................................................................................................................... 4 2.5 POPULATION MEAN & VARIANCE ................

Words: 1122 - Pages: 5

Premium Essay

Spss

...obtained. The more practice you get with this process, the easier it will be for you to write up the results of your analyses. Details on how to structure a report are available on AUTonline. Part A 1. A market researcher is interested in the coffee drinking habits of males and females. He asks a sample of male and female office workers to record the number of cups of coffee they consume during a week. (a) Which parametric statistical technique could the researcher use to determine if males and females differ in terms of the number of cups of coffee consumed in a week? Justify your answer and describe how you would obtain this statistic using SPSS. Independent-samples t-tests (b) What are the key values you would look for in the output? (c) What assumptions should you check for when using the technique that you chose in question 2(a) above. Interval scaled data with normally distributed scores Random sampling data (d) What non-parametric technique (chapter 16) could be used to address this research question? (p109) Mann-Whitney U Test 2. The following output was obtained using SPSS....

Words: 369 - Pages: 2

Free Essay

Analyze the Results of Linker Scale

...EDITORIAL Analyzing and Interpreting Data From Likert-Type Scales L ikert-type scales are frequently used in medical education and medical education research. Common uses include end-of-rotation trainee feedback, faculty evaluations of trainees, and assessment of performance after an educational intervention. A sizable percentage of the educational research manuscripts submitted to the Journal of Graduate Medical Education employ a Likert scale for part or all of the outcome assessments. Thus, understanding the interpretation and analysis of data derived from Likert scales is imperative for those working in medical education and education research. The goal of this article is to provide readers who do not have extensive statistics background with the basics needed to understand these concepts. Developed in 1932 by Rensis Likert1 to measure attitudes, the typical Likert scale is a 5- or 7-point ordinal scale used by respondents to rate the degree to which they agree or disagree with a statement (T A B L E). In an ordinal scale, responses can be rated or ranked, but the distance between responses is not measurable. Thus, the differences between ‘‘always,’’ ‘‘often,’’ and ‘‘sometimes’’ on a frequency response Likert scale are not necessarily equal. In other words, one cannot assume that the difference between responses is equidistant even though the numbers assigned to those responses are. This is in contrast to interval data, in which the difference...

Words: 1356 - Pages: 6

Free Essay

Sosiology

...RSSMAIN and RSSTMT → Calculate but don’t use in ANOVA * Also need RSSBLOCKS, RSSPP, and RSSM (CT and RSSTOTAL) * F values are calculated using the error from the same block * For t-test * Standard errors: * Error (b)n for interaction 9.78583 * 2 × Error (b)n for Factor M 2 × 9.78586 * 2 × Error (a)n for Factor PP 2 × 4.96756 * 2 critical-t values → t at 2 and t at 4 df i.e. 4.303 and 2.776 * Could ask: do ANOVA and t-test, or ANOVA and interpret results from F; Standard error for the difference (a or b); Conclusion: levels differ/do not differ at 1% etc. NS 13 – Non-parametric tests * Parametric tests for data with normal distribution (t, F or X2 distribution) * Non-parametric tests for * Categorical data, * Quantitative data divided into class intervals, * Small data sets, * Data sets without repetition of the TMTs. * Non- parametric tests * Medians, not Means * Usually rank your data * Single sample: * Sign test (No assumptions about distribution) * Rank test (assumes data comes from symmetrical distribution) * Wilcoxon’s symmetry test * For 2 independent samples * Mann-Whitney U test (Assumes distributions have same shape and equal...

Words: 971 - Pages: 4

Premium Essay

Patho

...FAKULTI PERUBATAN UNIVERSITI KEBANGSAAN MALAYSIA IJAZAH SARJANA MUDA SAINS PERUBATAN KECEMASAN DENGAN KEPUJIAN SEMESTER II SESI 2015/2016 FFEP 2422-BIOSTATISTIK SINOPSIS KURSUS: Objektif kursus ini adalah untuk menyediakan pelajar yang berkebolehan berfikir secara kritis dan boleh menggunakan pengetahuan yang diperolehi untuk penyelidikan. Pelajar akan didedahkan dengan statistik terutama pengunaan SPSS dalam analisa data kajian. OBJEKTIF: Pada akhir kursus ini calon semestinya berupaya: 1. Mengetahui penggunaan statistik di dalam bidang penyelidikan 2. Mengolah dan menyaji data. 3. Menguasai penggunaan SPSS dalam menganalisa data klinikal 4. Menggunakan kaedah mengikhtisarkan data yang sesuai. 5. Menggunakan ujian statistik yang sesuai untuk menganalisa data kualitatif dan kuantitatif. 6. Menggunakan ujian tak berparameter sesuai dengan ciri-ciri data yang diperolehi. CARA PENGAJARAN Kursus akan dijalankan secara kuliah, tutorials, tugasan dan praktikal di makmal komputer. ANGGOTA PENGAJAR: Penyelaras: Dr Mohd Rohaizat Hassan ( 012-6343303/ rohaizat@ppukm.ukm.edu.my) Pensyarah: Dr Mohd Rohaizat Hassan Dr Nazarudin Safian PENILAIAN Kursus akan dinilai secara penilaian berterusan dan peperiksaan akhir. Penilaian berterusan(40%) :1 = Ujian mini (20%) :1 = Computer based exam (20%) Peperiksaan akhir (60%) :Ujian bertulis (40%) :OBA (20%) RUJUKAN 1. Kirkwood B.R(1988) Essentials of Medical Statistics. Blackwell...

Words: 454 - Pages: 2

Premium Essay

Essay on Research Critique Part 2

...utilize both descriptive and inferential statistics to scrutinize the data, the reason for the study was to evaluate the difference among the two groups, for the inferential statistics the McNemar test was used, which was suitable for the degree of enquiry due to the matched group, the Cohort design was also implemented as they intended to quantify between groups. Also as relates to the variables outcome. (Polit & Beck 2008). The paired T test was also proper for the study due to the fact its purpose is to test the differences among the two groups that are either paired or matched on the essence of the characteristics. The F test was also implemented, it occurs as the test used when apply multiple linear regression as this F statistics has been utilized while stipulating the influence between birth weight and preterm delivery (p1054). The purpose of linear regression is to describe the amount of the outcome variable is distinct to the independent variable (Burns & Groove, 2007) Descriptive statistics were used by the author to relate the dispensing of the statistical data among topics such as age, race, and parity, they also possess illustrative analysis to explain the issues of statistical variables among the distributing of outcomes variables among the specimens. A parametric as well as...

Words: 635 - Pages: 3

Premium Essay

Investigating Effect of Emotion Dection

...differences effect on emotion detection for the over and under forties using Baron Cohen's “Eyes Test” (Baron-Cohen, Wheelwright & Hill, 2001) Abstract: This research focuses on the area of emotion detection in psychology. Independent variables being tested is the participants’ age [over and under 40 years of age]. Dependent variables being tested is participants’ scores in the eyes test. Participants were picked from a convenience sample with a large sum selected from adults attending Buckingham University Previous research suggests that there is a steady decline in emotion detection from a young age into elderly age. The key prediction is that under 40s will score better on the eyes test revised for adults than the over 40s. Results yielded no difference in emotion detection via the eyes test revised for adults between the over and under 40s. Introduction: Theory of mind, which is the ability to relate independent mental states to self and others to predict and or explain behavior, has long been researched since 1983 by Wimmer and Perner (1983). Since then their methods have been improved and altered in order to better understand theory of mind. Initially Baron–Cohen and colleagues (1985) simplified the test procedure from Wimmer’s (1983) research, then once more in 2001 where issues with data collection, number of items and responses in the test were all addressed. The revised eyes test for adults consisted of each participant identifying the gender and emotion description from...

Words: 2670 - Pages: 11