Free Essay

Spearman

In:

Submitted By balot
Words 1265
Pages 6
Spearman’s Rank Correlation Coefficient – Excel Guide

What is Spearman’s Rank correlation coefficient? Spearman’s Rank correlation coefficient is used to identify and test the strength of a relationship between two sets of data. It is often used as a statistical method to aid with either proving or disproving a hypothesis e.g. the depth of a river does not progressively increase the further from the river bank. The formula used to calculate Spearman’s Rank is shown below.
NB. Sometimes n³ - n is written as n (n² -1). Both mean the same thing.

How can the calculation be carried out in Excel? Once the data has been collected, Excel can be used to calculate and graph Spearman’s Rank correlation to discover if a relationship exists between the two sets of data, and how strong this relationship is. Please note this example uses a dataset of 10 samples, but your dataset should include a minimum of 15 to be valid.

Step 1: Create a table in Excel and enter your data sets. Width (cm) 0 50 150 200 250 300 350 400 450 500 Width Rank Depth (cm) 0 10 28 42 59 51 73 85 104 96 Depth Rank

Sample 1 2 3 4 5 6 7 8 9 10
1

1

Data for this example has been taken from www.geography-site.co.uk/pages/skills/fieldwork/stats/spear.html

Step 2: Rank each set of data (width rank and depth rank). Rank 1 will be given to the largest number in column 2. Continue ranking till all widths have been ranked. Once all the widths have been ranked then do exactly the same for depth. Width (cm) 0 50 150 200 250 300 350 400 450 500 Width Rank 10 9 8 7 6 5 4 3 2 1 Depth (cm) 0 10 28 42 59 51 73 85 104 96 Depth Rank 10 9 8 7 5 6 4 3 1 2

Sample 1 2 3 4 5 6 7 8 9 10

If there are two samples with the same value, the mean (average) rank should be used – for example if there were 3 samples all with the same depth ranked 6th in order you would add the rank values together (6+7+8=21) then divide this number by the number of samples with the same depth number, in this case 3 (21/3=7) so they would all receive a rank of 7. The next greatest depth would be given a value of 9.

Step 3: The next stage is to find d (the difference in rank between the width and depth). First, add a new column to your table, and then calculate d by subtracting the depth rank column (column 5) from the width rank column (column 3). For example, for sample 6 width rank is 5 and the depth rank is 6 so d = 5 – 6 = -1. To calculate d in Excel, select the cell you wish to enter the information into and type =. Now click on the width rank cell you want to use and type –. Finally, click on the depth rank cell and press enter. The value of d should appear in the first box you selected. Width (cm) 0 50 150 200 250 300 350 400 450 500 Width Rank 10 9 8 7 6 5 4 3 2 1 Depth (cm) 0 10 28 42 59 51 73 85 104 96 Depth Rank 10 9 8 7 5 6 4 3 1 2

Sample 1 2 3 4 5 6 7 8 9 10

d 0 0 0 0 1 -1 0 0 1 -1

Step 4: The next step is to calculate d². Add another column to your table and label it. To calculate d² type in the first cell =POWER(number,power). In this case the number is the value of d and the power is 2 as we are trying to find the square value e.g. for sample 6 the value of d is -1 so you would enter into the cell =POWER(-1,2) then press enter and the value you should get is 1.

Sample 1 2 3 4 5 6 7 8 9 10

Width (cm) 0 50 150 200 250 300 350 400 450 500

Width Rank 10 9 8 7 6 5 4 3 2 1

Depth (cm) 0 10 28 42 59 51 73 85 104 96

Depth Rank 10 9 8 7 5 6 4 3 1 2

d 0 0 0 0 1 -1 0 0 1 -1



1

Repeat the same process until all of your samples have a value of d². Once all the d² values have been calculated add them together to calculate d². The quickest way to do this in Excel is to click on the cell underneath your last entry into the d² column, click on the autosum symbol (which you can find on the tool bar at the top of the page), and press enter. (Depending on which version of Excel you are using, you may have to select the column you wish to add together before you press enter.) Step 5: Now we have the d² values, but to complete the equation we still need to calculate n ³ - n. n is the number of samples, so in this case is 10. As in step 4, type into the cell you wish to use =POWER(number,power) which will give you a value for n³. Remember this time that ‘number’ is the number of samples and ‘power’ is 3 as you are cubing not squaring the value. Once n³ has been calculated, subtract the value of n from it. Width (cm) 0 50 150 200 250 300 350 400 450 500 Width Rank 10 9 8 7 6 5 4 3 2 1 Depth (cm) 0 10 28 42 59 51 73 85 104 96 Depth Rank 10 9 8 7 5 6 4 3 1 2

Sample 1 2 3 4 5 6 7 8 9 10

d 0 0 0 0 1 -1 0 0 1 -1 d² n n³ n³-n

d² 0 0 0 0 1 1 0 0 1 1 4 10 1000 990

Step 6: All that is left to do now is to insert the values into the equation to calculate r.

The formula you should enter into a cell in this case would be =1-((6*4)/990) (N.B * is the symbol for multiply) End result The result at the end should always be between the value of +1 and -1. In this case the value is 0.9757 to 4 decimal places, or 0.98 to 2 decimal places. Step 7: To see if this r value is significant, a Spearman’s Rank significance table or graph must be used. To do this you need to calculate the degrees of freedom. This is calculated using the sum n - 2, which in this example = 8 (remember n is the number of samples). Using this number and your value for r you can use the table below to work out the significance level of your data (the yellow lines).

2

As you can see the yellow line meets the red line at 0.1% meaning that there is a greater than 99% chance that the relationship is significant and not random. Thus we can reject the hypothesis and conclude that in this case the depth of a river does progressively increase the further the distance from the river bank.

2

Table taken from http://geographyfieldwork.com/SpearmansRankSignificance.htm

Similar Documents

Free Essay

Charles Spe

...Charles Spearman has been a strong influence throughout the psychology world. Working with powerful people he studied correlation and statistics. He started out differently than most psychologists but made his name known. Charles was born on London on September 10th, 1863 into a respected family. Charles Spearman was the second son of Alexander Young Spearman. Alexander died at the age of 33 in 1865. His mother, Louisa Ann Caroline Amelia Spearman remarried to Henry Seel who taught in London. But, eventually, his mother became widowed again. In 1901, Spearman married Fanny Aikman, with whom he had a son and four daughters with but his son inevitably died in World War 1. Even though Spearmen excelled in mathematics and science, his heart belonged to philosophy which would later turn into psychology. He went to Leamington College and after that, enrolled into the Royal Engineers of the British Army in 1988. He was awarded many medals throughout his 15 years in the Army but always believed that life’s problems could be solved using psychology . He studied abroad because at the time psychology was thought of as philosophy in Great Britain. He eventually got his PhD in experimental psychology. He chose to study in Leipzig under William Wundt, but also worked together with Oswald Kaulpe (1862-1915) at the University of Waurzburg and with Georg Elias Mauller (1850-1934) at the University of Gaottingen. Spearman was soon back in action though, because of the South African War or...

Words: 746 - Pages: 3

Premium Essay

Outcomes Assessment Essay

...Final Exam: Outcomes Assessment Essay Mary Smith July 4, 2010 Dr. Marsha Riggio - Instructor Strayer University Final Exam: Outcomes Assessment Essay Quote #0 – Peter (sample for you, so delete this when done) Psychological Concepts in Quote Peter’s quote describes how his Uncle Tex reacted to learning that he had a terminal illness. This relates to the stages of dying proposed by Elizabeth Kubler-Ross. When Tex first gets the news, he refuses to acknowledge the situation, which corresponds to Kubler-Ross’s stage of “Denial”. Later, Tex becomes very difficult, lashing out at his family, which corresponds to Kubler-Ross’s stage of “Anger”. Thus, Peter’s description of Tex’s experience goes along with the first two of Kubler-Ross’s stages of dying. General Perspective of the Concept Kubler-Ross’s stages of dying theory falls under the general perspective of developmental approaches to psychology. Like several other developmental theories, it is a stage theory. Kubler-Ross proposes that there are five stages to the process of dying: denial, anger, bargaining, depression and acceptance. This is a true stage theory which holds that persons go through this series of stages in this particular order. At first, the dying person tries to evade the reality of the diagnosis (denial). As the reality of the diagnosis sinks in, the dying person tends to lash out at family and caregivers (anger). Subsequently, the dying person develops the hope that death can...

Words: 881 - Pages: 4

Premium Essay

General Intelligence

...level of intelligence of children found in the schools. The purpose of this test was to find if the children will perform the intelligence tests designed for their age and if they are up to the level of intelligence they should be at their age. Another important matter of this tests was to find that if there are any children that did not reach the level of intelligence they should have and if those need special education. Therefore in 1905 Binet and Theodore Simon created the Binet-Simon scale, which was the first intelligence test, actually was the base on the development pyramid of intelligence tests.(Maltby et. al. 2010). Charles Spearman (1940, 1927) introduced one of the most important ideas in psychology, and probably his most important contribution to psychology, the general intelligence or the “g” factor.(Maltby et. al. 2010). Spearman stated that if a person is doing well on one...

Words: 1198 - Pages: 5

Premium Essay

Demand Forecasting

...3.5 Inferential Statistics: The Plot Thickens We have been talking about ways to calculate and describe characteristics about data. Descriptive statistics tell us information about the distribution of our data, how varied the data are, and the shape of the data. Now we are also interested in information related to our data parameters. In other words, we want to know if we have relationships, associations, or differences within our data and whether statistical significance exists. Inferential statistics help us make these determinations and allow us to generalize the results to a larger population. We provide background about parametric and nonparametric statistics and then show basic inferential statistics that examine associations among variables and tests of differences between groups. Parametric and Nonparametric Statistics In the world of statistics, distinctions are made in the types of analyses that can be used by the evaluator based on distribution assumptions and the levels of measurement data. For example, parametric statistics are based on the assumption of normal distribution and randomized sampling that results in interval or ratio data. The statistical tests usually determine significance of difference or relationships. These parametric statistical tests commonly include t-tests, Pearson product-moment correlations, and analyses of variance. Nonparametric statistics are known as distribution-free tests because they are not based on the assumptions...

Words: 1788 - Pages: 8

Premium Essay

Mth410

...Descriptive statistics: Opening Gross: Several central tendency measures (i.e., mean, median, and mode) are calculated for opening gross in the Excel file. As can be seen in the below histogram, opening gross has a right tail (i.e., skewed to the right), so median would be a more appropriate central tendency measure than the mean. The median opening gross was 0.39; 50% of the opening gross values were less than 0.39, 50% were above 0.39. The skewness of opening gross was 3.43 indicating a right tail. The kurtosis of opening gross was 13.81 indicating a leptokurtic distribution. The range of opening gross was 108.43, from 0.01 to 108.44. The standard deviation of opening gross was 18.87. However, because the distribution was skewed, interquartile range (IQR), which is the range for the middle 50% of the values, would be a more appropriate measure of variability. The interquartile range for opening gross was 12.37. Opening Gross Outliers: Based on the box-plot, extreme values were 108.44 (Star Wars: Episode II), 102.69 (Harry Potter and the Goblet of Fire), 77.06 (War of the Worlds), 50.34 (Mr. and Mrs. Smith), 48.75 (Batman Begins), and 33.90 (Wedding Crashers). It is also suggested that values that are less than z-score of -3 or larger than z-score of +3 should be considered outliers. Using that criteria 77.06 (War of the Worlds), 102.69 (Harry Potter and the Goblet of Fire), and 108.44 (Star Wars: Episode II) can be considered outliers. Total Gross: Several central tendency...

Words: 1184 - Pages: 5

Free Essay

Philosophy

...HYPOTHESIS TESTS Lecture Notes Asst. Prof. Jay Kaiser S. Lariosa Non-Parametric Statistics 2 Spearman Rank Correlation Source: Elementary Number Theory 4th ed., by David M. Burton Prepared by Asst. Prof. Jay Kaiser S. Lariosa Pearson’s r 3 • The Pearson product-moment correlation coefficient is designed to measure the strength of the association between two quantitative variables. • The two variables being compared must be measured on either interval or ratio scale. Source: Elementary Number Theory 4th ed., by David M. Burton Prepared by Asst. Prof. Jay Kaiser S. Lariosa Spearman Rank Correlation Coefficient 4 • • Is a nonparametric counterpart of the Pearson’s r. This coefficient measures the extent of association between two variables each measured on an ordinal scale. Source: Elementary Number Theory 4th ed., by David M. Burton Prepared by Asst. Prof. Jay Kaiser S. Lariosa Spearman’s r 5 • The procedure for calculating the Spearman rank correlation coefficient consists of two sets of rankings on the same subjects. • The strength of association between these two rankings is measured by the coefficient. Source: Elementary Number Theory 4th ed., by David M. Burton Prepared by Asst. Prof. Jay Kaiser S. Lariosa 6 Spearman’s rs rs  1  • 6 d 2 n3  n where: • • • • rs = Spearman rank correlation coefficient d2 = squared differences between the two ranks n = number of objects being compared...

Words: 658 - Pages: 3

Premium Essay

Ergg

...known and the sample size is large.  Source:  http://www.investopedia.com/terms/z/z-test.asp#ixzz2LEqfeJnN | IV – NominalDV – Interval | | 2. T – test | to compare the means when the population mean is known but the population variance is unknown.Also when the population standard deviation is unknown but the sample standard deviation can be computed.Source:Basic Statistics Book | OrdinalInterval | | 3. F – test | used when comparing statistical models that have been fitted to a data set, in order to identify the model that best fits the population from which the data were sampled.Source:http://en.wikipedia.org/wiki/F-test | Ordinal Interval | | 4. Spearman rank | measures the strength of association between two ranked variablesSource:https://statistics.laerd.com/statistical-guides/spearmans-rank-order-correlation-statistical-guide.php | NominalOrdinal | | 5. Pearson R | used in the sciences as a measure of the strength of linear dependence between two variables.Source:http://en.wikipedia.org/wiki/Pearson_product-moment_correlation_coefficient | Interval | | 6. Chi – square | to test the difference between an actual sample and another hypothetical or previously established distribution such as that which may be expected due to chance or probabilitycan also be used to test differences between two or more actual samplesSource:http://www.okstate.edu/ag/agedcm4h/academic/aged5980a/5980/newpage28.htm | NominalOrdinal | | 7. Anova |  useful in...

Words: 257 - Pages: 2

Premium Essay

Theories of Intelligence

...Running head: THEORIES OF INTELLIGENCE Theories of Intelligence Anthony Dillon The University of Southern Queensland Abstract The concept of intelligence is regarded by psychologists as one of the more important individual difference measures that support predictions regarding a person’s potential for academic and other achievements in life. But what exactly is intelligence? Despite its importance, it is an entity that is not easily defined. To help define intelligence, researchers have developed theories that attempt to more fully explain its dynamic nature. This essay critically evaluates some of the well-known theories of intelligence: Spearman’s two-factor theory, Thurstone’s theory of primary mental abilities, Gardner’s theory of multiple intelligences, and Sternberg’s triarchic theory. The theories are discussed in terms of their relevance to the study of intelligence as well as their strengths and weaknesses. While none of the theories presented can specifically pinpoint what intelligence is, each theory is useful in revealing a different face of this important but elusive human characteristic. Theories of Intelligence Although intelligence is a long-standing topic in psychology (Laboratory of Comparative Human Cognition, 1982), a satisfactory definition of intelligence remains elusive (Carroll, 1993; Fogarty, 1999). However, several descriptions have been offered that allow a brief insight into what intelligence may be related to. For example, Eysenck...

Words: 2098 - Pages: 9

Premium Essay

Statistics

...Chapter 22 Correlation Coefficients 22 Correlation Coefficients The Meaning of Correlation Correlation and Data Types Pearson’s r Spearman rho Other Coefficients of Note Coefficient of Determination r2 The concept of correlation was introduced in Chapters 1 and 5. Our focus since Chapter 16 has been basic statistical procedures that measure differences between groups -- one-sample, two-sample, and k-sample tests. Now we turn our attention to basic statistical procedures that measure the degree of association between variables. Dr. Wesley Black studied the relationship between rankings of selected learning objectives in a youth discipleship taxonomy between full-time church staff youth ministers and seminary students enrolled in youth education courses at Southwestern Seminary.1 Questionnaires were returned by 318 students and 184 youth ministers.2 Ten objectives in each of five categories (Personal Ministry, Christian Theology and Baptist Doctrine, Christian Ethics, Baptist Heritage, and Church Polity and Organization) were ranked by these two groups. The basic question raised by Black in this study was whether students prioritized discipleship training objectives for youth in the same way as full-time ministers in the field. Using the Spearman rho correlation coefficient, Black found the correlations of rankings generated by students and ministers of the ten items for each of five categories were as follows: Personal Ministry, 0.915; Christian Theology...

Words: 3527 - Pages: 15

Premium Essay

Intelligence

...to compose a sonata but unable to solve a verbal reasoning problem. (Bukato, Daehler 2004 p.348) In order to further understand intelligence one must have a foundation for the various models’ and theories that are widely accepted and used to measure this field. One of the methods for understanding intelligence is the psychometric model (Binet-Simon 1905), which is based on testing large groups of individuals to quantify differences in abilities. The basic assumption with this model is that some people will perform better than others and that those who perform below some average or normative level are less intelligent, in-turn, those who perform above that level are more intelligent. (Bukato, Daehler 2004 p.348) Charles Spearman (1904) believed that intelligence consisted of two parts: General Intelligence (Mental or g) and Specific Intelligence...

Words: 1566 - Pages: 7

Premium Essay

Blog It

...2. According to Stella Cortell the first 3 views of intelligence are: i. Intelligence is a “general, underlying cleverness which is fixed for life”. ii. There can be multiple intelligences, not just one general intelligence. iii. Intelligence can be developed. a) 1st View of intelligence. Intelligence is a general underlying cleverness which is fixed for life. This view was supported by early psychologist who believed that each and every person has a general level of intelligence. This was known as the IQ in full the intelligence Quotient. The Psychologist who supported this included Charles Spearman An Terman. They believed tht Intelligence was single fixed. Charles Spearman for example created a theory called G. The g meant a particular quantity that was obtained after carrying out statistics. This came to be in that when an individual is given a mental test to answer. The mental test of a person can be divided into two factors one of which can be divided into a general same for all individuals known as the general factor that is the G and a difference in answers called the specific factor. G is in the normal course of events determined innately; a person can no more be trained to have it in higher degree than he can be trained to be taller. b) 2nd View of intelligence. There is multiple intelligence, not one general intelligence. This according to Thurstone, he did an experiment involving several college students. He tried to prove that there was no evidence of any...

Words: 457 - Pages: 2

Premium Essay

Geography Skills

...Answer:The location of our fieldwork is in the Lake District and we are studying the Torver Beck, the river found in the south of the Lake District. The long profile of the river is 600m and falls by 550m after 5km from source to mouth. The area was formed by glaciation 18,000 years ago which has left a U-Shaped Valley. This means it was suitable for our investigation because it has a relatively short long profile (5km) and making it easier to access the river. Due to the Torver Beck's natural history and its surrounding human influences, it is good to see what these factors have on channel characteristics The Area of Study (2) 2 of 16 1 of 16 The Area of Study (1) The investigation was carried out in the Lake District (North-West England). It contains mainly rural characteristics (such as country roads). It also has an older, more sparsely populated area with its main employment in agriculture or tourism. Our investigation, more specifically, will be carried out on the Torver Beck River. Which has an altitude of 600m in the highlands and drops by 550m as you near the coast. There are many human influences onto the river; mainly the slate quarry, the regions sheep farming and Coniston Town's Settlement Question - Describe the location of your fieldwork and explain why it was suitable for your investigation Why was it suitable for our investigation? 1) Relatively safe to access the river, 2) Relatively short long profile, 3) Relatively easy to reach the...

Words: 2024 - Pages: 9

Premium Essay

Product Flow Control Using Kanban Signals

...Abstract: Kanban is a Japanese term which means “visible record”. This is the term used in production and inventory control systems and methods which help us find out the answer or questions like: what parts to manufacture, when to start manufacturing, when to stop manufacturing, how many to manufacture and where to deliver them to. Kanban works on a pull systems to automatically schedule more production without intervention of Supervision nor Planning & Scheduling Department personnel. Kanban works for higher volume , repetitive products where setup time is short and there is great deal of demand certainty. A pull system has advantages over push system, 1:It can be observed easily unlike push system 2: Efficiency: Pull system is efficient as compared to push system, 3:Variability: There is less variability in a pull system as compared to a push system because a pull system regulate the fluctuation of WIP levels while a push system does not, 4: Robustness: A pull system is less sensitive to error than a push system is. The main focus of the article is on maintaining a constant flow of production using kanban signals and hence minimize waste of excessive inventories where no production will be initiated before a specific customer order into picture. INTRODUCTION: A Kanban inventory is often seen in accordance with Lean Manufacturing and Just-In-Time philosophies. The purpose of a Kanban inventory system is to have minimum inventory levels, and to keep...

Words: 1504 - Pages: 7

Premium Essay

The Effects of Buffer Patterns on Throughput in Conwip Flow Lines: a Simulation Study

...insignificant differences among buffer patterns. When CV = 0.50, ANOVA tests indicated significant differences (p < 0.01) among buffer patterns at all PC levels. The important suggestion is that appropriate allocation of limited buffer capacity, an addition of protective capacity, and a reduction of system variation could increase the throughput rates to the desired level. Keywords: Release mechanism, CONWIP, buffers, protective capacity, bottleneck Introduction An important factor for achieving target throughput rate of production lines is the method of controlling the release of new units into the production system called a release mechanism. One recent release mechanism which has been of interest to researchers is CONWIP, first introduced by Spearman et al. (1989). Under the CONWIP release mechanism, as a production unit is...

Words: 4240 - Pages: 17

Premium Essay

Xxsf

...BEC1 STUDY GUIDE INTRODUCTION (CHAPTER 1 – MUNRO E-BOOK) Know the definition of population, sample, parameter, & statistic Be able to identify and/or provide examples of descriptive statistics & inferential statistics Know the properties of & be able to identify or provide examples of quantitative vs. categorical variables BASIC CONCEPTS (CHAPTER 2 – MUNRO E-BOOK) Know the definition of data, individuals, variables, independent variable, dependent variable, random assignment, treatment group, and control group. Know the properties of the 4 levels of measurement (nominal, ordinal, interval, ratio) Know the properties of discrete and continuous variables Know and understand the properties that distinguish experimental methods from correlational methods DISPLAYING DATA (CHAPTER 2 – MUNRO E-BOOK) Know what a distribution is and why examining a distribution can be helpful/useful Know how to interpret information from: Simple frequency distributions (grouped & ungrouped*) Relative frequency distributions (proportions* & percents*) Cumulative frequency distributions* Histograms Bar graphs* Stem-and-leaf displays You also should know how to construct those with an * beside them Know the definition of percentile rank Be able to identify and/or describe different shapes of distributions: Normal, symmetrical, skewed, unimodal, & bimodal distributions CENTRAL TENDENCY (CHAPTER 2 – MUNRO E-BOOK) Understand conceptually each of the 3 measures of central tendency: Mode, Median & Mean Know...

Words: 6621 - Pages: 27