Premium Essay

Stability, Equivalence and Internal Consistency

In:

Submitted By Saved730
Words 738
Pages 3
You have been asked by the head of marketing to design an instrument by which your private, for-profit school can evaluate the quality and value of its various curricula and courses. How might you try to ensure that your instrument has:

Stability is important because it allows us to obtain the same results with repeated measurement using the same person and the same instruments for reliability from one time to another. Therefore, if I had to design an instrument in which a private, for profit school would be able to evaluate the quality and value of its variety of curricula and courses I would ensure that the instrument has stability by doing an observational study of the same students who are participating in the test. Then those students would be observed a second time to determine reliability. The text states that it is often possible to do more than one observational study of the same subject and then compare them to see if they are consistent. However situational factors do sometimes change that can alter the observation (Cooper & Schindler, 2008).

Equivalence is a perspective on reliability that relates to variations at one point in time among observers and samples of items (Cooper & Schindler, 2008). Therefore, to test equivalence, I would use the instrument with more than one group of students, but at the same time. Different groups of students at the same time. Thus, I would be able to determine equivalence is to compare the scores of different observers for the same study. Equivalence is more concerned with how a set of items will categorize an individual. Thus if there are differences in responses between students, but if they are classified the same the test is said to be of good equivalence (Cooper & Schindler, 2008). Thus the equivalent instrument should be good at knowing the differences between students of all the groups.

Similar Documents

Premium Essay

Msa 507

...Stability is important because it allows us to obtain the same results with repeated measurement using the same person and the same instruments for reliability from one time to another. Therefore, if I had to design an instrument in which a private, for profit school would be able to evaluate the quality and value of its variety of curricula and courses I would ensure that the instrument has stability by doing an observational study of the same students who are participating in the test. Then those students would be observed a second time to determine reliability. The text states that it is often possible to do more than one observational study of the same subject and then compare them to see if they are consistent. However situational factors do sometimes change that can alter the observation (Cooper & Schindler, 2008). Equivalence is a perspective on reliability that relates to variations at one point in time among observers and samples of items (Cooper & Schindler, 2008). Therefore, to test equivalence, I would use the instrument with more than one group of students, but at the same time. Different groups of students at the same time. Thus, I would be able to determine equivalence is to compare the scores of different observers for the same study. Equivalence is more concerned with how a set of items will categorize an individual. Thus if there are differences in responses between students, but if they are classified the same the test is said to be of good...

Words: 285 - Pages: 2

Premium Essay

Case Study 1

...VALIDITY AND RELIABILITY For the statistical consultant working with social science researchers the estimation of reliability and validity is a task frequently encountered. Measurement issues differ in the social sciences in that they are related to the quantification of abstract, intangible and unobservable constructs. In many instances, then, the meaning of quantities is only inferred. Let us begin by a general description of the paradigm that we are dealing with. Most concepts in the behavioral sciences have meaning within the context of the theory that they are a part of. Each concept, thus, has an operational definition which is governed by the overarching theory. If a concept is involved in the testing of hypothesis to support the theory it has to be measured. So the first decision that the research is faced with is “how shall the concept be measured?” That is the type of measure. At a very broad level the type of measure can be observational, self-report, interview, etc. These types ultimately take shape of a more specific form like observation of ongoing activity, observing video-taped events, self-report measures like questionnaires that can be open-ended or close-ended, Likert-type scales, interviews that are structured, semi-structured or unstructured and open-ended or close-ended. Needless to say, each type of measure has specific types of issues that need to be addressed to make the measurement meaningful, accurate, and efficient. Another...

Words: 1512 - Pages: 7

Premium Essay

Validity and Reliability

...VALIDITY AND RELIABILITY For the statistical consultant working with social science researchers the estimation of reliability and validity is a task frequently encountered. Measurement issues differ in the social sciences in that they are related to the quantification of abstract, intangible and unobservable constructs. In many instances, then, the meaning of quantities is only inferred. Let us begin by a general description of the paradigm that we are dealing with. Most concepts in the behavioral sciences have meaning within the context of the theory that they are a part of. Each concept, thus, has an operational definition which is governed by the overarching theory. If a concept is involved in the testing of hypothesis to support the theory it has to be measured. So the first decision that the research is faced with is “how shall the concept be measured?” That is the type of measure. At a very broad level the type of measure can be observational, self-report, interview, etc. These types ultimately take shape of a more specific form like observation of ongoing activity, observing video-taped events, self-report measures like questionnaires that can be open-ended or close-ended, Likert-type scales, interviews that are structured, semi-structured or unstructured and open-ended or close-ended. Needless to say, each type of measure has specific types of issues that need to be addressed to make the measurement meaningful, accurate, and efficient. Another...

Words: 1512 - Pages: 7

Premium Essay

Reliability and Validity

...field studies. Within each of these methods are specific procedures that lead the researcher to various outcomes. The tools, or instruments, used to measure observations or statistics throughout the process are very important. To understand how well the instruments work and the extent to which the outcomes will produce similar results in the future, researchers examine different types of validity and reliability. Reliability is the extent to which an instrument produces consistent results and the probability that others can achieve the same results when reproducing the study. There are several types of reliability, including alternate-form reliability, internal-consistency reliability, item-to-item or judge-to-judge reliability, and test-retest reliability (Rosnow & Rosenthal, 2008). Internal-consistency reliability measures the amount of correlation between items on a test (Darity, 2008). The average correlation between items is indicated by item-to-item reliability. These types of reliability let the researcher know how well the items on a test go together (Rosnow & Rosenthal, 2008). A questionnaire or survey is not of much use if the questions on them are completely unrelated. Judge-to-judge reliability works in a similar way as item-to-item availability, except it measures the average reliability of an individual judge. Judges are used to observe behaviors and assign a rating to those behaviors and an average is taken of their scores, similar to item-to-item...

Words: 1099 - Pages: 5

Premium Essay

Observational Intervention Studies

...Quasi-Experimental Studies differs from interventional study designs because there is not a control group and randomization cannot occur; therefore cause and effect cannot be inferred (Melnyk, Morrison-Beedy, & Cole, 2015).  Quasi-experimental with pretest and post-test, comparison/control group, No random assignment  is similar to the two-group RCT with the pretest, posttest, and the comparison group (Melnyk, Morrison-Beedy, & Cole, 2015).  The threats to internal and external validity are present due to the participants are not randomly assigned ( Bevins, 1999). Interrupted time series design: does not employ random assignment or a control group. The benefit to this type of design the pretest assessments and posttest measurement over time which allow the researcher to determine a trend (Bevins, 1999, Melnyk, Morrison-Beedy, & Cole, 2015). The down side to this type of design is the history threat to internal validity ( Bevins, 1999, Melnyk, Morrison-Beedy, & Cole,...

Words: 1561 - Pages: 7

Premium Essay

Psychometric Properties of Psychological Assessment Measures

...Item analysis phase 7 1. Item difficulty (p) 7 2. Discrimination power 7 3. Preliminary investigation into item bias 8 5. Revising and standardizing the final version of the measure 8 6. Technical evaluation and establishing norms 8 1. Issues related to the reliability of a psychological measure 8 1. Definition 8 2. Measurement error 8 3. The reliability coefficient 9 4. Standard error of measurement 9 5. Types of reliability 10 2.6.1.5.1. Reliability measures of stability 10 - Test-retest reliability - Alternate-form reliability 2.6.1.5.2. Reliability measures of internal consistency 11 - Split-half reliability - Kuder-Richardson and coefficient Alpha reliability - Inter-scorer reliability 2.6.2. Issues...

Words: 6499 - Pages: 26

Premium Essay

Psychology

...DEPARTMENT OF INDUSTRIAL PSYCHOLOGY IPS 704 – 2012 PSYCHOMETRICS – ASSIGNMENT 1 LECTURER: Mr R Oliver DUE DATE: 26 April 2012 STUDENT 1: Jodine Hoffmann : 2038038 STUDENT 2: Nonstikeleo Mokoena : 3158789 STUDENT 3: Shafeeqah Jacobs : 2561402 STUDENT 4: Simone` Juganath : 3158797 PLAGARISM DECLARATIONI hereby declare that: | YES | NO | a. | I have perused and understood the relevant sections relating to plagiarism, citation and referencing; | √ | | b. | I know that plagiarism is wrong; | √ | | c. | I did not attempt to present the ideas of another as if they were my own; | √ | | d. | I did not attempt to represent the words or work of another as if they were my own; | √ | | e. | I did not utilize the ideas, words or work of another without acknowledgement; | √ | | f. | I did not use the printed text, electronic text, images, computer programme, sound, performance or creative works of another without proper acknowledgement; | √ | | g. | Where I engaged with group of student to create a particular piece of work, the work correctly reflected the contribution made (where a single piece of work is collected generated, all of the group carries the responsibility for that piece of work); | √ | | h. | I have not allowed, and will not allow, anyone to copy my work with the intention of passing it off as his or her own work. | √ | | i. | I have not copied another person’s assignment, essay or take-home test or any part thereof...

Words: 5356 - Pages: 22

Premium Essay

The Norwegian Version of the Peditric Nurses' Knowledge and Attitudes Survey Regarding Pain: Reability and Validity

...The Norwegian Version of the Pediatric Nurses’ Knowledge and Attitudes Survey Regarding Pain: Reliability and Validity Kari Raaum Hovde, Tone Høilo Granheim, Knut-Andreas Christophersen, Alfhild Dihle P ain in children is still both under-estimated and under-treated (Bell et al., 2009; Liossi, 2006; Stinson, Yamada, Dickson, Lamba, & Stevens, 2008; Zempsky, Cravero, & the Committee on Pediatric Emergency Medicine and Section of Anesthesiology and Pain Medicine, 2004). Inadequate treatment of pain in children may cause many negative psychological and physiological consequences, such as fear and anxiety, post-traumatic stress disorder, increased risk of infections, and delayed wound healing. In addition, poor pain control may result in increased length of stay in the hospital, contribute to the development of chronic pain, and even lead to death (American Academy of Pediatrics [AAP] & American Pain Society [APS], 2001; Liossi, 2006; Schechter, Berde, & Yaster, 2003; Vincent, 2005; Zempsky et al., 2004). The purpose of this study was to evaluate the reliability and validity of the Norwegian version of the Pediatric Nurses’ Knowledge and Attitudes Survey Regarding Pain (PNKAS-N) in nurses who work with children in pain. The PNKAS was translated into Norwegian in accordance with international guidelines and pilot tested with 10 nurses. The reliability was estimated using Cronbach’s alpha coefficient and the test-retest method using Pearson’s r coefficient. Construct validity...

Words: 5297 - Pages: 22

Free Essay

Measurement

...MEASUREMENT: Measurement is the heart of scientific discipline. It is the process of assigning numbers to objects or observations, the level of measurement being a function of the rules under which the numbers are assigned. According to Zikmund, “Measurement is the process of describing some property of a phenomenon of interest, usually by assigning numbers in a reliable and valid way”. In our daily life we are said to measure when we use some yardsticks to determine weight, height or some other features of a physical object. We also measure when we judge how well we like a song, intelligence, achievement, attitude, perception regarding quality. We, thus, measure physical object as well as abstract concepts. Measurement is a relatively complex & demanding task, especially so when it concerns qualitative or abstract phenomena. So, technically speaking, measurement is a process of mapping aspects of a domain onto other aspects of a range according to some rule of correspondence. It is a collection of quantitative data. TYPES OF MEASUREMENT: 1) Nominal measurement: All qualitative measurements are nominal, regardless of whether the categories differ from one another only in names. Nominal measurements represent the most elementary level of measurement in which values are assigned to an object for identification or classification purposes only. In nominal level of measurement, the categories differ from one another only in names. In other words, one category of a characteristic...

Words: 4319 - Pages: 18

Premium Essay

Case Study of Management Operational Control

...Abstract This empirical case study examines the operation of management control practices as a package in a growth firm context by paying particular attention to the couplings among cultural, personnel, action and results controls. The analysis focuses on two different management control packages in the face of similar contingencies at different points of time. The paper argues that the functionality of a control package depends on internal consistency, specifically on the reciprocal linkages of design and use between a primary mode of control and other control elements. Moreover, it argues that control package variety is driven by the way in which the management responds to functional demands. Two different control packages are considered equifinal to the extent of limited operational complexity, whereas an accounting-centric control package is also sufficient in the face of increasing levels of operational complexity. © 2008 Elsevier Ltd. All rights reserved. Keywords: Management control package; Control system variety; Internal consistency; Functional demands; Equifinality; Growth firm 1. Introduction This empirical case study examines the operation of management control practices as a package in a growth firm context. While it has been acknowledged that ‘soft’ and informal modes of control typically characterize small firms (e.g. Bruns and Waterhouse, 1975; Merchant, 1981; Flamholtz, 1983; Chenhall, 2003; Merchant and Van der Stede, 2007), an increasing body of literature...

Words: 13689 - Pages: 55

Premium Essay

Self-Congurity and Brend

...Xiaodong Qiu [Email address] Xiaodong Qiu [Email address] Abstract [Draw your reader in with an engaging abstract. It is typically a short summary of the document. When you’re ready to add your content, just click here and start typing.] Abstract [Draw your reader in with an engaging abstract. It is typically a short summary of the document. When you’re ready to add your content, just click here and start typing.] literature review Self-congruityand brand personality on consumer satisfaction literature review Self-congruityand brand personality on consumer satisfaction Since the theory of consumer satisfaction put forward, it has been attracting marketing researchers attention and came out many valuable theoretical results. However, it is relatively less about the influence of self-congruity to consumer satisfaction (Sirgy et al., 1997). This review will focus on the literature on self-congruity theory including self-concept and brand personality, and the influence on consumer satisfaction. Expand the mainstream view and research result and conclude the limit of former studies. Most of the literature focus on the changing of self-congruity, brand personality and consumer satisfaction. At the end of the 19 century, there were already some research topics related to self-congruity within the domain in psychology. However, until the late of the 1950s, researchers started to introduce the topic of self-congruity into the researches of customers’...

Words: 3725 - Pages: 15

Free Essay

Professor

...Effect of rating scale categories on reliability and fit statistics of the Malay Spiritual Well-Being Scale using Rasch analysis. 1Aqil Mohammad Daher, 2Sayed Ahmad Hassan, 3Than Winn, 4Mohd Ikhsan Selamat 1,4Population health and preventive medicine, faculty of medicine University Teknologi MARA, Sungai Buloh, Selangor Malaysia 2Department of Psychiatry, faculty of medicine University Teknologi MARA, Selayang, Selangor, Malaysia 3Department of Community Medicine, Mahsa University College, Kuala Lumpur, Malaysia Abstract Introduction Handful literatures have explored the effect of rating categories on reliability based on classical test theory indices, fewer studies have employed the item response theory in examining reliability. We carried out this study to examine the effect of rating scale categories (RSC) on reliability and fit statistics of the Malay Spiritual Well-Being Scale using Rasch model. Methods The Malay Spiritual Well-being scale (SWBS) with the original six and newly introduced three and four rating categories (RSC) was distributed randomly among three different samples of 50 participants each in different community screening programs at in Selangor state , Malaysia. Results The mean age of respondent in the three samples ranged 36-39 year, majority was females in all samples, and Islam was the major religion of respondents. Malay race was predominating followed by Chinese then Indian. Most participants were employed and more than a half in all...

Words: 4515 - Pages: 19

Free Essay

My Favorites

...84 CHAPTER 3 Research design, research method and population 3.1 INTRODUCTION Chapter 3 outlines the research design, the research method, the population under study, the sampling procedure, and the method that was used to collect data. The reliability and validity of the research instrument are addressed. Ethical considerations pertaining to the research are also discussed. 3.2 RESEARCH DESIGN It is the blueprint for conducting the study that maximises control over factors that could interfere with the validity of the findings. Designing a study helps the researcher to plan and implement the study in a way that will help the researcher to obtain intended results, thus increasing the chances of obtaining information that could be associated with the real situation (Burns & Grove 2001:223). 3.3 RESEARCH METHOD A quantitative, descriptive approach was adopted to investigate reasons why women who requested TOP services failed to use contraceptives effectively. 3.1 Quantitative This is a quantitative study since it is concerned with the numbers and frequencies with which contraceptive challenges were experienced by women who requested TOP services in terms of the 85 CTOP Act (no 92 of 1996) in the Gert Sibande District of the Mpumalannga Province of the RSA, during August and September 2003. 3.2 Description This study was descriptive because it complied with the characteristics of descriptive research as stipulated by Brink and Wood (1998: 283). • Descriptive designs...

Words: 4062 - Pages: 17

Premium Essay

Use of Personality Measures

...Human Resource Management Review 16 (2006) 155 – 180 www.socscinet.com/bam/humres The use of personality measures in personnel selection: What does current research support?☆ Mitchell G. Rothstein a,⁎, Richard D. Goffin b a Richard Ivey School of Business, University of Western Ontario, London, Ontario, Canada, N6A 3K7 b Department of Psychology, University of Western Ontario, Canada Abstract With an eye toward research and practice, this article reviews and evaluates main trends that have contributed to the increasing use of personality assessment in personnel selection. Research on the ability of personality to predict job performance is covered, including the Five Factor Model of personality versus narrow personality measures, meta-analyses of personality–criterion relationships, moderator effects, mediator effects, and incremental validity of personality over other selection testing methods. Personality and team performance is also covered. Main trends in contemporary research on the extent to which applicant “faking” of personality tests poses a serious threat are explicated, as are promising approaches for contending with applicant faking such as the “faking warning” and the forced-choice method of personality assessment. Finally, internet-based assessment of personality and computer adaptive personality testing are synopsized. © 2006 Elsevier Inc. All rights reserved. Keywords: Personality assessment; Personnel selection; Five factor model; Personality and job...

Words: 21212 - Pages: 85

Premium Essay

Segmentation, Targeting and Positioning

...A Cross-National and Cross-Cultural Approach to Global Market Segmentation: An Application Using Consumers’ Perceived Service Quality James Agarwal, Naresh K. Malhotra, and Ruth N. Bolton ABSTRACT The spread of global culture is being facilitated by the proliferation of transnational corporations, the rise of global capitalism, widespread aspiration for material possessions, and the homogenization of global consumption. The extent of convergence of cultural values across nations has been debated by international marketing researchers. However, from a practical standpoint, transnational firms require a cross-national, cross-cultural approach to market segmentation that can be used to guide the development of global marketing strategies. In this study, the authors investigate the application of cross-national versus cross-cultural approaches to market segmentation through a rigorous empirical investigation in the context of banking services. Although services constitute the fastest growing sector of the world economy, few studies have examined global market segmentation strategies for them. The authors develop theory-based crossnational hypotheses and test them by estimating a structural model of consumers’ perceived service quality using survey data from two countries: the United States and India. They test cross-cultural hypotheses by estimating the same model on culture-based clusters. They demonstrate that there are distinctive differences between cross-national and crosscultural...

Words: 15822 - Pages: 64