Free Essay

Type 1 and Type 2 Error

In:

Submitted By Krishnarangachar
Words 1375
Pages 6
Type I error (or, error of the first kind) and Type II error (or, error of the second kind) are precise technical terms used in statistics to describe particular flaws in a testing process, where a truenull hypothesis was incorrectly rejected (Type I error) or where one fails to reject a false null hypothesis (Type II error).
The terms are also used in a more general way by social scientists and others to refer to flaws in reasoning. This article is specifically devoted to the statistical meanings of those terms and the technical issues of the statistical errors that those terms describe.

Statistical test theory
In statistical test theory the notion of statistical error is an integral part of hypothesis testing. The test requires an unambiguous statement of a null hypothesis, which usually corresponds to a default "state of nature", for example "this person is healthy", "this accused is not guilty" or "this product is not broken". An alternative hypothesis is the negation of null hypothesis, for example, "this person is not healthy", "this accused is guilty" or "this product is broken". The result of the test may be negative, relative to null hypothesis (not healthy, guilty, broken) or positive (healthy, not guilty, not broken). If the result of the test corresponds with reality, then a correct decision has been made. However, if the result of the test does not correspond with reality, then an error has occurred. Due to the statistical nature of a test, the result is never, except in very rare cases, free of error. Two types of error are distinguished: type I error and type II error.
[edit]Type I error
A type I error, also known as an error of the first kind, is the wrong decision that is made when a test rejects a true null hypothesis (H0). A type I error may be compared with a so called false positive in other test situations. Type I error can be viewed as the error of excessive credulity.[1] In terms of folk tales, an investigator may be "crying wolf" (raising a false alarm) without a wolf in sight (H0: no wolf).
The rate of the type I error is called the size of the test and denoted by the Greek letter α (alpha). It usually equals the significance level of a test. In the case of a simple null hypothesis α is the probability of a type I error. If the null hypothesis is composite, α is the maximum (supremum) of the possible probabilities of a type I error.
[edit]Type II error
A type II error, also known as an error of the second kind, is the wrong decision that is made when a test accepts a false null hypothesis. A type II error may be compared with a so-called false negative in other test situations. Type II error can be viewed as the error of excessive skepticism.[1] In terms of folk tales, an investigator may fail to see the wolf ("failing to raise an alarm"; see Aesop's story of The Boy Who Cried Wolf). Again, H0: no wolf.
The rate of the type II error is denoted by the Greek letter β (beta) and related to the power of a test (which equals 1 − β).
What we actually call type I or type II error depends directly on the null hypothesis. Negation of the null hypothesis causes type I and type II errors to switch roles.
The goal of the test is to determine if the null hypothesis can be rejected. A statistical test can either reject (prove false) or fail to reject (fail to prove false) a null hypothesis, but never prove it true (i.e., failing to reject a null hypothesis does not prove it true).
[edit]Example
As it is conjectured that adding fluoride to toothpaste protects against cavities, the null hypothesis of no effect is tested. When the null hypothesis is true (i.e., there is indeed no effect, but the data give rise to rejection of this hypothesis, falsely suggesting that adding fluoride is effective against cavities), a type I error has occurred.
A type II error occurs when the null hypothesis is false (i.e., adding fluoride is actually effective against cavities, but the data are such that the null hypothesis cannot be rejected, failing to prove the existing effect).
In colloquial usage type I error can be thought of as "convicting an innocent person" and type II error "letting a guilty person go free".
Tabularised relations between truth/falseness of the null hypothesis and outcomes of the test: | Null hypothesis (H0) is true | Null hypothesis (H0) is false | Reject null hypothesis | Type I error
False positive | Correct outcome
True positive | Fail to reject null hypothesis | Correct outcome
True negative | Type II error
False negative |
[edit]Understanding Type I and Type II errors
From the Bayesian point of view, a type I error is one that looks at information that should not substantially change one's prior estimate of probability, but does. A type II error is that one looks at information which should change one's estimate, but does not. (Though the null hypothesis is not quite the same thing as one's prior estimate, it is, rather, one's pro forma prior estimate.)
Hypothesis testing is the art of testing whether a variation between two sample distributions can be explained by chance or not. In many practical applications type I errors are more delicate than type II errors. In these cases, care is usually focused on minimizing the occurrence of this statistical error. Suppose, the probability for a type I error is 1% , then there is a 1% chance that the observed variation is not true. This is called the level of significance, denoted with the Greek letter α (alpha). While 1% might be an acceptable level of significance for one application, a different application can require a very different level. For example, the standard goal of six sigma is to achieve precision to 4.5 standard deviations above or below the mean. This means that only 3.4 parts per million are allowed to be deficient in a normally distributed process
-------------------------------------------------
[edit]Consequences of type I and type II errors
Both types of errors are problems for individuals, corporations, and data analysis. A false positive (with null hypothesis of health) in medicine causes unnecessary worry or treatment, while a false negative gives the patient the dangerous illusion of good health and the patient might not get an available treatment. A false positive in manufacturing quality control (with a null hypothesis of a product being well-made), discards a product, which is actually well-made, while a false negative stamps a broken product as operational. A false positive (with null hypothesis of no effect) in scientific research suggest an effect, which is not actually there, while a false negative fails to detect an effect that is there.
Based on the real-life consequences of an error, one type may be more serious than the other. For example, NASA engineers would prefer to throw out an electronic circuit that is really fine (null hypothesis H0: not broken; reality: not broken; action: thrown out; error: type I, false positive) than to use one on a spacecraft that is actually broken (null hypothesis H0: not broken; reality: broken; action: use it; error: type II, false negative). In that situation a type I error raises the budget, but a type II error would risk the entire mission.
Alternatively, criminal courts set high bar for proof and procedure and sometimes release someone who is guilty (null hypothesis: innocent; reality: guilty; test find: not guilty; action: release; error: type II, false negative) rather than convict someone who is innocent (null hypothesis: innocent; reality: not guilty; test find: guilty; action: convict; error: type I, false positive). Each system makes its own choice regarding where to draw the line.
Minimizing errors of decision is not a simple issue; for any given sample size the effort to reduce one type of error generally results in increasing the other type of error. The only way to minimize both types of error, without just improving the test, is to increase the sample size, and this may or may not be feasible.

Similar Documents

Free Essay

Hta Analysis and Ngomsl Model

...Descriptions: To Display the class schedule of 5 courses 1. Run the program on any Java software * You can run your class scheduling program using any Java platforms like NetBeans, Eclipse, and Command prompt. 2. Enter the course details 3.1. Enter the course title * Type the name of the course in the dialog box that appears. 3.2.1. Click ok * After you have typed the course title in the dialog box , click ok 3.2.2. Click cancel * This terminates the program 3.2. Enter the no of Credit * Type the no of credit in the dialog box that appears in the screen. 3.3.3. Click Ok * After you have typed the no of credit 3.3.4. Click Ok on the Warning Dialog box * If you enter the no of credit wrongly a warning dialog box appears explaining the error made. Click Ok on the warning dialog box, then rectify your error 3.3.5. Click Cancel * This terminates the program 3.3. Enter the start time of the course * Type the start time of the course in the dialog box that appears in the screen. 3.4.6. Click Ok * After you have typed the start time of the course 3.4.7. Click Ok on the Warning Dialog box * If you enter the start time of the course wrong a warning dialog box appears explaining the error made. Click Ok on the warning dialog box, then rectify your error 3.4.8. Click Cancel * This terminates...

Words: 721 - Pages: 3

Premium Essay

Distribution Wksht

...University of Phoenix Material Distribution, Hypothesis Testing, and Error Worksheet Answer the following questions. Questions that are answered without the work will not receive full credit. When a question says explain or describe, please DO NOT copy word for word from a reference. You need to explain the concept so I know you understand what it means. For questions requiring material from Statdisk, make sure to turn labels on, take a screen capture (CTRL-Print Screen on most Windows-based computers), and paste the image into the worksheet. Crop the image as appropriate. 1. Describe a normal distribution in no more than 100 words (5 point). Answer: A normal distribution is a continuous random variable distribution with a bell shape, and has only two parameters: the mean, and the variance. A normal distribution can be represented by the formula: y=e^(-1/2)(x-μ/σ)/(σ√2pi). The mean can be any positive number and variance can be any positive number, so there are an infinite number of normal distributions. The shape of the distribution when graphed is symmetrical and bell-shaped. Use this information to answer questions 2-4. Following a brushfire, a forester takes core samples from the ten surviving Bigcone Douglas-fir trees in a test plot within the burn area, and a dendrochronologist determines the age of the source trees to be as follows (in years): 15 38 48 67 81 83 94 102 135 167 2. Construct a normal quantile plot in Statdisk, show the regression...

Words: 1895 - Pages: 8

Premium Essay

Conersion

...CONVERSION: Main Module Declare selection as integer Declare currency type as integer Declare international value as real Declare US value as real Do while user wants to continue Call display Menu Call get international value Call convert currency Call display results End while Print exit message End main module Display Menu Declare continue as Boolean Set continue = true While continue = true Display "Welcome to the international currency conversion program" Display "Please make a selection" Display "International Currency Types:" Display "1: Canadian Dollars" Display "2: Mexican Pesos" Display "3: English Pounds" Display "4: Japanese Yen" Display "5: French Francs" Display "6: Quit" Display "Enter a selection: "; Input currency type If currency type >= 1 and currency type <= 5 then Set continue = false else if currency type = 6 Display “Quitting Currency Conversion” continue = false else Display "Error 4: Invalid menu selection." continue = true end if end While End Display Menu Get International Value Declare value as integer Declare continue as Boolean Set continue = true While continue = true Display "Enter a currency value (positive number):" Input international value if international value > 0 and international value <=(NNN) NNN-NNNNthen continue = false else Display "Error 1: Invalid input--Negative Number" continue = true end if end while End Get International Value ...

Words: 892 - Pages: 4

Premium Essay

Whatever

...  Properties of Point Estimators      Statistical Inference: Hypothesis Testing    T test F test Chi square test   Measures of shape of the curve  Moments  Skewness  kurtosis Probability distributions  Normal Distribution  T-student Distribution  Chi-Square Distribution  F Distribution Index Number   Etc. Correlational Statistics  Covariance  Correlations  regressions 1 4/7/2014 Some Terminology  Variables are things that we measure, control, or  manipulate .They may be classified as: 1. Quantitative i.e. numerical  Continuous: takes fractional values ex. height in cm  Discrete : takes no fractional values ex. GDP  Random Variable: If the value of a variable cannot be  predicted in advance Non random : If the value of a variable cannot be  predicted in advance  Some Terminology 2. Qualitative i.e. non numerical 1. Nominal: Items are usually categorical and may have numbers assigned to them. Ex. gender (male, female), martial status ( married, unmarried, divorced) etc. 2. Ordinal: Items are usually set into some kind of order by their position on the scale. Ex. The first, third and fifth person in a race. 3. Interval: Interval data (also sometimes called integer) is measured along a scale in which each position is equidistant from one another. Interval data cannot...

Words: 1759 - Pages: 8

Premium Essay

Protein Lab Write Up

...Name: Obilor Kelechi Lab Section: 4 Title: Protein LAB Report Date: 11/2/14 INTRODUCTION: A protein is a compound that consist of one or more chains of amino acids. The body contains millions of protein that performs different functions. The DNA are transcribed to form a RNA which is translated to amino acids, this amino acids come together to form a protein. There is 20 amino acids in the body which combines in differently to form the different types of proteins. Proteins have different structures, they are: primary, secondary, tertiary and quaternary structures. The primary (1o) structures are sequence of amino acids in a long polypeptide chain. Neighboring carboxyl and amino acids groups bond together by hydrogen bonds to from different...

Words: 1250 - Pages: 5

Premium Essay

Marketing Strategy Statistics

...Research 4 1) Marketing Research 4 2) The Marketing Research process 4 II- Body 1: Litterature Review 6 1) Inferential Statistics 6 a) Dummy Variables 7 b) Experimental Analysis 7 2) Normal Distribution 8 Figure 1: A normal Distribution, bell-shaped curve 9 3) Skewness 9 4) The Kurtosis 9 5) Formula of Kurtosis and Skewness over their Standard error 10 6) Central Limit Theorem 10 7) T Test Hypothesis testing for one sample mean 11 a) State the Null and Alternative Hypothesis 11 b) Hypothesis of the Testing 11 c )Choosing the Level of Significance 12 d) Calculate the test statistic for One Sample Mean 13 8) Independent Samples t-Test 13 a) Stating the Null and Alternative Hypothesis 14 b) Assumptions of the Testing 14 c) Choosing the Level of Significance 15 d) Calculate the test statistic for independent samples 15 e) Interpreting the Results 16 9) Risks in Decision Making Using Hypothesis Testing 17 10) The β Risk 17 III- Body II: Application 1: One Sample Testing 18 1) The Research Topic 18 2) Decriptives for the Students Sample 18 a) Range, Mean, Standard Deviation 18 b) Variance, Skewness, Kurtosis 19 c) Histogram of the Student Sample 19 3) Hypothesis of the independent sample t test 20 4) T test for the sample mean 20 a) State the Null and Alternative Hypothesis 20 b) Selection of the significant level 21 c) SPSS Output 22 d) Conclusion of the T Test 23 e) Limitation of the research 23 IV- Body III: Application 2: Independent...

Words: 3983 - Pages: 16

Premium Essay

Statistics

...Business Statistics WISE-International Master Hypothesis Testing  A hypothesis is a claim (conjecture/assumption) about a population parameter:    population mean population proportion It is always about a population parameter, not a sample statistic A Common Theme Check the merits of this hypothesis based on sample information sample A hypothesis is formed about some population parameter  infer Hypothesis testing provides a general framework for approaching such inference problems ˆ The Null Hypothesis  Suppose that some hypothesis has been formed about the population parameter  and that this hypothesis will be believed unless sufficient contrary evidence is produced.  This hypothesis can be thought of as a maintained hypothesis. In the language of statistics, this hypothesis is called a null hypothesis, and is denoted as H0.    In hypothesis testing, the null hypothesis plays a role similar to that of a defendant on trial in many judicial systems. Just as a defendant is presumed to be innocent until proven guilty, the null hypothesis is presumed to be true until the data strongly suggest otherwise. The Alternative Hypothesis, H1  Having a null hypothesis requires having an alternative hypothesis that challenges the null hypothesis. In a Court of Law H 0 :   innocent H1 :   guilty The defendant is deemed innocent until the prosecution presents sufficiently strong contrary evidence...

Words: 3341 - Pages: 14

Premium Essay

Currency Conversion Procedure

...Display = International Currency Conversion Program Display = Make a Selection Display = International Currency Types Display = Canadian Dollars Display = Mexican Pesos Display = English Pounds Display = Japanese Yen Display = French Francs Display = Quit Display = Enter a Selection Input Currency Type If Currency Type >= 5 and Currency Type <=25 then Set = false Else if currency Type = 26 Display = Quitting Currency Conversion Display = Error 4 Invalid Menu Selection End Display Menu Display Menu Test Produce As the user is deciding which they would like to select an international currency type. When the user enters a number between 5 and 26 corresponding with a currency type, then the program is set to false. If 26 is selected the program displays “Quitting Currency Conversion.” But if the user does not enter a number between 5 and 26 the program displays “Error 4: Invalid Menu Selection” and the set it true. Test Case 1 Input: user enters 5 for Canadian Dollars Set = false Test Case 2 Input: user enters 6 for Mexican Pesos Set = false Test Case 3 Input: user enters 7 for English Pounds Set = false Test Case 4 Input: use enters 8 for Japanese Yen Set = false Test Case 5 Input: user enters 9 for French Francs Set = false Test Case 6 Input: user enter 10 for Quit Set = false Test Case 7 Input: user enters 11 Output: display “Error 4: Invalid Menu Selection” Set = true Get International Value Declare value as integer Declare set as Boolean ...

Words: 1040 - Pages: 5

Premium Essay

Significance Testing

...inference. Differentiate between statistical and practical “significance.” Explain, and distinguish between, two types of errors in hypothesis testing. Define and discuss the power of a test. AP Outline Fit: IV. Statistical Inference: Estimating population parameters and testing hypotheses (30%–40%) B. Tests of significance 1. Logic of significance testing, null and alternative hypotheses; P-values; one- and two-sided tests; concepts of Type I and Type II errors; concept of power 4. Test for a mean (large sample -- ( known) What you will learn: A. Significance Tests for µ (( known) 1. State the null and alternative hypotheses in a testing situation when the parameter in question is a population mean µ. 2. Explain in nontechnical language the meaning of the P-value when you are given the numerical value of P for a test. 3. Calculate the one-sample z-statistic and the P-value for both one-sided and two-sided tests about the mean µ of a Normal population. 4. Assess statistical significance at standard levels α by comparing P to α. 5. Recognize that significance testing does not measure the size or importance of an effect. 6. Recognize when you can use the z test and when the data collection design or a small sample from a skewed population makes it inappropriate. 7. Explain Type I error, Type II error, and power in a significance-testing problem. Section 11.1: Significance Tests: The Basics...

Words: 2804 - Pages: 12

Premium Essay

Gfdfd

...Chapter 13 Overall Audit Plan and Audit Program Key objectives: 1. Know the five types of audit tests. 4. Understand how the tests are emphasized in different circumstances. 5. Know the methodology for designing an audit program. 6. Understand the relationship of transaction-related audit objectives to balance-related audit objectives. 7. Integrate the four phases of the audit process. 1. Audit program The importance of the audit program cannot be underemphasized. It contains all the procedures that the auditor considers necessary in the circumstances. The specific knowledge needed to construct the audit program for tests of transactions (tests of details of balances) is covered in Chapter 13 (15). 2. Risk Assessment Procedures - The auditor is required to perform risk assessment procedures, including procedures to gain an understanding of the internal control system (Ch. 10). This includes sufficient documentation and inquiries, as well as a walk-through to support the understanding. 3. Types of Tests - there are four types of further audit procedures auditors perform after performing risk assessment procedures: | | |Relation to | | | |Risk Model | |Test |Name |CR |PDR | |TOC...

Words: 2113 - Pages: 9

Premium Essay

Statistics

...; ; ; possible combinations; (N ncr combi) ; ; ; σ = planning value ; ; E=Z∝/2 P(1-P)n ; ; ; ------------------------------------------------- Beware! Each of the following equations holds only if certain conditions are met: µx = np; µx = µ; and Finity | Infinity | σ=N-nN-1 (σn) | σ=σn | σ=N-nN-1 xp(1-p)n if n/N > 0,05 | σ=p1-pn if n/N <0,05 | 99% confident interval ∝/2 = 0,005 = 2,576 95% confident interval ∝/2 = 0,025 = 1.96 90% confident interval ∝/2 = 0,05 = 1.645 We can be ...% confident that μ is not less than.... but no more than..... * Both and   are random variables * Both and   have probability distribution called the sampling distribution of * Scetch sampling distribution is mean and standard deviation and shape. [-μ] or │- p│.= sampling error In order to solve a question you should know 3 things; mean of ; E() which is μ, the standard deviation of , called σ and of course the ‘shape’ is it a normal distribution? T distribution df(degrees of freedom)=n -1 A. Central Limit Theorem: in selecting simple random samples of size ‘n’ from a population with mean and standard deviation , the sampling distribution of the sample mean, , approaches a normal probability distribution with mean and standard deviation /n, as the sample size becomes large (that is, as n30). Note: first, the sampling distribution of can be approximated by a normal probability distribution...

Words: 396 - Pages: 2

Premium Essay

Chapter 17

...1. Ch. 17/7 – You conduct a survey of a sample of 25 members of this year’s graduating marketing students and find that the average GPA is 3.2. The standard deviation of the sample is 0.4. Over the last 10 years, the average GPA has been 3.0. Is the GPA of this year’s students significantly different from the long-run average? At what alpha level would it be significant? Null hypothesis H0: µ=3.0 Alternative hypothesis H1: µ≠3.0 Given the sample mean xbar=3.2, sample standard deviation s=0.4 and sample size n=25. We can use t-test. The test statistic can be computed as follows: T=(xbar-3.0)/s*sqrt(n) =(3.2-3.0)/0.4*sqrt(25) =0.2/0.4*5=2.5 Note that the degree of freedom df=25-1=24. So, we can get p-value=P(T>2.5) =0.0197 So, if we use significance level alpha = 0.02, then since p-value=0.0197<alpha=0.02, we should reject Null hypothesis H0. Hence, at a 0.02 significance level, we conclude that the GPA of this year’s students is significantly different from the long-run average. 2. Ch. 17/9 – You contact a random sample of 36 graduates of Western University and learn that their starting salaries averaged $28,000 last year. You then contact a random sample of 40 graduates from Eastern University and find that their average starting salary was $28,800. In each case, the standard deviation of the sample was $1,000. a. Test the null hypothesis that there is no difference between average salaries received by the graduates of the two schools. TO determine...

Words: 3567 - Pages: 15

Premium Essay

This Is It

...1. (20 pt) Go to http://www.stat.tamu.edu/~west/ph/sampledist.html. Use a skewed distribution. Take 1000 samples of sizes 2, 10 and 100. Construct a mean for each sample and look at the distribution of the sample means (third row). Record the mean and SD in the population for the original random variable. Make a table and record the mean of the means, and the standard deviation of the mean for each sample size (2, 10 and 100). For a bell-shaped distribution we showed that the mean of the sample means is very similar to the mean in the population for all population sizes. This is the property of unbiasedness. From your results, does the mean appear to be unbiased when the original distribution of the data is skewed? For a bell-shaped distribution we showed that the standard deviation of the mean, or the standard error of the mean, decreases with sample size. Is this true for skewed distributions? Calculate the standard error of the mean that is expected for each sample size and record in the table. How did the observed result compared to your calculation? * * Mean of the Population: 15.5297 * Median of the Population: 12.2861 * Standard Deviation of the Population: 12.5078 * * Sample Size | * Mean of the means | * Standard Deviation | * Standard Error of the Mean | * 2 | * 15.7741 | * 8.8115 | * 6.23067 | * 10 | * 15.4568 | * 3.847 | * 1.21653 | * 100 | * 15.5223 | * 1.2636 | * 0.12636 | * ...

Words: 1354 - Pages: 6

Free Essay

In a Business Environment, Why Should People Be Moral as Individuals?

...Arkansas Error Details | Steps to Resolve | Response Error Code:  101Primary Error Text:  DOB Is MissingDescription:  Date of Birth is missing in the Request | 1.     Look up the Transaction in ASSIST.A.     Verify that the Date of Birth appears in ASSIST on the Request page of the Transaction.          If the Date of Birth appears in the Request, have the Agent rerun the Transaction.          If the Date of Birth does not appear in the Request, have the Agent rerun the transaction by manually entering the ID # and Date of Birth.2.     Look up the Customer File in the Admin Console.A.     Verify that the Date of Birth appears on file.          If the Date of Birth appears on file, have the Agent rerun the transaction by using the Customer ID #.          If the Date of Birth does not appear on file, update if allowed according to the Customer Update Guidelines, and have the Agent rerun the Transaction. | Response Error Code:  102Primary Error Text:  ID Type MissingDescription:  Request is missing the ID Type (DL, SSN, CID, or ATL ID) | 1.     Look up the Transaction in ASSIST.A.     Verify that the ID # appears in ASSIST on the Request page of the Transaction.          If the ID # appears in the Request, have the Agent rerun the Transaction.          If the ID # does not appear in the Request, have the Agent rerun the transaction by manually entering the ID # and Date of Birth.2.     Look up the Customer File in the Admin Console.A.     Verify that the ID # appears...

Words: 141030 - Pages: 565

Premium Essay

Chapter 12 Cis 345

...need not be connected to a network. However, you must be logged on to the computer as a user with administrator-equivalent privileges. 1. Click Start, then click Control Panel. The Control Panel window appears. 2. In the Control Panel window, click System and Security. The System and Security window appears. 3. If the entire list of options doesn’t appear, scroll to the bottom of the window and click View event logs under the Administrative Tools heading. 4. A User Account Control window appears, requesting your permission to continue. Click Continue. 1. The Event Viewer window appears, with three columns of panes. The center pane lists a summary of administrative events. Notice that events are classified into the following types: Critical, Error, Warning, Information, Audit Success, and Audit Failure. The number of events that have been logged in each category are listed to the right of the classification entry. How many Error events has your Windows workstation logged in the last 24 hours? 410 errors In the last 7 days? 440 errors 2. If your workstation has logged any errors in the past 7 days, click the plus sign next to the event type Error. A list of error events appears. (If you do not have any entries in the Error category, click the plus sign next to the event type “Warning”...

Words: 1193 - Pages: 5