Premium Essay

Normalization in Databases

In:

Submitted By vkrailind
Words 1576
Pages 7
Contents
1- Introduction
2- Why to Normalize?
3- Normalization & Its types.
4- Applications of Normalization.
5- Conclusion

Introduction
While working with databases, one must have got chances to hear the term normalization. If someone asks "Is that database normalized?" or "Is that in BCNF?" All too often, the reply is "Uh, yeah." Normalization is often brushed aside as a luxury that only academics have time for. However, knowing the principles of normalization and applying them to daily database design tasks really isn't all that complicated and it could drastically improve the performance of DBMS.

In this report, I'll introduce the concept of normalization and take brief look at the most common normal forms.

Why to normalize?
While designing a database, the main problem existing in that “raw” database is redundancy. Redundancy is storing the same data item in more one place. A redundancy creates several problems like the following:

1. Extra storage space
2. Entering same data more than once.
3. Deleting data from more than one place.
4. Modifying data in more than one place.
5. Anomalies may occur in the database if insertion, deletion, modification etc are not done properly. It creates inconsistency and unreliability in the database.

To solve these problems, the “raw” database needs to be normalized.

Normalization
Normalization is the process of efficiently organizing data in a database. There are two goals of the normalization process:
• Eliminating redundant data (for example, storing the same data in more than one table)
• Ensuring data dependencies make sense (only storing related data in a table). Both of these are worthy goals as they reduce the amount of space a database consumes and ensure that data is logically stored.
Un-Normalized

Similar Documents

Premium Essay

Database Normalization

...Running Head: DATABASE NORMALIZATION Database Normalization *************** IST 7000 October 2006 Table of Contents ABSTRACT 3 Brief overview 4-5 Types of Normal Forms 5-9 Advantages and Disadvantages 9-10 Normalization Best practices 10-11 Conclusion 11-12 References 13 Appendix A 14 Example Normal Form Diagrams 15 Abstract In relational databases, normalization is a process that is used to eliminate redundancy, reduce the potential for anomalies during data processing, and maintain data consistency and integrity through out the database. It is a design technique used primarily as a guide in designing relational databases. Normalization is essentially a two step process that puts data into tabular form by removing repeating groups and then removes duplicated data from the relational tables. The theory of normalization is based on the concept of normal forms introduced by Edgar F. Codd in 1970. I choosed this topic because this is one of the areas in my role as a database administrator where I’m weak. During my training as a database administrator, I was never really offered a database design class, however, while at work, I have been occasionally given the challenging task of reviewing a data model and giving...

Words: 3045 - Pages: 13

Premium Essay

Database Normalization

...formal definitions Introduction If your database design doesn’t conform to (at least) the 3rd Normal Form (3NF), chances are high that you will find it hard to achieve the performance need for a successful application. Furthermore, you will find that writing good SQL-statements (SELECT, UPDATE, INSERT or DELETE) will be difficult, and sometimes actually impossible without using a lot of procedural coding (PL/SQL in Oracle, VB/C# in Microsoft products). BTW: For a more practical, down-to-Earth English-oriented explanation of the 5 Normal forms, visit my Database Normalization eBook page. Definition of terms Now, first a few definitions about database normalization: I cite a few as I have found them in the Hyper dictionary on the Internet, though you may find a number of differing definitions around. We are dealing with the "Relational Model" as the basis for our relational databases. The relational model as I see it is Definition: A data model introduced by E.F. Codd in 1959/1970, particularly well suited for business data management. In this model, data are organized in tables. The set of names of the columns is called the "schema" of the table.". The work of E. F. Codd, and also Chris Date, is based on relational algebra, which is well outside the scope of this article. Database model: The product (outcome) of the database design process which aims to identify and organize the required data conceptually and logically. A database model tells you what information is to...

Words: 767 - Pages: 4

Premium Essay

Database Normalization and Logical Process Concept Paper

...Database Normalization and Logical Process Concept paper This short paper will explain with a simple example the process and the need of normalization in the most of the business databases. Complete proposal break down (Timeline phases, Financial phases) will be submitted per management request. Overview At first any database should be designed with the end user in mind. Logical database design, also referred to as the logical model, is the process of arranging data into logical, organized groups of objects that can easily be maintained. The logical design of a database should reduce data repetition or go so far as to completely eliminate it. The needs of the end user should be one of the top considerations when designing a database. We should remember trough all design process that the end user is the person who ultimately uses the database. There should be ease of use through the user's front-end tool (a client program that allows a user access to a database), but this, along with optimal performance, cannot be achieved if the user's needs are not taken into consideration. Some user-related research and design considerations in any business database include the following: What data should be stored in the database? How will the user access the database? What privileges does the user require? How should the data be grouped in the database? What data is the most commonly accessed? How is all data related in the database? What measures should be taken to ensure accurate data...

Words: 1393 - Pages: 6

Free Essay

Normalization

...Normalization In normalized database, the database has more number of tables. As normalization means creating more tables from lesser ones. As the numbers of tables created are higher, the more joints will be needed to get at the data. Normalization normally removes the duplicity i.e. redundant data which results in increase in database performance. Impacts of Normalization on Database Performance Normalization has various impacts on database performance. These are: 1. Removal of Redundancy: Normalization increases performance by removing the redundancy. 2. Less Storage Space: Normalization results into less storage space, because the redundant data is removed. 3. Reliability: It helps in producing the reliable information. 4. Efficient System: In order to provide efficient system, Normalization plays a very important role. In spite of increasing performance, Normalization requires much access time to execute the queries. Moreover, it also needs more number of joints. Example: We have two tables i.e. orders and customers 1. Orders table: * Select * from orders; Order_id | Order_name | Customer_id | 1 | Order_1 | 1 | 2 | Order_2 | 8 | 4 | Order_4 | 2 | 5 | Order_5 | 3 | 7 | Order_7 | 3 | 9 | Order_9 | 1 | 10 | Order_10 | 3 | 2. Customers table: * Select * from customers; Customer_id | Customer_name | 1 | Customer1 | 3 | Customer3 | 5 | Customer5 | * select * from db.orders O INNER JOIN db.customers C ON O.Customer_ID=...

Words: 662 - Pages: 3

Free Essay

Usefulness of Databases in the Food Industry

...Usefulness of Databases for the Food Industry Almost every company that exists can benefit from some sort of automation by utilizing databases to store information. A database provides a business with the ability to more effectively manage the information about their business. As databases have advanced, the ease of implementing such databases has advanced too. A database can be utilized to remove much of the manual paperwork that is required in the form of administrative work that would normally be handled by people. In that regard a database can actually save the company money as there is a reduced need for employees to manage the data and instead the database is used for this. The purpose of this project is to allow the company to move from paper records to a database system to track their recipes. Since it is a small food company the goal would be to start small and then grow from there as the business grows. In addition, the database should help the company become more efficient at their business allowing them to make more money in the long term. Based on the size of the company the best possible option for this company would be to utilize Microsoft Access for its ease of implementation as well as the price of the database. This database would provide the company the jumpstart they need to have a database that would create value for the company. As the company grew there would be the opportunity to upgrade to a more robust database platform such as Microsoft SQL Server...

Words: 1095 - Pages: 5

Premium Essay

Pos 410 Week 1 Dq 1&2

...Product Description PRODUCT DESCRIPTION POS 410 Week 1 Dq 1&2, A relational database matches data by using common characteristics found within the data set. The resulting groups of data are organized and are much easier for many people to understand. For example, a data set containing all the real-estate transactions in a town can be A relational database matches data by using common characteristics found within the data set. The resulting groups of data are organized and are much easier for many people to understand. For example, a data set containing all the real-estate transactions in a town can be In the field of relational database design, normalization is a systematic way of ensuring that a database structure is suitable for general purpose querying and free of certain undesirable characteristics; insertion, update, and deletion anomalies that could lead to a loss of data integrity. Informally, a relational database table is often described as “normalized” if it is in the Third Normal Form. Most 3NF tables are free of insertion, update, and deletion anomalies, i.e. in In the field of relational database design, normalization is a systematic way of ensuring that a database structure is suitable for general purpose querying and free of certain undesirable characteristics; insertion, update, and deletion anomalies that could lead to a loss of data integrity. Informally, a relational database table is often described as “normalized” if it is in the Third Normal Form. Most...

Words: 362 - Pages: 2

Premium Essay

Normalization

...1.Introduction Relational database design theory is mainly based on a class of constraints called Functional Dependencies (FDs).FDs are a generalization of keys. This theory defines when a relation is in normal form (e.g., in Third Normal Form or 3NF) for a given set of FDs. It is usually a sign of bad DB design if a schema contains relations that violate the normal form requirements. If a normal form is violated, data is stored redundantly, and information about different concepts is intermixed. To reduce the redundancy in a given relation we need to decompose it in two or more sub relations . Decomposition into highest normal forms required maintaining properties like loss less join decomposition and dependency preserving decomposition. To normalize a relation R we have to remove certain dependencies that exist in it like partial dependency& transitive dependency. Basic definitions 1.1 Functional dependency: In a given table, an attribute Y is said to have a functional dependency on a set of attributes X (written X → Y) if and only if each X value is associated with precisely one Y value. For example, in an "Employee" table that includes the attributes "Employee ID" and "Employee Date of Birth", the functional dependency {Employee ID} → {Employee Date of Birth} would hold. It follows from the previous two sentences that each {Employee ID} is associated with precisely one {Employee Date of Birth}. 1.2 Trivial functional dependency A trivial functional dependency...

Words: 2812 - Pages: 12

Premium Essay

Databases

...Why Normalization Failed to Become the Ultimate Guide for Database Designers? Marin Fotache Alexandru Ioan Cuza University of Iasi, Romania Faculty of Economics and Business Administration Dept. of Business Information Systems Blvd. Carol I, nr. 22, Iasi, 700505, Romania Tel: + 40 744 497 654, Fax: + 40 232 217 000 fotache@uaic.ro ABSTRACT With an impressive theoretical foundation, normalization was supposed to bring rigor and relevance into such a slippery domain as database design is. Almost every database textbook treats normalization in a certain extent, usually suggesting that the topic is so clear and consolidated that it does not deserve deeper discussions. But the reality is completely different. After more than three decades, normalization not only has lost much of its interest in the research papers, but also is still looking for practitioners to apply it effectively. Despite the vast amount of database literature, comprehensive books illustrating the application of normalization to effective real-world applications are still waited. This paper reflects the point of 1 view of an Information Systems academic who incidentally has been for almost twenty years a practitioner in developing database applications. It outlines the main weaknesses of normalization and offers some explanations about the failure of a generous framework in becoming the so much needed universal guide for database designers. Practitioners might be interested in finding out (or confirming)...

Words: 12110 - Pages: 49

Premium Essay

Rfgr

...the arrow 4.7 keys is a group of one or more attributes that are uniquely identifies a row 4.8 yes and yes 4.9 Deletion anomaly is a removal of a row of data but it also removes every relationship with that row. Ex: a removal of a computer in an inventory with everything to do with that computer 4.10 Insertion anomaly is adding an entry in a database but can because it hast. Ex: adding a computer to a computer inventory but cant 4.11 4.12 second normal form is the second step in normalization in a database. 4.13 third normalization form is the 3rd step in normalizing a database 4.14 BCNF is a form of database normalization 4.15 multi-value dependency occurs when the presence of one or more rows in a table implies the presence of one or more other rows in the same tables. Ex: Car Company that manufacture many models of cars, but always make both red and blue colors of each models. 4.16 4.17 Forth-normal form is a level of database normalization where there is no non-trivial decency other than a candidate key 4.18 Domain/key normal form is a normal form used in a database normalization that requires that the database contains no constraints other domain constraints and key constraints 4.19 4.22...

Words: 333 - Pages: 2

Premium Essay

Database Management

...redundancy. Relational DataBase tables – relational model emerged due to list problems. It is a collection of separate tables. Table holds data for only one theme, not multiple. When evaluating a database design, consider mod actions: insert/delete/update. Structured Query Language SQL – leading tech for data definition and manipulation. Can reconstruct lists from their underlying tables, query for specific data conditions, perform computations, can update/delete/insert data. Database System four components: * Database – collection of related tables and other structures * Database Management System DBMS – computer program used to create, process, and administer the database. Receives requests encoded in SQL and translates them into action on the database. Large complicated program licensed by software vendors. * Database Application – Set of one or more computer programs that serves as an intermediary between the user and the DBMS, These programs read or modify data by sending SQL statements to the DBMS. * Users – use the DB Application to keep track of things. Use forms to read, enter, and query data. Database – defined as self-describing (a description of the structure if the database is contained within the database itself) relational collection of tables. Metadata – data about the structure of the a database. (IE names of tables, names of columns, and the tables in which they belong, properties of tables and columns) Database contents – user data...

Words: 2798 - Pages: 12

Premium Essay

Nation Park

...| National Park Service System | Normalization Summary | Thurmond Burdette 4-14-2015 | In this summary you will understand the importance of normalization for this project. Normalization is the process the team used to efficiently organize data within the database. The goal of this process is to remove repeated data such as storing the same data in more than one table, and ensuring data dependencies. The ultimate goals of the normalization process ensures that redundancy is reduced and that data is organized so that it is relatable, decreases database space allowing the data to be understood. The NPS (National Park Service) has contracted with our organization to track data of visitation to all 51 of their parks which have recently experienced an increase in visitation. The data will reflect how work load demand and distribution affect funding and salary increases going forward. Presently utilizing Microsoft Access the NPS has used this relational database to keep data up to date and would like our analysts to create a “back-end” process for the system currently being used. The Normalization Process streamlines the above process by using guidelines that begins with the lowest form up to the fifth normal form. The first and second normal form sets a framework for the data to be organized and addresses duplication of data. The data tables in these two steps are organized independently and clearly identifiable. The Second normal form...

Words: 382 - Pages: 2

Free Essay

Case Study

...from "Databases R US" IT consulting and managing firm. You just got hired by "Home Solutions Inc" a construction and Parts Company to help them with their customer and parts data management. Unfortunately, the company has been depending on the owner's nephew for their data needs and Joe has done his best to come through for the family business. His lack of experience with data management is hurting the business as they have been unable to successfully keep track of their inventory system, customers and sales. Joe has put together a spreadsheet, see below, and they have been using it as a database of sorts, to track their customers and inventory. Your specific tasks to help the business are in parenthesis are the percentages that correspond to the per item payment (grade weight): 1) Review the existing spreadsheet, and their sample data and make any assumptions you need about the company. Make any comments about the approach that you are going to follow, as you see fit. (10%) 2) Redesign the spreadsheet into a database with tables, add any fields that you think could be useful, even though it is not absolutely necessary. Explain your actions as you go along. 3) Perform a functional dependency analysis, and include it as part of your deliverables, for every step of the normalization process (25%) 4) Clearly take the existing un - normalized structure through distinct 1NF, 2NF and finally 3NF stages, using the shorthand representation. All three stages of normalization have...

Words: 600 - Pages: 3

Premium Essay

Database Administration

...DATABASE DESIGN AND THE IMPORTANCE OF DESIGNING A FUNCTIONAL PROCESS CIS475 Instructor: IBRAHIM ELHAG, PhD 3/6/2012 DATABASE DESIGN One of the most important if not the most important aspect of a database is the database design process. It is a must that the design is good and functional. A database consists of many different parts of an engineer’s design, which together makes up a complete database system. A database system is designed to manage large bodies of information. Database design is the construction of a database as a subsystem of an information system. Therefore, the database design process is an integral component of the information system design process. ( Chilson and Kudlac, 1983). One of the greatest problems in the application design process is the lack of foresight. During the design process there must be a thorough evaluation of your database, what it should hold, how data relates to each other, and most importantly, whether it is scalable. When it comes to the maintenance of your database it should be easy to maintain, this means storing as limited amount of repetitive data as possible. Let’s say you have a lot of duplicate data, and one instant of that data undergoes a name change, that change has to be made for all occurrences of that data. You can overcome this duplication problem by creating a table of possible values and use a key to refer to the value. That way, if the value changes names, the change occurs only...

Words: 2041 - Pages: 9

Premium Essay

Normalization

...Chapter 5 Normalization of Database Tables Discussion Focus Why are some table structures considered to be bad and others good and how do you recognize the difference between good and bad structures? From an information management point of view, possibly the most vexing and destructive problems are created through uncontrolled data redundancies. Such redundancies produce update and delete anomalies that create data integrity problems. The loss of data integrity can destroy the usefulness of the data within the database. (If necessary, review Chapter 1, Section 1.4.4, “Data Redundancy”, to make sure that your students understand the terminology and that they appreciate the dangers of data redundancy.) Table structures are poor whenever they promote uncontrolled data redundancy. For example, the table structure shown in Figure IM5.1 is poor because it stores redundant data. In this example, the AC_MODEL, AC_RENT_CHG, and AC_SEATS attributes are redundant. (For example, note that the hourly rental charge of $58.50 is stored four times, once for each of the four Cessna C-172 Skyhawk aircraft – check records 1, 2, 4, and 9.) Figure IM5.1 A Poor Table Structure [pic] |[pic] The figures shown in this discussion show the contents of the IM_Discussion database. This database is located on the teacher’s CD. The | |Student Online Companion also includes SQL script files (Oracle and SQLServer) for all of the data sets used throughout the book. ...

Words: 9797 - Pages: 40

Premium Essay

Normalization

...Normalization of the ERD The Huffman Trucking Database (HTD) is normalizing up to 3NF level. The first step in normalizing is to fine the candidate key in the database. The “Employer ID Number”, “Driver Log ID”, and “Citation Number” are candidate keys in the database. First Normal Form (1NF) values of a database are: • Each column must have a unique name. • The order of the rows and columns doesn’t matter. • Each column must have a single data type. • No two rows can contain identical values. • Each column must contain a single value. • Columns cannot contain repeating groups. The tables in HTD consists unique names and each column has a single data type. Each table consists of primary key, which makes the row contain identical values. The HTD passes all the requirement of firs normal form. Second Normal Form (2NF) form with two conditions: • The table is in 1NF. • All the non key fields depend upon on all the key fields. All the three tables are depending upon the primary key or key fields; this makes the table to update anomalies. The HTD passes the requirement of second normal form Third Normal Form (3NF) form with two conditions: • The table is in 1NF. • It contains no transitive dependencies. A transitive dependency is when one non-key field’s value depends on another non-key field’s value. The drives log table need to driver log key to form 3NF, because Employee ID, Date, and Tractor ID are inter dependent. The other two table are depend upon the primary...

Words: 257 - Pages: 2