Let’s try understanding normalization
How many normal forms are there?
There are seven normal forms.
They are
- First Normal Form
- Second Normal Form
- Third Normal Form
- Boyce-Codd Normal Form
- Fourth Normal Form
- Fifth Normal Form
- Sixth or Domain-key Normal form
Why do we need to do normalization?
To eliminate redundancy of data i.e. having same information stored at multiple places, which eventually be difficult to maintain and will also increase the size of our database.
With normalization we will have tables with fewer columns which will make data retrieval and insert, update and delete operations more efficient.
What do we mean when we say a table is not in normalized form?
Let’s take an example to understand this,
Say I want to create a database which stores my friends name and their top three favorite artists.
This database would be quite a simple so initially I’ll be having only one table in it say friends table. Here FID is the primary key.
| FID | FNAME | FavoriteArtist |
| 1 | Srihari | Akon, The Corrs, Robbie Williams. |
| 2 | Arvind | Enigma, Chicane, Shania Twain |
This table is not in normal form why?
FavoriteArtist column is not atomic or doesn’t have scalar value i.e. it has having more that one value.
Let’s modify this table
| FID | FNAME | FavoriteArtist1 | FavoriteArtist2 | FavoriteArtist3 |
| 1 | Srihari | Akon. | The Corrs | Robbie Williams. |
| 2 | Arvind | Enigma | Chicane | Shania Twain |
This table is also not in normal form why?
We have now changed our table and now each column has only one value!! (So what’s left?)
Because here we are having multiple columns with same kind of value.
I.e. repeating group of data or repeating columns.
So what we need to do to make it normal or at least bring it in First Normal Form?
- We’ll first break our single table into two.
- Each table should have information about only one entity so it would be nice if we store our friend’s information in one table and his favorite artists’ information in another
(For simplicity we are working with few columns but in real world scenario there could be column like friend’s phone no, email , address and favorites artists albums, awards received by them, country etc. So in that case having two different tables would make complete sense)
| FID | FNAME |
| 1 | Srihari |
| 2 | Arvind |
| FID | Favorite Artist |
| 1 | Akon. |
| 1 | The Corrs |
| 1 | Robbie Williams |
| 2 | Enigma |
| 2 | Chicane |
| 2 | Shania Twain |
FID foreign key in FavoriteArtist table which refers to FID in our Friends Table.
Now we can say that our table is in first normal form.
Remember For First Normal Form
Column values should be atomic, scalar or should be holding single value
No repetition of information or values in multiple columns.
So what does Second Normal Form means?
For second normal form our database should already be in first normal form and every non-key column must depend on entire primary key.
Here we can say that our Friend database was already in second normal form l.
Why?
Because we don’t have composite primary key in our friends and favorite artists table.
Composite primary keys are- primary keys made up of more than one column. But there is no such thing in our database.
But still let’s try to understand second normal form with another example
This is our new table
| Gadgets | Supplier | Cost | Supplier Address |
| Headphone | Abaci | 123$ | New York |
| Mp3 Player | Sagas | 250$ | California |
| Headphone | Mayas | 100$ | London |
In about table ITEM+SUPPLIER together form a composite primary key.
Let’s check for dependency
If I know gadget can I know the cost?
No same gadget is provided my different supplier at different rate.
If I know supplier can I know about the cost?
No because same supplier can provide me with different gadgets.
If I know both gadget and supplier can I know cost?
Yes than we can.
So cost is fully dependent (functionally dependent) on our composite primary key (Gadgets+Supplier)
Let’s start with another non-key column Supplier Address.
If I know gadget will I come to know about supplier address?
Obviously no.
If I know who the supplier is can I have it address?
Yes.
So here supplier is not completely dependent on (partial dependent) on our composite primary key (Gadgets+Supplier).
This table is surely not in Second Normal Form.
So what do we need to do to bring it in second normal form?
Here again we’ll break the table in two.
| Gadgets | Supplier | Cost |
| Headphone | Abaci | 123$ |
| Mp3 Player | Sagas | 250$ |
| Headphone | Mayas | 100$ |
| Supplier | Supplier Address |
| Abaci | New York |
| Sagas | California |
| Mayas | London |
We now how to normalize till second normal form.
But let’s take a break over here and learn some definitions and terms.
Composite Key: -Composite key is a primary key composed of multiple columns.
Functional Dependency – When value of one column is dependent on another column.
So that if value of one column changes the value of other column changes as well.
e.g. Supplier Address is functionally dependent on supplier name. If supplier’s name is changed in a record we need to change the supplier address as well.
S.Supplier–àS.SupplierAddress
“In our s table supplier address column is functionally dependent on the supplier column”
Partial Functional Dependency – A non-key column is dependent on some, but not all the columns in a composite primary key.
In our above example Supplier Address was partially dependent on our composite key columns (Gadgets+Supplier).
Transitive Dependency– A transitive dependency is a type of functional dependency in which the value in a non-key column is determined by the value in another non-key column.
With these definitions in mind let’s move to Third Normal Form.
For a table in third normal form
- It should already be in Second Normal Form.
- There should be no transitive dependency, i.e. we shouldn’t have any non-key column depending on any other non-key column.
Again we need to make sure that the non-key columns depend upon the primary key and not on any other non-key column.
| Album | Artist | No. of tracks | Country |
| Come on over | Shania Twain | 11 | Canada |
| History | Michael Jackson | 15 | USA |
| Up | Shania Twain | 11 | Canada |
| MCMXC A.D. | Enigma | 8 | Spain |
| The cross of changes | Enigma | 10 | Spain |
Although the above table looks fine but still there is something in it because of which we will normalize it further.
Album is the primary key of the above table.
Artist and No. of tracks are functionally dependent on the Album(primary key).
But can we say the same of Country as well?
In the above table Country value is getting repeated because of artist.
So in our above table Country column is depended on Artist column which is a non-key column.
So we will move that information in another table and could save table from redundancy i.e. repeating values of Country column.
| Album | Artist | No. of tracks |
| Come on over | Shania Twain | 11 |
| History | Michael Jackson | 15 |
| Up | Shania Twain | 11 |
| MCMXC A.D. | Enigma | 8 |
| The cross of changes | Enigma | 10 |
| Artist | Country |
| Shania Twain | Canada |
| Michael Jackson | USA |
| Enigma | Spain |
Normally this is considered enough and we don’t really go on applying the other normal forms.
Most of real-world application has databases which are in third normal forms.
Bye
References:-
Head First SQL,
Murach Sql for Sql server,
http://www.stuart.edu/courses/im510/database/2NF.htm
Discover more from Nishant Rana's Weblog
Subscribe to get the latest posts sent to your email.

great! may i say your information is the best “normalized information” have had so far. thanks and keep the good work.
LikeLike
i want that take only one example and reduce that example to various normal form not by taking different examples ……..however explaination was very good but i require to solve one example for all the normal forms
LikeLike
Its great work. To the point explaination without unnecessary things. Keep up. Cheers..
LikeLike
Beautifully explained. Thanks
LikeLike
the way in which the topic is explained here is lucid as well as outstanding,,,i appreciate it a lot and thankful too,,,
LikeLike
Please explain BCNF too
LikeLike
you explained well… i understood these concepts but stil i want to know abt remaining normal forms
with clear examples.. pls post that too….
LikeLike
nice one boss
LikeLike
explain me about 4nf & 5nf
LikeLike
hey u explained very well….This will help me a lot….
thanku….
LikeLike
Nice and easier way to explain!!
LikeLike
NICE !!easier way to explain!
LikeLike
nicely explained !!
LikeLike
Well explained… but where are the explaination after 3NF… please let me know the link
LikeLike
thanks ,,,.really nice explanation please describe remaining normal forms…………………
LikeLike
tats really useful.. well explained..
LikeLike
Great – Never seen a site explaining Normalization like this .
LikeLike
thankuu so much for such a simple and outstanding explanation
LikeLike
too good
LikeLike
Hi thanks alot…..u have put it so simply….:)
LikeLike
This blog has been cleared all confusions about normal forms…………..
Thanks buddy and please allobrate another normal forms also………………….
LikeLike
really good work
LikeLike
kidilan
LikeLike
Hello nishant,
I am developing a project related to student management system in VB.Net
Can we get the bar graphs or something like that to show the performance of the student??
LikeLike
simply great !!!!!!!!!
LikeLike
great …. i understand clearly …. from u r explanation only….
thanks..
LikeLike
awesome writing about normalization………
LikeLike
really awesome
LikeLike
Simply Superb. For last couple years i know these concepts but i was never clear enough.
Now, i am completely clear about 1 to 3 NFs.
Thank you So much.
LikeLike
Information is precise , comprehensive and clear cut. Good Job. I want to know about BCNF and 4th n 5th normal forms too.
LikeLike
thanq so much……..really i felt like …….’yah this is easy’…for the first time…..thnks for the hlp
LikeLike
mast dost, kya samjhaya hai.
LikeLike
awesome post Nishant….whether its CRM or Dotnet all your postsa re very helpful and very comprehensive…thanks nishant …keep this rocking this way:)
LikeLike
kya bat hai..bada pyaar aa raha hai uspe…;)
LikeLike
Manish, I’d suggest you to learn manners first instead of normalization.
LikeLike
I was always confused about what exactly the normal forms are, before reading this post.. this is awesome..
Thanks for the post.
LikeLike
Too good Till now i was so confused about normalization. Thanks a lot
LikeLike
thanks but please tell about boycee codd n 4 th normal form ,5 th normald form
LikeLike
thnks dude helpd me in solving my lessons
LikeLike
bhai…kaam toh badiya hai…lekin 4 n bcnf bhi toh samjha dete !
LikeLike
Bhai….samjhaya toh badiya hai ..lekin 4 nf n bcnf bhi samjha deta !
LikeLike
Awesome explanation of the Normal Forms… Thanks …
LikeLike
great work..it made my life easy bt would have been better if u cud also write 4th and boyce codd nf.
LikeLike
One other issue issue is video games are usually serious anyway with the major focus on understanding rather than fun.
Although, it has an entertainment facet to keep the kids engaged, each one game is frequently designed to
work towards a specific experience or program, such as numbers or research.
Thanks for your article.
LikeLiked by 1 person
after reading this article, read it. Normal forms [III rd unit]
http://sulabhbansal.blogspot.in/2010/08/lecture-notes.html
LikeLike
obviously like your web site however you need to take a look at the spelling
on several of your posts. A number of them are rife with spelling problems and I find it very
troublesome to tell the reality however I will certainly
come again again.
LikeLike