Five basic rules of database structure
- Order doesn’t matter
- No duplicate rows
- Every cell contains one value
- One column per type of information
- No redundant information
1. Order doesn’t matter
- The information should not be dependent on the order of the rows or the order
of the columns
data:image/s3,"s3://crabby-images/bbaf8/bbaf8378334c5978ee4d8191a38f56d3a5863ece" alt="Order of rows doesn't matter example"
2. No duplicate rows
data:image/s3,"s3://crabby-images/b6004/b60042b2087c4ad17328e4cd9f8b01082ff54e47" alt="No duplicate rows example"
3. Every cell contains one value
- This is an example of what not to do.
data:image/s3,"s3://crabby-images/d1ae4/d1ae4b763b544cc0dae3f4cd53661b713a8c1cfa" alt="One value per cell example"
- How would you query for
'Shrubland'
?
- This is also an example of what not to do.
data:image/s3,"s3://crabby-images/1b093/1b093a00639ac6602d3f5d250c5da4897c647ce4" alt="One column per type of information example"
- How would you query for records with
'Grassland' AND 'Shrubland'
?
Restructure the examples of what not to do for #3 and #4.
data:image/s3,"s3://crabby-images/bd22a/bd22a5b2504c90b8b5d0a6542362592293bfa461" alt="How to restructure to keep no duplicate rows and one value per cell"
- The proper structure lets us easily subset the data however we want.
Cross-tablulated data is difficult for SQL to work with.
data:image/s3,"s3://crabby-images/e268f/e268f7fc769da8d4557c2bb48e5940f6bbccbc6a" alt="Cross-tab table restructure"
data:image/s3,"s3://crabby-images/1ac84/1ac8457c82f08734aadd4f1dddc418f94492b8f6" alt="No redundant information example"
- Redundant information makes it more difficult to update or revise data.
- If something changes we want to be able to change it in one place, not hundreds of places.
- Use multiple tables to avoid redundant information.
- Easier and less error prone
- Use a Unique
RecordID
to link tables with complementary information.
Multiple tables
- It is often not efficient to include all information of interest in a single
table.
data:image/s3,"s3://crabby-images/7927a/7927a51792cb4a84e7c29bce830aea3c1f18a075" alt="Table with redundant information"
- To solve these problems,
- store data in multiple tables, and
- connect the data in different tables using
JOIN
to describe
relationships between tables (hence “relational” database)
- Each table contains a single data type
data:image/s3,"s3://crabby-images/49161/4916109d8d67c78ca1e11b74f77113ea7de540c3" alt="Restructuring a redundant table into two"