A list of puns related to "Database normalization"
Sorry for the click-bait-y title.
Given: a columnar storage system, such as an OLAP database like Snowflake, or even a columnar storage format like Parquet.
If you had a column that stored time as a text string, e.g. "11:30", would it improve storage to split that into one field with "11", one with ":", and one with "30"? My thought is that the number of unique values you need to store in the original case is 720, assuming a 12-hr format, or 1440, assuming a 24-hr format. In the split case, even though you would have more column objects, the number of unique values you have to carry is only 73 (12 + 1 + 60), assuming a 12-hr format, or 85 (24 + 1 + 60), assuming a 24-hr format.
Wouldn't that compress better, without degrading the information being persisted?
This assumes a dataset that is written to significantly more than it's read, so joining the fields back at query time isn't a big deal.
If so, would it be possible to write a program that does this basic level of scanning through a database and finds ways to optimize its storage?
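As a back-of-envelope check, the cardinality arithmetic above can be sketched in Python (assuming a 24-hr "HH:MM" format):

```python
# Sketch of the cardinality argument, assuming 24-hr "HH:MM" strings.
times = [f"{h:02d}:{m:02d}" for h in range(24) for m in range(60)]

# One combined column: every distinct time is its own dictionary entry.
combined_cardinality = len(set(times))  # 1440

# Split into hour / separator / minute columns.
hours = {t[:2] for t in times}      # 24 unique values
seps = {t[2] for t in times}        # 1 unique value (":")
minutes = {t[3:] for t in times}    # 60 unique values
split_cardinality = len(hours) + len(seps) + len(minutes)  # 85

print(combined_cardinality, split_cardinality)  # 1440 85
```

Note that the ":" column is a constant and carries no information, so two columns (or a single integer of minutes-since-midnight) would do the same job; and whether the split actually compresses better in a given system depends on that engine's encodings (dictionary, run-length, etc.), so this is only the dictionary-size part of the argument.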
Hi Reddit!
I am a full stack junior web/mobile developer and I am having a lot of doubts about databases.
The only database I know how to use is DynamoDB, which is a NoSQL database, and I am frequently searching for and learning about the differences between NoSQL and SQL. My question is about data normalization, which is a "feature" of SQL databases, right? But isn't the data validation and structure enforced by the frontend strictly followed? I mean, if my application will always use/create the same data structure, why should I be concerned about data normalization?
I know that you can send requests from outside the application, but with simple validation you can block those requests. I think I am getting so used to AWS's way of creating APIs and databases that I am mixing everything up in my mind.
Thank you in advance!!!
Hello,
I have done a fair amount of reading on database normalization. I feel as though I understand the concept and why it is implemented. However, I struggle with putting it into practice (particularly at the 2nd and 3rd normal forms). I think I may just have the wrong thought process. From the second you are tasked with normalizing a database (from the 2nd form onwards), what are you specifically looking for? Does anybody have tips or tricks?
Thanks in advance
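One concrete thing to look for: for 2NF, non-key attributes that depend on only part of a composite key; for 3NF, non-key attributes that depend on other non-key attributes. A small illustrative check (all table and column names here are made up for the example):

```python
# Hypothetical OrderItems rows with composite key (order_id, product_id).
rows = [
    # (order_id, product_id, product_name, customer_id, customer_city)
    (1, 10, "widget", 100, "Oslo"),
    (1, 11, "gadget", 100, "Oslo"),
    (2, 10, "widget", 101, "Bergen"),
]

def determines(rows, src_idx, dst_idx):
    """True if the source column(s) functionally determine the target column."""
    seen = {}
    for r in rows:
        key = tuple(r[i] for i in src_idx)
        if seen.setdefault(key, r[dst_idx]) != r[dst_idx]:
            return False
    return True

# 2NF smell: product_name depends on product_id alone (part of the key),
# so it belongs in a Products table.
print(determines(rows, [1], 2))  # True

# 3NF smell: customer_city depends on customer_id, a non-key attribute,
# so it belongs in a Customers table.
print(determines(rows, [3], 4))  # True
```

In practice you rarely scan data like this; you reason about what each attribute is "really about" (the product, the customer, or the order line itself) and move it to the table whose key determines it.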
For example, say we have an application where the user can store API URLs along with headers and parameters. For instance, the user could save a custom Google search for the query "cats", where the parameters would be q=cats.
Design #1: one table named "Api" storing the URL, with JSON columns for headers and parameters.
Design #2: four tables, with separate tables for Api, headers, params, and KeyValuePairs (the last is optional, but I see it in the code I'm inheriting).
---
Which is the better design?
My hunch is that design #1 (denormalization) is a lot simpler to work with as a developer and likely more performant as well. I don't see any benefit to the complexity of the additional database tables in this case, aside from maybe making it easier to run analytics on the data we're storing (which is a moot point in this scenario).
But that being said I come from more of a frontend background, so tell me if I'm off here.
---
EDIT: The database is Postgres, so it supports Json. This isn't a side project so I can't just swap out the database.
EDIT: Actually, the more I think about it, #2 (separate tables) doesn't sound that bad (minus the KeyValuePairs table, which I think is useless). It's a bit more work upfront, but it keeps things more robust, particularly since I'm working on a team with other people. With a looser schema, we could, say, hire a contractor who inputs the JSON incorrectly and causes everything to fail.
EDIT: Leaning back towards denormalization since the APIs aren't going to change much, and we likely won't be querying by headers/parameters so there's little benefit to normalization here. Thanks for the answers!
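For reference, design #1 is tiny to implement. A sketch using sqlite3 as a stand-in for Postgres (in Postgres the headers/params columns would be jsonb, which can still be indexed with GIN later if querying by parameters ever becomes necessary):

```python
import json
import sqlite3

# Design #1: a single table with JSON columns (stored as text in sqlite).
con = sqlite3.connect(":memory:")
con.execute(
    "CREATE TABLE api (id INTEGER PRIMARY KEY, url TEXT, headers TEXT, params TEXT)"
)
con.execute(
    "INSERT INTO api (url, headers, params) VALUES (?, ?, ?)",
    (
        "https://www.googleapis.com/customsearch/v1",   # illustrative URL
        json.dumps({"Accept": "application/json"}),
        json.dumps({"q": "cats"}),
    ),
)

# Reading it back is one row fetch plus a JSON parse -- no joins.
url, params = con.execute("SELECT url, params FROM api").fetchone()
print(url, json.loads(params)["q"])
```

The trade-off is exactly as described in the thread: the database no longer enforces the shape of headers/params, so any validation has to live in application code.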
Trying to normalize first database for practice.
Company ID, Company Name, Amenity 1, Amenity 2, Amenity 3, Address 1, Address 2, City, State, Postal Code
How would you normalize this?
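One conventional answer, sketched with sqlite3 (key names and constraints are my assumptions): the repeating Amenity 1/2/3 columns violate 1NF, so amenities get their own table plus a junction table; the address columns can stay on the company row if each company has exactly one address.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
-- Repeating Amenity 1..3 columns become rows in their own tables.
CREATE TABLE company (
    company_id INTEGER PRIMARY KEY,
    company_name TEXT NOT NULL,
    address_1 TEXT, address_2 TEXT,
    city TEXT, state TEXT, postal_code TEXT
);
CREATE TABLE amenity (
    amenity_id INTEGER PRIMARY KEY,
    name TEXT UNIQUE NOT NULL
);
CREATE TABLE company_amenity (  -- many-to-many junction
    company_id INTEGER REFERENCES company(company_id),
    amenity_id INTEGER REFERENCES amenity(amenity_id),
    PRIMARY KEY (company_id, amenity_id)
);
""")

con.execute("INSERT INTO company (company_id, company_name, city) VALUES (1, 'Acme', 'Oslo')")
con.execute("INSERT INTO amenity (amenity_id, name) VALUES (1, 'Pool'), (2, 'Gym')")
con.executemany("INSERT INTO company_amenity VALUES (?, ?)", [(1, 1), (1, 2)])

# A company can now have any number of amenities, not just three.
n = con.execute("SELECT COUNT(*) FROM company_amenity WHERE company_id = 1").fetchone()[0]
print(n)
```

If addresses can be shared or a company can have several, the address columns would similarly move to their own table with a foreign key.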
How we denormalized our table to get rid of complex and slow queries.
https://medium.com/@taukeer/de-normalization-of-database-for-read-performance-220cd50ac827
Hello all, I'm working on a database schema for a small program I'm making. The program's goal is to take an item we manufacture and create a list of components (a Bill of Materials). I have three database tables that seem to accomplish this well, but the third table isn't in a normal form and I don't know how to fix it.
Manufactured Unit
PK: UnitID
VarChar(255): Unit Name
Boolean: NeedsPump
Equipment
PK: EquipmentID
VarChar(255): Cut Sheet
VarChar(255): Equipment Name
UnitEquipment
FK: UnitID
FK: EquipmentID
Int: Quantity
The natural fit seems to be combining UnitID and EquipmentID into a primary key and then using it. That said, I'm not sure of the underlying mechanism for how the two IDs are combined, and I have concerns. For instance, if UnitID is 1 and EquipmentID is 12, a concatenated PK would be '112'; this would be the same as a UnitID of 12 and an EquipmentID of 1. Are these concerns unfounded? Does making the primary key a combination of the two FKs put this table in 3NF? Sorry if this question is extremely simple; I'm not an SQL expert.
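The concern is unfounded: a composite primary key is a tuple of the two columns, not a string concatenation, so (1, 12) and (12, 1) are distinct keys. A quick demonstration with sqlite3 (any SQL database behaves the same way):

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("""
CREATE TABLE unit_equipment (
    unit_id INTEGER,
    equipment_id INTEGER,
    quantity INTEGER NOT NULL,
    PRIMARY KEY (unit_id, equipment_id)
)
""")

con.execute("INSERT INTO unit_equipment VALUES (1, 12, 4)")
con.execute("INSERT INTO unit_equipment VALUES (12, 1, 2)")  # no conflict with (1, 12)

try:
    # Only an exact repeat of the (unit_id, equipment_id) pair is rejected.
    con.execute("INSERT INTO unit_equipment VALUES (1, 12, 9)")
except sqlite3.IntegrityError as e:
    print("rejected duplicate key:", e)

count = con.execute("SELECT COUNT(*) FROM unit_equipment").fetchone()[0]
print(count)
```

With (UnitID, EquipmentID) as the key and Quantity depending on the whole key, this junction table is a perfectly standard many-to-many design.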
Hi, new to DBs here. I have always been fascinated by how various categories of data can be organized and normalized. Is there any online site, similar to freeCodeCamp, that lets you normalize tables from scratch and tells you if it's correct or not? I feel that's the best way to learn and get better at it. Please feel free to mention other good ways to practice. Thanks.
https://imgur.com/a/hJVTuFH. Using Access Bible 2016. Trying normalization. Beginner on Ch. 8 Queries
Customer Contact is parent. Contact and Company are children?
Tried separate tables for account manager and commodity.
So why not leave them in the main table?
I am trying to figure out how Python classes work with data tables that I want to save to a PostgreSQL database.
For example, I am trying to make a (practice) program that will keep track of workers in a large company (corporation incorporated) including their name, phone number(s), email address, and physical address. (Basically one could enter information about an employee and have it update the database, read info off the database, or erase an employee.)
One way I could do this is to make an Employee class which has all of these items as attributes. For example:
class Employee:
    def __init__(self, f_name, l_name, phone, email, st_address, postal_code, employee_id):
        ...  # with the definition of each of these items under it, etc.
But when I write these items to a database, I start getting data normalization issues.
For example: What if someone has 3 phone numbers? Or several emails? Or if two of the employees have the same address and gasp shared landline?
If I were just normalizing a (relational) database, I would have a separate table for each of these things. For example, I would have a phone number table (with a phone number ID as the primary key) which would have an employee_id associated with it as a foreign key. This way, two employees that (gasp) share a landline won't create issues: when one of these employees retires and I erase the phone number associated with them, I don't lose data for the other employee.
I am confused about how to set up objects in Python to deal with this data.
Should I have a separate class for each table in the database? This seems like it creates an awful lot of files. Maybe that's ok. Is that the convention?
That's how I am thinking of handling it currently. For example, PhoneNumber is a class that has a phone_id, a type (mobile, landline, etc.), and an owner's employee_id, all stored in a table that mirrors the class.
Or am I thinking about classes wrong? Am I thinking about classes in the way that I think about tables because I have a little background in databases and no background in OOP?
Is there a different convention for how one structures classes for data that is to be stored and read from a database? (If so what is it, and could you explain why it is done?)
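On the convention question: one class per table is in fact the common pattern (ORMs such as SQLAlchemy and Django formalize exactly this mapping), and the classes usually live together in one module rather than one file each. A minimal hand-rolled sketch, with all names assumed:

```python
from dataclasses import dataclass

# One class per table, mirroring the normalized schema described above.
@dataclass
class Employee:
    employee_id: int
    f_name: str
    l_name: str
    st_address: str
    postal_code: str

@dataclass
class PhoneNumber:
    phone_id: int
    number: str
    kind: str          # "mobile", "landline", ...
    employee_id: int   # foreign key back to Employee

alice = Employee(1, "Alice", "Ng", "1 Main St", "12345")
bob = Employee(2, "Bob", "Ng", "1 Main St", "12345")

# The (gasp) shared landline: two rows, one per owner, so deleting
# Alice's row (phone_id 7) leaves Bob's row (phone_id 8) intact.
shared = PhoneNumber(7, "555-0100", "landline", alice.employee_id)
also_shared = PhoneNumber(8, "555-0100", "landline", bob.employee_id)
print(shared.number == also_shared.number)
```

So thinking about classes the way you think about tables is not wrong here; an ORM just automates the load/save plumbing between the two.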
Me: Well, I don't remember. I just ensure I don't have to store redundant data when designing schemas.
Interviewer: Gives a wry smile.
Me: (Fuck, I've messed up the interview. I guess I won't get this job...)
I come back home and read up on normalization.
Five Hours Later:
Interviewer: Congratulations, you got the job!!
On the first day, I install MySQL and insert a subset of the production data. Every query needs an is_active clause to filter out archived data. I realized why I got the job.
I'm trying to normalize the following conceptual model. How have I done?
I can't quite understand or describe why I'm having such a hard time with this, but after dozens of attempts it still hasn't clicked. 1NF makes perfect sense in most cases, but as soon as "functional dependency" kicks in, my mind somehow blanks out and I'm never able to consistently get it right.
Do you have any recommendations for a proper source with good examples and solutions? Preferably one I can try in a MySQL database, to see why doing it a different way would lead to inconsistencies.
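One way to "see" a functional dependency going wrong is to create the anomaly deliberately. A sketch using sqlite3 (the same SQL runs in MySQL; table and column names are made up for the example):

```python
import sqlite3

# dept_name depends on dept_id, not on the key employee_id. Storing it
# per-employee lets the copies drift apart: the classic update anomaly.
con = sqlite3.connect(":memory:")
con.execute(
    "CREATE TABLE emp (employee_id INTEGER PRIMARY KEY, dept_id INTEGER, dept_name TEXT)"
)
con.executemany(
    "INSERT INTO emp VALUES (?, ?, ?)",
    [(1, 10, "Sales"), (2, 10, "Sales")],
)

# Rename the department for only one employee: the database happily allows it.
con.execute("UPDATE emp SET dept_name = 'Marketing' WHERE employee_id = 1")

names = {row[0] for row in con.execute("SELECT dept_name FROM emp WHERE dept_id = 10")}
print(names)  # dept 10 now has two conflicting names
```

Moving dept_name into its own departments table, keyed by dept_id, makes this inconsistency impossible rather than merely unlikely, which is the whole point of chasing functional dependencies.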
This might just be a misconception on my part about DB architecture, so I am looking for a good strategy for normalizing my database structure.
The issue I'm having is that some of the models I'm building share many attributes with entities that are not necessarily related to one another. Here's my example (albeit simplified). I have the following models:
I also have models for the following:
A profile belongs to a user; an organization does not. Both organizations and profiles have locations, emails, and phones. An application is associated with a profile. I need to allow the profile to be updated by the User (or Admin), but the application must remain frozen once submitted. So an application is really just a representation of the state of several aggregated models at the point of submission. Similarly, I would like the profile to simply represent the state of other associated models. Thus, if a user updates their profile, I retain the state of their original profile by creating a new profile record and a new address record. So the profile table would just be a series of foreign keys that reference a combination of records.
The issue is that Location, Email, Phone, etc. are associated with multiple models; Organizations and Profiles both reference them. However, Rails seems to always put the foreign key on the child model. In my particular situation, it would seem to make more sense to put the foreign key on the parent model, so that Organization records contain a foreign key for email, as do Profiles, or any other models that use that table.
Is it bad to put the foreign key on the parent model? Will ActiveRecord allow me to do this? Is there a better way to set up these associations?