Why is naming a table's Primary Key column "Id" considered bad practice? [closed]

Question 1

My t-sql teacher told us that naming our PK column "Id" is considered bad practice without any further explanations.

Why is naming a table PK column "Id" is considered bad practice?

Question 2

Well, it is not a description actually and Id means "identity" which is very self explanatory. That is my opinion.

Question 3

I'm sure there are plenty of shops that use Id as a PK name. I personally use TableId as my naming convention but I wouldn't tell anyone its THE ONE TRUE WAY. It sounds like your teacher is just trying to present her opinion as widely accepted best practice.

Question 4

Definitively the type of bad practice that isn't that bad. The point is to get consistent. If you use id, use it everywhere or don't use it.

Question 5

You have a table...it's called People, it's got a column, called Id, what do you think the Id is? A car? A Boat? ...no it's the People Id, that's it. I don't think it's not a bad practice and it's not necessary to name the PK column anything other than Id.

Question 6

So the teacher gets in front of the class and tells you this is a bad practice without a single reason? That's a worse practice for a teacher.

Question 7

I'm going to come out and say it: It's not really a bad practice (and even if it is, its not that bad).

You could make the argument (as Chad pointed out) that it can mask errors like in the following query:

SELECT * 
 FROM cars car
 JOIN manufacturer mfg
 ON mfg.Id = car.ManufacturerId
 JOIN models mod
 ON mod.Id = car.ModelId
 JOIN colors col
 ON mfg.Id = car.ColorId

but this can easily be mitigated by not using tiny aliases for your table names:

SELECT * 
 FROM cars
 JOIN manufacturer
 ON manufacturer.Id = cars.ManufacturerId
 JOIN models
 ON models.Id = cars.ModelId
 JOIN colors
 ON manufacturer.Id = cars.ColorId

The practice of ALWAYS using 3 letter abbreviations seems much worse to me than using the column name id. (Case in point: who would actually abbreviate the table name cars with the abbreviation car? What end does that serve?)

The point is: be consistent. If your company uses Id and you commonly make the error above, then get in the habit of using full table names. If your company bans the Id column, take it in stride and use whatever naming convention they prefer.

Focus on learning things that are ACTUALLY bad practices (such as multiple nested correlated sub queries) rather than mulling over issues like this. The issue of naming your columns "ID" is closer to being a matter of taste than it is to being a bad practice.

A NOTE TO EDITORS : The error in this query is intentional and is being used to make a point. Please read the full answer before editing.

Question 8

@Chad, it was a reference to the fact that in that particular query you used 3 letters for aliases to table names, even when it made no sense (cars -> car. Thank God--you saved my fingers). Don't read too deeply into it.

Question 9

worse than my alias names, was my mixing of plurality cars and manufacturer. One is plural, the other isn't. If people want to pick at the db, that's the bad practice that should be picked upon.

Question 10

I think it is bad practice. Obviously, as far as bad practice goes it's not terrible.. but it's so easy to avoid, why not do so? Now, I agree, for an intro class, is this the thing to focus on? Probably not....

Question 11

@user606723, actually, I think for an intro class it's an important thing to make a point of. Teaching best practices should be fundamental. It's not until you're experienced that you understand the consequences, and tradeoffs and when you should deviate from best practices.

Question 12

@Chad, Agree to disagree. Trying to teach students "best practices" without allowing them to understand why those are the best practices is a futile effort. And since you can't cover everything, glossing over this point with "you just shouldn't do it, you'll figure out why later" is pretty sane choice from a professor's standpoint. Curious students can post on here or hopefully find this question already answered. (Or ask after class)

Question 13

Because when you have a table with a foreign key you can't name that foreign key "Id". You have table name it TableId

And then your join looks like

SELECT * FROM cars c JOIN manufacturer m ON m.Id = c.ManufacturerId

And ideally, your condition should have the same field name on each sides

SELECT * FROM cars c JOIN manufacturer m ON m.ManufacturerId = c.ManufacturerId

So while it seems redundant to name the Id as ManufacturerId, it makes it less likely that you have errors in your join conditions as mistakes become obvious.

This seems simple, but when you join several tables, it gets more likely you'll make a mistake, find the one below...

SELECT * 
 FROM cars car 
 JOIN manufacturer mfg
 ON mfg.Id = car.ManufacturerId
 JOIN models mod
 ON mod.Id = car.ModelId
 JOIN colors col
 ON mfg.Id = car.ColorId

Whereas with proper naming, the error sticks out...

SELECT * 
 FROM cars car 
 JOIN manufacturer mfg
 ON mfg.ManufacturerId = car.ManufacturerId
 JOIN models mod
 ON mod.ModelId = car.ModelId
 JOIN colors col
 ON mfg.ManufacturerId = car.ColorId

Another reason naming them Id is "bad" is that when you are querying for information from several tables you will need to rename the Id columns so you can distinguish them.

SELECT manufacturer.Id as 'ManufacturerId'
 ,cars.Id as 'CarId'
 --etc
 FROM cars 
 JOIN manufacturer
 ON manufacturer.Id = cars.Id

With accurate names this is less of an issue

Question 14

Not sure this is a good enough explanation to me. There's nothing wrong with saying SELECT * FROM cars c JOIN manufacturer m ON manufacturer.Id = c.ManufacturerId. I have used id for years and never found what you described to be a real problem.

Question 15

I would say that the bad practice here is to alias tables with name like mfg or mod. manufacturers.id = cars.manufacturer_id is very readable and the error will stick out too.

Question 16

@Chad > I already got problems with dumb variable names. Many times. For the reccord, here what I would say to a dev that does this in my team « mfg doesn't mean manufacturer, it means you are to lazy to type manufacturer ».

Question 17

@Stargazer712: Doing SELECT * from 2 tables gives 2 x ID columns. ID is now ambiguous: do you reference by ordinal or name? SELECT * is not good practice either. Poor arguments. Fragile code. Chad is correct: defensive coding basically

Question 18

@gbn, again, all ambiguity is gone if you simply do SELECT manufacturer.id FROM .... Every difficulty resulting from id can be overcome very easily, making it simply a matter of taste.

Question 19

Ruby's ActiveRecord library and Groovy's GORM use "id" for the surrogate key by default. I like this practice. Duplicating the table name in each column name is redundant, tedious to write, and more tedious to read.

Question 20

+1 for "and more tedious to read." - naming conventions shouldn't be thought of as a band-aid for sloppy code, they should be improving readability as a primary concern.

Question 21

ID is far more tedious to read

Question 22

@HLGEM: One can always qualify the column name with the table name.

Question 23

I would agree except for the more tedious to read. I actually prefer reading more descriptive column names and spending less time figuring out what that column actually is for.

Question 24

+1, Hate seeing tables with columns like Posts.postID, Posts.postName where simply using post.id and post.name is far prettier.

Question 25

Common or key column names like "Name" or "Id" should be prefixed with the TableName.

It removes ambiguity, easier to search for, means far less column aliases when both "Id" values are needed.

A lesser used or audit column or non-key (say LastUpdatedDateTime) doesn't matter

Question 26

If you do this, I hate you for making me do extra typing!!!! The table's name is Person, what do you think the Id is going to be? Car? no, it's Person.

Question 27

@jim, I don't know about you, but typing 6 extra characters takes me roughly half a second. And considering I rarely ever select from one table, and thus would end up with two columns named 'Id' and will need to include the table/alias name anyhow, there is no savings in the number of characters typed.

Question 28

@Chad I find it superfluous. if I'm doing a join, c.id = m.manufacturerid, is ok with me. These columns are typically "mapped" to a class somehow, and to have a class with Person.PersonId makes me want to vomit...Yes,I am fully aware I have issues.

Question 29

I also disagree with this. Why stop at name and id? Why not have every column prefixed with its table name? It seems arbitrary to pick those two names to mandate a prefix. Conceptually, you must have the table in order to have the context of a column anyway. Why not just use that table name to clarify the query: Person.Name, Animal.Name, Part.Name,...

Question 30

@Bill Leeper, DRY is not always appropriate in database development. In databases what is important is performance and making the database do extra work to fullfill DRY principles (such as using scalar functions or views that call views or queries that return too many columns or using a cursor to add 1000000 records to use an existing proc) is often contraindicated. Do not think that just because something is good in the Object-oriented world that it is appropriate in database design. Your downvote was inappropriate. Using ID is a known SQL antipattern.

Question 31

Not using Id is a bad practice. The Id column is special; it is the primary key. Any table can have any number of foreign keys, but it can have only one key that is primary. In a database where all primary keys are called Id, as soon as you look at the table you know exactly which column is the primary key.

For months I've spent all day every day working in lots of big databases (Salesforce) and the best thing I can say about the schemas is that every table has a primary key called Id. I never get confused about joining a primary key to a foreign key because the primary key is called Id.

Tables can have long silly names like Table_ThatDoesGood_stuff__c; that name is bad enough because the architect had a hangover the morning he thought up that table, but now you are telling me that it's bad practice not to call the primary key Table_ThatDoesGood_stuff__cId (remembering that SQL column names aren't in general case sensitive).

The problems with most people who teach computer programming are that they haven't written a line of production code in years, if ever, and they have no idea what a working software engineer actually does. Wait until you start working and then make up your own mind what you think is a a good idea or not.

Question 32

That's only the case if none of your primary keys is a composite key, which is, unfortuately, far too often the case. One should really only use surrogate keys in particular circumstances.

Question 33

Using Id like that you end up with a situation where without thinking developers add primary key id to each and every table they make. One of the foundations of relational databases is the use meaningfull and aggregate primary keys and using id does not help.

Question 34

This answer just seems like you're saying "I prefer Id" with opinionated arguments. Your argument is you can instantly see which key is the primary key by finding the one called Id. Well, it's the exact same with tableId. I can guarantee you I never get confused which key is the primary key either. I just look for the one that has the table name before the id. Not only that, but what kind of heathen tables are you working with where the first column isn't the primary key? I feel like all your arguments in this answer are purely preferential based and akin to "tableId feels wrong to me".

Question 35

@dallin all the answers are opinionated, otherwise someone would just link to the official standard, and there isn't one!

Question 36

@James I feel like the goal of StackExchange sites is to reduce opinionated answers. That's why questions get closed as being too opinionated. Granted, I think that's kind of why this question got closed - because it elicits opinionated answers, but I feel this specific answer was overly opinionated without any real supporting facts or arguments based on facts.The entire answer can be summarized by saying, "In my opinion, you should use Id, just because that's what I like". That's not a valuable answer.

Question 37

From data.stackexchange.com

Id in Posts

BOOM, question answered.
Now go tell your teacher that SO practice bad database design.

Question 38

My guess at the FKs, based on the names: PostTypeId -> PostTypes.Id; AcceptedAnswerId -> Answers.Id; OwnerUserId -> Users.Id. Why should a practice that is that easy be considered 'bad'?

Question 39

How exactly does this prove anything about best practices?

Question 40

Whether something is used at stack or not does not prove if it's good or bad practice.

Question 41

What it does prove is that this practice in no way prohibits the scalability and usefulness of an application.

Question 42

Actually SO practice is not perfect. I would use this naming: PostType -> PostType.Id; AcceptedAnswer -> Answer.Id; OwnerUser -> User.Id

Question 43

I don't consider it bad practice. Consistency is king, as usual.

I think it's all about context. In the context of the table on its own, "id" just means exactly what you expect, a label to help uniquely identify it against others that might otherwise be (or appear) identical.

In the context of joins, it's your responsibility to construct the joins in such a way as to make it readable to you and your team. Just as it is possible to make things look difficult with poor phrasing or naming, it is equally possible to construct a meaningful query with effective use of aliases and even comments.

In the same way a Java class called 'Foo' doesn't have its properties prefixed by 'Foo', don't feel obliged to prefix your table IDs with table names. It is usually clear in context what the ID being referred to is.

Question 44

Relational database tables are not classes.

Question 45

They are however data structures and they're analogous to PODO classes. The same naming problems apply.

Question 46

@Slomojo: No, they're not analogous to simple objects. Object-oriented design and database design are not the same, and are not even related. While they can, in some cases, yield similar (or even the same) design, that does not indicate that they are related. For example, m:m relationships are trivial in classes, but are impossible have between two tables without a third association table.

Question 47

Quite how this relates to a naming strategy, I don't know. My analogy is (clearly?) only scoped to that extent.

Question 48

I'm sorry my implied meaning wasn't very clear, I should have said "in this sense they are analogous to classes". Either way, I don't think being overly pedantic about this is particularly constructive. In terms of naming, tables and classes do share a significant amount of similarities. Best practices which develop in a cul-de-sac are fair game for revision, or at the very least are open to discussion. There's plenty within this Q&A that illustrate this effectively, I don't have anything else of note to add.

Question 49

There is a situation where sticking "ID" on every table isn't the best idea: the USING keyword, if it's supported. We use it often in MySQL.

For example, if you have fooTable with column fooTableId and barTable with foreign key fooTableId, then your queries can be constructed as such:

SELECT fooTableId, fooField1, barField2 FROM fooTable INNER JOIN barTable USING (fooTableId)

It not only saves typing, but is much more readable compared to the alternative:

SELECT fooTable.Id, fooField1, barField2 FROM fooTable INNER JOIN barTable ON (fooTable.Id = barTable.foTableId)

Question 50

This is the answer that sticks out most for me. The USING keyword is supported by postgres/mysql/sqlite database, means less typing which some of the other answers list as a reason for using id, and finally in my subjective opinion is more readable.

Question 51

It makes it hard (and confusing) to perform a natural join on the table, therefore yeah, it's bad if not very bad.

Natural Join is an ancient artifact of SQL Lore (i.e. relational algebra) you may have seen one of these: ⋈ in a database book perhaps. What I mean is Natrual Join is not a new fangled SQL idea, even though it seemed to take forever for DBMS's to have implemented it, therefore it's not a new fangled idea for you to implement it, it might even be unreasonable for you to ignore its existence nowadays.

Well, if you name all your primary key's ID, then you lose the ease and simplicity of the natural join. select * from dudes natural join cars will need to be written select * from dudes inner join cars where cars.dudeid = dudes.id or select * from dudes inner join cars where dudes.carid = cars.id. If you are able to do a natural join, you get to ignore what the relation actually is, which, I believe, is pretty awesome.

Question 52

Unless you are writing a stored procedure when is the last time as an application developer your actually wrote a fully formatted SQL selector? Modern languages all include some sort of ORM feature that manages this relationship for you. The column name is far more important than being able to write clean manual SQL.

Question 53

@Bill I do all the time, many, many times a day, depends more on your codebase than the language you're developing. And, for diagnostics, if you want to do some good and deep relations you can string together those natural joins and completely avoid looking up field ID's. And, as St. Jerome famously said, "Ignorance of SQL is ignorance of databases".

Question 54

Aside from the fact that natural joins are not universally supported, IMO, natural joins are harder to read. Are there two columns in relationship or only one? Far better to be explicit and avoid natural joins.

Question 55

@Thomas, I wouldn't put natural joins in code either, but for diagnostics, I've found them pretty useful when the database is modeled so that they actually work.

Question 56

Why not just ask your teacher?

Think about this, when all your tables PK columns are named ID it makes using them as foreign keys a nightmare.

Column names need to be semantically significant. ID is to generic.

Question 57

too generic for what? the id of a table?

Question 58

@Jim of which table? id alone doesn't mean anything, especially in the context of a foreign key to another table. This has nothing to do with classes and everything to do with good basic fundamental relational database design.

Question 59

To be slightly fatuous, the table which it belongs to. table.id is a perfectly acceptable way of referring to an id field. Prefixing the field name with the table name is redundant.

Question 60

@Slomoj it is no more typing than including the name in the column and more explict when aliasing table names to single or double letter abbreviations in monster joins.

Question 61

Of what nightmare are you referring? Suppose you have a self-referencing structure of employees with a column representing their manager. What do you call the foreign key? You can't call it EmployeeId as that is presumably your PK. The name of the FK does not have to match the PK. It should be named for what it represents to the entity in which it is contained.

Question 62

ID is bad for the following reasons:

If you do a lot of reporting queries you always have to alias the columns if you want to see both. So it becomes a waste of time when you could name it properly to begin with. These complex queries are hard enough (I write queries that can be hundreds of lines long) without the added burden of doing unnecessary work.

It is subject to causing code errors. If you use a database that allows the use of the natural join (not that I think you should ever use that but when features are available somebody will use them), you will join on the wrong thing if you get a developer that uses it.

If you are copying joins to create a complex query, it is easy to forget to change the alias to the one you want and get an incorrect join. If each id is named after the table it is in, then you will usually get a syntax error. It is also easier to spot if the join ina complex query is incorrect if the pPK name and the FK name match.

Question 63

+1: These are compelling reasons in my opinion, while the other answers opposing ID don't convince me at all.

Question 64

Re: "If you are copying joins to create a complex query" - so your problem is copy&paste. Stop copy&paste and you will see how convenient car.id naming is. For FK joining use car.mfg = mfg.id, car.color=color.id, car.model = model.id - very simple and matches what you would write in LINQ.

riwalk – riwalk · Accepted Answer · 2011-10-17 17:08:29Z

I'm going to come out and say it: It's not really a bad practice (and even if it is, its not that bad).

You could make the argument (as Chad pointed out) that it can mask errors like in the following query:

SELECT * 
 FROM cars car
 JOIN manufacturer mfg
 ON mfg.Id = car.ManufacturerId
 JOIN models mod
 ON mod.Id = car.ModelId
 JOIN colors col
 ON mfg.Id = car.ColorId

but this can easily be mitigated by not using tiny aliases for your table names:

SELECT * 
 FROM cars
 JOIN manufacturer
 ON manufacturer.Id = cars.ManufacturerId
 JOIN models
 ON models.Id = cars.ModelId
 JOIN colors
 ON manufacturer.Id = cars.ColorId

The practice of ALWAYS using 3 letter abbreviations seems much worse to me than using the column name id. (Case in point: who would actually abbreviate the table name cars with the abbreviation car? What end does that serve?)

The point is: be consistent. If your company uses Id and you commonly make the error above, then get in the habit of using full table names. If your company bans the Id column, take it in stride and use whatever naming convention they prefer.

Focus on learning things that are ACTUALLY bad practices (such as multiple nested correlated sub queries) rather than mulling over issues like this. The issue of naming your columns "ID" is closer to being a matter of taste than it is to being a bad practice.

A NOTE TO EDITORS : The error in this query is intentional and is being used to make a point. Please read the full answer before editing.

@Chad, it was a reference to the fact that in that particular query you used 3 letters for aliases to table names, even when it made no sense (cars -> car. Thank God--you saved my fingers). Don't read too deeply into it.
worse than my alias names, was my mixing of plurality cars and manufacturer. One is plural, the other isn't. If people want to pick at the db, that's the bad practice that should be picked upon.
I think it is bad practice. Obviously, as far as bad practice goes it's not terrible.. but it's so easy to avoid, why not do so? Now, I agree, for an intro class, is this the thing to focus on? Probably not....
@user606723, actually, I think for an intro class it's an important thing to make a point of. Teaching best practices should be fundamental. It's not until you're experienced that you understand the consequences, and tradeoffs and when you should deviate from best practices.
@Chad, Agree to disagree. Trying to teach students "best practices" without allowing them to understand why those are the best practices is a futile effort. And since you can't cover everything, glossing over this point with "you just shouldn't do it, you'll figure out why later" is pretty sane choice from a professor's standpoint. Curious students can post on here or hopefully find this question already answered. (Or ask after class)

Stack Exchange Network

Why is naming a table's Primary Key column "Id" considered bad practice? [closed]

19 Answers 19

Linked

Hot Network Questions

Why is naming a table's Primary Key column "Id" considered bad practice? [closed]

19 Answers 19

Linked

Related

Hot Network Questions