Is a new Boolean field better than a null reference when a value can be meaningfully absent?

Question 1

For example, suppose I have a class, Member, which has a lastChangePasswordTime:

class Member{
 .
 .
 .
 constructor(){
 this.lastChangePasswordTime=null,
 }
}

whose lastChangePasswordTime can be meaningful absent, because some members may never change their passwords.

But according to If nulls are evil, what should be used when a value can be meaningfully absent? and https://softwareengineering.stackexchange.com/a/12836/248528 , I shouldn't use null to represent a meaningfully absent value. So I try to add a Boolean flag:

class Member{
 .
 .
 .
 constructor(){
 this.isPasswordChanged=false,
 this.lastChangePasswordTime=null,
 }
}

But I think it is quite obsolete because:

When isPasswordChanged is false, lastChangePasswordTime must be null, and checking lastChangePasswordTime==null is almost identical to checking isPasswordChanged is false, so I prefer check lastChangePasswordTime==null directly
When changing the logic here, I may forget to update both fields.

Note: when a user changes passwords, I would record the time like this:

this.lastChangePasswordTime=Date.now();

Is the additional Boolean field better than a null reference here?

Question 2

This question is quite language-specific, because what constitutes a good solution largely depends on what your programming language offers. For example, in C++17 or Scala, you'd use std::optional or Option. In other languages, you may have to built an appropriate mechanism yourself, or you may actually resort to null or something similar because it's more idiomatic.

Question 3

Is there a reason you wouldn't want the lastChangePasswordTime set to password creation time (creation is a mod, after all)?

Question 4

@ChristianHackl hmm, agree that there are different "perfect" solutions, but i don't see any (major) language though where using a separate boolean would be a better idea in general than doing null/nil checks. Not totally sure about C/C++ as I haven't been active there for quite a while, though.

Question 5

@FrankHopkins: An example would be languages where variables can be left uninitialised, e.g. C or C++. lastChangePasswordTime may be an uninitialised pointer there, and comparing it to anything would be undefined behaviour. Not a really compelling reason not to initialise the pointer to NULL / nullptr instead, especially not in modern C++ (where you wouldn't use a pointer at all), but who knows? Another example would be languages without pointers, or with bad support for pointers, maybe. (FORTRAN 77 comes to mind...)

Question 6

I would use an enum with 3 cases for this :)

Question 7

I don't see why, if you have a meaningfully absent value, null should not be used if you are deliberate and careful about it.

If your goal is to surround the nullable value to prevent accidentally referencing it, I would suggest creating the isPasswordChanged value as a function or property that returns the result of a null check, for example:

class Member {
 DateTime lastChangePasswordTime = null;
 bool isPasswordChanged() { return lastChangePasswordTime != null; }
}

In my opinion, doing it this way:

Gives better code readability than a null-check would, which might lose context.
Removes the need for you having to actually worry about maintaining the isPasswordChanged value that you mention.

The way that you persist the data (presumably in a database) would be responsible for ensuring that the nulls are preserved.

Question 8

+1 for the opening. Wanton usage of null is generally disapproved of only in cases where null is an unexpected outcome, i.e. where it wouldn't make sense (to a consumer). If null is meaningful, then it's not an unexpected outcome.

Question 9

I would put it a little stronger as never have two variables that maintain the same state. It will fail. Hiding a null value, assuming the null value makes sense inside the instance, from the outside is a very good thing. In this case where you may later decide that 1970年01月01日 denotes that the password has never been changed. Then the logic of the rest of the program does not have to care.

Question 10

You could forget to call isPasswordChanged just like you can forget to check if it's null. I see nothing gained here.

Question 11

@Gherman If the consumer doesn't get the concept that null is meaningful or that he needs to check for the value being present, he is lost either way, but if the method is used, it is clear to everyone reading the code why there is a check and that there is a reasonable chance that the value is null/not present. Otherwise it is unclear if it was just a developer adding a null check just "because why not" or whether it's part of the concept. Sure, one can find out, but one way you have the information directly the other you need to look into the implementation.

Question 12

@FrankHopkins: Good point, but if concurrency is used, even the inspection of a single variable may require locking.

Question 13

nulls aren't evil. Using them without thinking is. This is a case where null is exactly the correct answer - there is no date.

Note that your solution creates more problems. What is the meaning of the date being set to something, but the isPasswordChanged is false? You've just created a case of conflicting information that you need to catch and treat specially, while a null value has a clearly defined, unambiguous meaning and cannot be in conflict to other information.

So no, your solution isn't better. Allowing for a null value is the right approach here. People who claim that null is always evil no matter the context don't understand why null exists.

Question 14

Historically, null exists because Tony Hoare made the billion-dollar mistake in 1965. People who claim that null is "evil" are over-simplifying the topic, of course, but it's a good thing that modern languages or programming styles are moving away from null, replacing it with dedicated option types.

Question 15

@ChristianHackl: just out of historical interest - die Hoare ever say what the better practical alternative would have been (without switching to a completely new paradigm)?

Question 16

@AnoE: Interesting question. Don't know about what Hoare had to say on that, but null-free programming is often a reality these days in modern, well-designed programming languages.

Question 17

@Tom wat? In languages like C# and Java, the DateTime field is a pointer. The whole concept of null only makes sense for pointers!

Question 18

@Tom: Null pointers are only dangerous on languages and implementations that allow them to be blindly indexed and dereferenced, with whatever hilarity may result. In a language which traps attempts to index or dereference a null pointer, it will be often less dangerous than a pointer to a dummy object. There are many situations where having a program abnormally terminate may be preferable to having it produce meaningless numbers, and on systems that trap them, null pointers are the right recipe for that.

Question 19

Depending on your programming language, there might be good alternatives, such as an optional data type. In C++(17), this would be std::optional. It can either be absent or have an arbitrary value of the underlying data type.

Question 20

There's Optional in java too; the meaning of a null Optional being up to you...

Question 21

Came here to connect with Optional as well. I'd encode this as a type in a language which had them.

Question 22

@FrankHopkins It's a shame that nothing in Java prevents you from writing Optional<Date> lastPasswordChangeTime = null;, but I wouldn't give up just because of that. Instead, I would instill a very firmly held team rule, that no optional values are ever allowed to be assign null. Optional buys you too many nice features to give up on that easily. You can easily assign default values (orElse or with lazy evaluation: orElseGet), throw errors (orElseThrow), transform values in a null safe way (map, flatmap), etc.

Question 23

@FrankHopkins Checking isPresent is almost always a code smell, in my experience, and a pretty reliable sign that the devs who are using these optionals are "missing the point". Indeed, it gives basically no benefit over ref == null, but it's also not something you should be using often. Even if the team is unstable, a static analysis tool can lay down the law, like a linter or FindBugs, which can catch most cases.

Question 24

@FrankHopkins: It may be that the Java community just needs a little paradigm shift, which may yet take many years. If you look at Scala, for example, it supports null because the language is 100% compatible to Java, but nobody uses it, and Option[T] is all over the place, with excellent functional-programming support like map, flatMap, pattern matching and so on, which goes far, far beyond == null checks. You are right that in theory, an option type sounds like little more than a glorified pointer, but reality proves that argument wrong.

Question 25

Correctly using null

There are different ways of using null. The most common and semantically correct way is to use it when you may or may not have a single value. In this case a value either equals null or is something meaningful like a record from database or something.

In these situations you then mostly use it like this (in pseudo-code):

if (value is null) {
 doSomethingAboutIt();
 return;
}
doSomethingUseful(value);

Problem

And it has a very big problem. The problem is that by the time you invoke doSomethingUseful the value may not have been checked for null! If it wasn't then the program will likely crash. And the user may not even see any good error messages, being left with something like "horrible error: wanted value but got null!" (after update: although there may be even less informative error like Segmentation fault. Core dumped., or worse yet, no error and incorrect manipulation on null in some cases)

Forgetting to write checks for null and handling null situations is an extremely common bug. This is why Tony Hoare who invented null said at a software conference called QCon London in 2009 that he made the billion-dollar mistake in 1965: https://www.infoq.com/presentations/Null-References-The-Billion-Dollar-Mistake-Tony-Hoare

Avoiding the problem

Some technologies and languages make checking for null impossible to forget in different ways, reducing the amount of bugs.

For example Haskell has the Maybe monad instead of nulls. Suppose that DatabaseRecord is a user-defined type. In Haskell a value of type Maybe DatabaseRecord can be equal Just <somevalue> or it may be equal to Nothing. You can then use it in different ways but no matter how you use it you cannot apply some operation on Nothing without knowing it.

For instance this function called zeroAsDefault returns x for Just x and 0 for Nothing:

zeroAsDefault :: Maybe Int -> Int
zeroAsDefault mx = case mx of
 Nothing -> 0
 Just x -> x

Christian Hackl says C++17 and Scala have their own ways. So you may want to try to find out if your language has anything like that and use it.

Nulls are still in wide use

If you don't have anything better then using null is fine. Just keep watching out for it. Type declarations in functions will help you somewhat anyway.

Also that may sound not very progressive but you should check if your colleagues want to use null or something else. They may be conservative and may not want to use new data structures for some reasons. For instance supporting older versions of a language. Such things should be declared in project's coding standards and properly discussed with the team.

On your proposal

You suggest using a separate boolean field. But you have to check it anyway and still may forget to check it. So there is nothing won here. If you can even forget something else, like updating both values each time, then it's even worse. If the problem of forgetting to check for null is not solved then there is no point. Avoiding null is difficult and you should not do it in such a way that makes it worse.

How not to use null

Finally there are common ways to use null incorrectly. One such way is to use it in place of empty data structures such as arrays and strings. An empty array is a proper array like any other! It is almost always important and useful for data structures, that can fit multiple values, to be able to be empty, i.e. has 0 length.

From algebra standpoint an empty string for strings is much like 0 for numbers, i.e. identity:

a+0=a
concat(str, '')=str

Empty string enables strings in general to become a monoid: https://en.wikipedia.org/wiki/Monoid If you don't get it it's not that important for you.

Now let's see why it's important for programming with this example:

for (element in array) {
 doSomething(element);
}

If we pass an empty array in here the code will work fine. It will just do nothing. However if we pass a null here then we will likely get a crash with an error like "can't loop through null, sorry". We could wrap it in if but that's less clean and again, you might forget to check it

How to handle null

What doSomethingAboutIt() should be doing and especially whether it should throw an exception is another complicated issue. In short it depends on whether null was an acceptable input value for a given task and what is expected in response. Exceptions are for events that were not expected. I will not go further into that topic. This answer very is long already.

Question 26

Actually, horrible error: wanted value but got null! is much better than the more typical Segmentation fault. Core dumped....

Question 27

@TobySpeight True, but I would prefer something that lays inside user's terms like Error: the product you want to buy is out of stock.

Question 28

Sure - I was just observing that it could be (and often is) even worse. I completely agree on what would be better! And your answer is pretty much what I would have said to this question: +1.

Question 29

@Gherman: Passing null where null is not allowed is a programming error, i.e. a bug in the code. A product being out of stock is part of the ordinary business logic and not a bug. You cannot, and should not try to, translate the detection of a bug to a normal message in the user interface. Crashing immediately and not executing more code is the preferred course of action, especially if it's an important application (e.g. one that performs real financial transactions).

Question 30

@EricDuminil We want to remove(or reduce) possibility of making a mistake, not remove null itself entirely, even from the background.

Question 31

In addition to all of the very good answers previously given, I'd add that every time you're tempted to let a field be null, think carefully if it might be a list type. A nullable type is equivalently a list of 0 or 1 elements, and often this can be generalized to a list of N elements. Specifically in this case, you might want to consider lastChangePasswordTime being a list of passwordChangeTimes.

Question 32

That's an excellent point. The same is often true for option types; an option can be seen as special case of a sequence.

Question 33

Ask yourself this: which behavior requires the field lastChangePasswordTime?

If you need that field for a method IsPasswordExpired() to determine if a Member should be prompted to change their password every so often, I would set the field to the time the Member was initially created. The IsPasswordExpired() implementation is the same for new and existing members.

class Member{
 private DateTime lastChangePasswordTime;
 public Member(DateTime lastChangePasswordTime) {
 // set value, maybe check for null
 }
 public bool IsPasswordExpired() {
 DateTime limit = DateTime.Now.AddMonths(-3);
 return lastChangePasswordTime < limit;
 }
}

If you have a separate requirement that newly created members have to update their password, I would add a separated boolean field called passwordShouldBeChanged and set it to true upon creation. I would then change the functionality of the IsPasswordExpired() method to include a check for that field (and rename the method to ShouldChangePassword).

class Member{
 private DateTime lastChangePasswordTime;
 private bool passwordShouldBeChanged;
 public Member(DateTime lastChangePasswordTime, bool passwordShouldBeChanged) {
 // set values, maybe check for nulls
 }
 public bool ShouldChangePassword() {
 return PasswordExpired(lastChangePasswordTime) || passwordShouldBeChanged;
 }
 private static bool PasswordExpired(DateTime lastChangePasswordTime) {
 DateTime limit = DateTime.Now.AddMonths(-3);
 return lastChangePasswordTime < limit;
 }
}

Make your intentions explicit in code.

Question 34

First, nulls being evil is dogma and as usual with dogma it works best as a guideline and not as pass/no pass test.

Secondly, you can redefine your situation in a way that it makes sense that the value can never be null. InititialPasswordChanged is a Boolean initially set to false, PasswordSetTime is the date and time when the current password was set.

Note that while this does come at a slight cost, you can now ALWAYS calculate how long it has been since a password was last set.

Question 35

Both are 'safe/sane/correct' if the caller checks before use. The issue is what happens if the caller doesn't check. Which is better, some flavour of null error or using an invalid value?

There is no single correct answer. It depends on what you are worried about.

If crashes are really bad but the answer isn't critical or has an accepted default value, then perhaps using a boolean as a flag is better. If using the wrong answer is a worse issue than crashing, then using null is better.

For the majority of the 'typical' cases, fast failure and forcing the callers to check is the fastest way to sane code, and hence I think null should be the default choice. I wouldn't put too much faith in the "X is the root of all evil" evangelists though; they normally haven't anticipated all the use cases.

TZHX TZHX 5,0722 gold badges27 silver badges33 bronze badges · Accepted Answer · 2019-02-26 08:50:42Z

72

I don't see why, if you have a meaningfully absent value, null should not be used if you are deliberate and careful about it.

If your goal is to surround the nullable value to prevent accidentally referencing it, I would suggest creating the isPasswordChanged value as a function or property that returns the result of a null check, for example:

class Member {
 DateTime lastChangePasswordTime = null;
 bool isPasswordChanged() { return lastChangePasswordTime != null; }
}

In my opinion, doing it this way:

Gives better code readability than a null-check would, which might lose context.
Removes the need for you having to actually worry about maintaining the isPasswordChanged value that you mention.

The way that you persist the data (presumably in a database) would be responsible for ensuring that the nulls are preserved.

Share

Improve this answer

answered Feb 26, 2019 at 8:50

TZHX's user avatar

TZHX TZHX

5,0722 gold badges27 silver badges33 bronze badges

12

29

+1 for the opening. Wanton usage of null is generally disapproved of only in cases where null is an unexpected outcome, i.e. where it wouldn't make sense (to a consumer). If null is meaningful, then it's not an unexpected outcome.

Flater
– Flater

2019年02月26日 09:29:30 +00:00
Commented Feb 26, 2019 at 9:29
28

I would put it a little stronger as never have two variables that maintain the same state. It will fail. Hiding a null value, assuming the null value makes sense inside the instance, from the outside is a very good thing. In this case where you may later decide that 1970年01月01日 denotes that the password has never been changed. Then the logic of the rest of the program does not have to care.

Bent
– Bent

2019年02月26日 10:24:58 +00:00
Commented Feb 26, 2019 at 10:24
3

You could forget to call isPasswordChanged just like you can forget to check if it's null. I see nothing gained here.

Gherman
– Gherman

2019年02月26日 17:01:58 +00:00
Commented Feb 26, 2019 at 17:01
9

@Gherman If the consumer doesn't get the concept that null is meaningful or that he needs to check for the value being present, he is lost either way, but if the method is used, it is clear to everyone reading the code why there is a check and that there is a reasonable chance that the value is null/not present. Otherwise it is unclear if it was just a developer adding a null check just "because why not" or whether it's part of the concept. Sure, one can find out, but one way you have the information directly the other you need to look into the implementation.

Frank Hopkins
– Frank Hopkins

2019年02月26日 19:55:58 +00:00
Commented Feb 26, 2019 at 19:55
2

@FrankHopkins: Good point, but if concurrency is used, even the inspection of a single variable may require locking.

Christian Hackl
– Christian Hackl

2019年02月27日 05:30:28 +00:00
Commented Feb 27, 2019 at 5:30

| Show 7 more comments

Stack Exchange Network

Is a new Boolean field better than a null reference when a value can be meaningfully absent?

8 Answers 8

Correctly using null

Problem

Avoiding the problem

Nulls are still in wide use

On your proposal

How not to use null

How to handle null

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Linked

Hot Network Questions

Is a new Boolean field better than a null reference when a value can be meaningfully absent?

8 Answers 8

Correctly using null

Problem

Avoiding the problem

Nulls are still in wide use

On your proposal

How not to use null

How to handle null

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Linked

Related

Hot Network Questions