The Null Coalescing Operator (??)

One of the most useful yet little-known features to come out of C# 2.0

Introduction

I'm constantly surprised by the number of developers who aren't aware of this handy piece of syntax. It's my favourite thing to come out of C# 2.0 and no developer should be without it.

Like the conditional (?:) operator's big brother... introducing it for your coding pleasure...

The Null Coalescing Operator (??)

The null-coalescing-operator is a brilliant new terse operator that provides syntax for beautifully concise if statements. Essentially, it returns the left-hand-side of the ?? operator, unless null, in which case it executes and returns the right-hand-side of the operator. This may be a statement or a variable reference. Let's jump straight to some examples:

Here's an interesting example derived from an idea in the discussions below. It shows how an operator override can be used within an object's definition to enable shorthand syntax for double-null checking. The scenario is checking an object property for null using a null-coalescing-operator, but also defaulting when null-object-reference occurs; which would normally cause a runtime exception. (Note that I don't recommend actually using this approach, I just thought it made an interesting example.)

The Rules

To use the null-coalescing-operator, there are some compile-time ground rules.

The left-hand-side must evaluate to a reference or nullable type.

All evaluated statements must be of matching type, unless they can be implicitly converted.

Summary

As you can see from the examples above, this little gem is very powerful and the possibilities are endless. Of course the benefits are purely syntactical, but it helps keep the code clean and easier to follow. I hope you enjoy it as much as I do.

Comments and Discussions

Whilst I respect your opinion, I would like to put forward an alternate point of view. I’m expecting a barrage of abuse here… but someone has to question it …

Why is a “new terse operator that provides syntax for beautifully concise if statements.” a good thing? I’ve coded in assembler for about half a dozen different processors and probably twice that many languages, and this is the sort of thing I would like to get away from, especially when inheriting someone else’s code. The purpose of a compiled language is (amongst other things) to increase legibility, maintainability; reduce errors and provide a greater degree of abstraction from the real processor. Not to make things more “terse”. If I want terse, then I’ll go back to assembler. I am quite prepared to go terse to get performance, but don't want it in a high level language.

I could come up with a language that uses "#Variable{…}" which means repeat until variable is 0… or other symbol combinations to remove the need for other language elements like if, while, for and foreach. Would these all be good things?

If this line of reasoning is followed to its conclusion, then code will more resemble hieroglyphics which needs decoding than something understandable by simply reading it.

I know this is a trivial case, but it is the view of it being taken as read as a good thing that bothers me. I would quite happily type 6 or 7 more characters to make things more readable for others (or myself) later, and many environments these days help cut down typing time finger strain if that's the worry.

Interesting, never heard anyone argue against the use of terse operators.

Surely they require less lines of code and therefore less prone to bugs, are much easier to read and understand, and produce more consistent reliable output due to the way it's compiled.

My opinion is that there's a happy medium between code being incomprehensible due to being too low level (such as assembly), and code being incomprehensible due to being overly bloated.

I agree they shouldn't be thrown in without due care and attention, otherwise again code just becomes incomprehensible (look at perl). As it stands I find C# a clean language, and welcome a hand-full of terse operators to simply common code blocks. Personally I don't want to be writting assembly any more than I want to be writting null cheecking if statements.

Luckily the good old 'if' is always there if you prefer going 'old school'.

Funnily enough I have worked in a number of shops (C++ rather than C#) where the tertiary operator was disallowed as it was hard to read. Of course this is all "in the opinion of the writers of the standards" but it was based on bugs generated, debugging effort and especially people relatively new to C++. For similar reasons the "clever" constructs that you can do with pointer dereferencing and incrementing were not allowed.

I agree less lines of code leads to less bugs... but that is a volume thing... i.e. zero lines means no bugs, thousands of lines means many bugs! Many things drive quantity of bugs ranging from requirements analysis, software architecture all the way down to quality of written lines of code. And yes, even the compiler and sometimes even processor microcode!

The following lines of code are reasonably concise – but probably load of B***cks, because I've not spent any time on it, but does serve to illustrate how I feel about it. I claim no correctness here, just an illustration…

Personally I would just feel a bit more comfortable to fly in something if this was coded in a bit more long hand and spelt out. Why do I feel this way – I think a competent coder writing it longhand (spelt out) is less likely to have bugs, and more importantly someone coming along later (possibly less competent) is less likely to add bugs.

If I feel this way about such an important “application”(!) then I should also professionally feel the same way about any production code I write. It’s a viewpoint that’s all. For me, after a few years of C and C++ I do not accept that terseness should necessarily be taken as a good thing for its own sake – especially when it’s someone else’s code Fine, when fast code is needed (say drivers, audio processing, graphics etc), but all I’m saying is that’s not what I want or expect from C#.

A competent optimising compiler should have a pretty good stab at optimising the longhand anyway. If not, and that important, I would suggest dropping down a level to C++, C or Assembler depending on your tastes.

I apologise if this seems “old school” (bit below the belt I feel – shame on you!)… one of the projects I worked on was team leader for the Right 3 computers flying the Boeing 777. This sharpens the mind somewhat and makes you try to make everything very obvious, spelt out and easily maintainable.

Also, your hint at “old school” is true. Started coding assembler in 1978, and coded many languages and app domains since and still feel I’m working on some leading edge stuff .NET 3, now 3.5 WPF, Workflow, LINQ etc and own a SW house of 40 people. In that, I don’t mind being considered “old School”!

You did say “I agree they shouldn't be thrown in without due care and attention” and actually I agree – but in moderation. The provision of terse operators can easily be abused by someone who can see them as a measure of their cleverness. Finally, this is not a big issue! It’s not the operator I mind, so much as the statement implying that “terse is better”… that I cannot agree with.

Big Al 07 - I have to agree with you on this. While using this operator may make the code more concise (don't know about "beautifully"), it certainly makes it far less readable and harder to understand. Without some other clear benefit, I'm not seeing how that's a good thing. Interesting article though - didn't know this operator existed.

I've got to agree, I came across this operator back when the c# 2.0 language spec was released and went wow, that's as good as the "?:;" operator which I almost never use.
It’s a great operator, but is someone maintaining my code has to dive off to the language reference to fix a bug or add a feature I've failed as a developer.

As developers we have the heavy burden of writing code that performs a function (compiles to a working program) and clearly tells the story of what we were trying to achieve (without the need for comments, as they will always be derivative). This feature doesn't add anything to the earlier and reduces the quality of the later.

As for the issue of more code introduces errors, I think this is one off those off the cuff comments that has become so common that it hast lost all meaning. It refers to the complexity of the code as a function of the complexity of the problem, not size of your CS files as such. For example, writing a recursive XML node walker with hard coded element names instead of using XPath has a lot higher scope for error and will require more maintenance. While writing MissileControl.TestLaunchAllMissiles() instead MC.Go(false) has a lot less scope for error and someone unfamiliar with the code will have a good sense of what it does.

...but is someone maintaining my code has to dive off to the language reference to fix a bug or add a feature I've failed as a developer

Depending upon your usages then I'd question if you have failed or whether they have - it's hardly difficult to comprehend the use of this operator, therefore I would say if they have to dive off to the language reference it is they that have failed to do their job.

If you had written something along the lines

a ?? b ?? c ?? d ?? e ?? "Hello World"

then yes, one could question your code but if it's something as simple as

Terse means to be very expressive with very little spoken. Assembler is not terse by any stretch; in fact, it's verbose. Probably the most meaningfully verbose language, by design.

As for what you actually meant to say, which is that adding vocabulary decreases legibility, I'd say this one is justified. The idea that zero/empty/nothing/unassigned can be expressed by anything more than null is bad enough; allowing the developer to address that ambiguity and unify the concept of "null" in an atomic expression allows for the code to be read more at its intended level of abstraction, thus keeping the reader's train of thought on track.

Furthermore, I don't think this particular operator is as much of a pedagogical challenge as, say, nullable types, which requires the reader to not only know the operator but also understand new concepts. You'll only ever have to look it up once in your life (which takes less than a minute thanks to modern technology ) to appreciate it.

It all comes down to readability of the source code. Does the terse operator make the code more or less readable?

A terse construct can improve readability by expressing a common concept in a distinctive, non-redundant way, that can be conceived with least cognitive load. It can also worsen readability, by encoding an uncommon, yet complex construct in a simplistic, non-intuitive way.

For the operator in question, ther next best alternative to value ?? default would be either the ternary operator, value != null ? value : default;, or restructuring the code into an if/else block. Both introduce more code points to understand, and have an redundant occurance of value.

Worse, I could write code that looks like thi pattern, but isn't - either intentionally, or as a mistake:object x = (Container.Matrix[i,j] != null) ? Container.Matrix[j,i] : Container.DefaultValue;

I don't think it's fair to compare assembler to this. Assembler has a completely different problem: the same idea can be expressed in many valid (and - depending on context preferrable) ways, e.g.