What is this talk of "release"? Klingons do not make software "releases". Our software escapes, leaving a bloody trail of designers and quality assurance people in its wake!
— From "Top 12 Things a Klingon Programmer Would Say"
Have you ever been on that project? The project that feels like all the code was written by a ham-fisted Klingon? I have, fortunately only once or twice. But there is rarely a code base that doesn't have weak spots here and there. And while language-specific coding guidelines can be found 'a-plenty, I find that the majority of their advice is the same. About the only parts that are different are the parts that don't really make much difference to readability, maintainability or other such values ... things like capitalization of identifiers and brace alignment. So, this is my endeavor to collect the language-agnostic coding guidelines I have come across over the years and put them in one place where hopefully they can just be referred to and language-specific guidelines can concentrate on the differences.
Because there are very few complete absolutes when talking about coding standards, it is useful to document not only the rule itself, but how stringently we expect adherence to the rule to be. To this end, each rule will be stated as one of the following:
- DO - Do guidelines should always be followed (or have a really, really good reason why it needs to be broken in this specific case)
- CONSIDER - Consider guidelines should generally be followed, but if you fully understand the reasoning behind a guideline and have a good reason to not follow it anyway, you should not feel bad about breaking the rules
- AVOID - Avoid guidelines indicate that something is generally not a good idea, but there are known cases where breaking the rule makes sense
- DO NOT - Do not guidelines are something you should almost never do
Stolen from Framework Design Guidelines
Always code as if the person who ends up maintaining your code is a violent psychopath who knows where you live.
— Coding Horror blog
Readability should be the #1 concern when writing code. Code is read far more often than it is written, so one should optimize for that fact.
- DO follow formatting and naming standards for the language
Thou shalt make thy program's purpose and structure clear to thy fellow man by using the One True Brace Style, even if thou likest it not, for thy creativity is better used in solving problems than in creating beautiful new impediments to understanding.
— 8th C Commandment
If there aren't clear standards for the language, pick a standard and stick with it. Any standard is better than no standard.
- DO use a coding standard enforcing tool to automate the drudgery of compliance checking
Also, a standard that isn't enforced isn't really a standard. And coding standards really need a tool to properly enforce them, RuboCop is an example of a coding standard-enforcing tool for Ruby. StyleCop is an example for C#. Pick a tool, configure it so that it runs automatically on all code and then it is easy for everyone to follow the standard.
- DO follow the basic input → process → output → handle errors structure
By creating this essentially Three-act structure, it makes your code easier to read and understand. Sadly, many methods follow a "Choose-Your-Own-Adventure" structure like this:
You exit the passageway into a large cavern. Unless you came from page 59, in which case you fall down the sinkhole into a large cavern. A huge troll, or possibly a badger (if you already visited Queen Pelican), blocks your path. Unless you threw a button down the wishing well on page 8, in which case there nothing blocking your way. The [troll or badger or nothing at all] does not look happy to see you.
— Confident Ruby
- DO keep lines to 100 characters or less
- DO keep functions short
Short functions are more readable functions. The more lines of code a function has, the harder it is to understand everything it is doing. Often, one will see code like this:
public void foo() {
// Frob the frobnicator
FrobnicatorFactory frobnicatorFactory = new FrobnicatorFactory();
Frobnicator frobnicator = frobnicatorFactory.createInstance();
frobnicator.frob();
// Reticulate splines
SplinesFactory splinesFactory = new SplinesFactory();
Splines splines = splinesFactory.createInstance();
splines.reticulate();
// ... snip ... many more chunks of code
}
If the code is complicated enough that we need comments to understand what each block of code is doing, extract those blocks of code into functions to make the whole thing much more readable:
public void foo() {
frobFrobnicator();
reticulateSplines();
}
The maximum recommended length of functions will vary by language, but in almost any language fifty lines is too long and twenty lines or less is pretty good.
See also: Clean Code, chapter 3
- DO write easy-to-understand, idiomatic code
What I mean by idiomatic code is code that is written the way the majority of other developers would write it. For example, here are two loops to print the numbers from 1 to 100 in C:
/* Bad */
int i = 0;
while (i < 100) {
printf("%d\n", i + 1);
++i;
}
/* Good */
for (int i = 0; i < 100; i++) {
printf("%d\n", i + 1);
}
They both result in the exact same output. They probably even compile to the same code. Which is more readable? The second one, because when you need to iterate over a series with known limits, also known as "definite iteration", a for
loop is the standard, idiomatic method in C. The reader sees that it is a standard chunk of code that they are familiar with and it is much easier for the reader to understand.
- DO NOT go overboard with simplifying your code
Some people take the above practice to mean that we shouldn't use language constructs that are hard for beginners to grasp at first or that beginners don't normally use right away. A couple examples are the ternary if-else
operator and lambdas in languages like C#. Code should be most readable to the intermediate majority.
See also: About Face, chapter 3: Perpetual Intermediates
- DO capitalize acronyms in identifiers as if they were normal words
Let's use the common acronym HTML in an identifier name as an example:
Casing | Identifier |
---|---|
camelCase | htmlText |
PascalCase | HtmlText |
snake_case | html_text |
SCREAMING_CAPS | HTML_TEXT |
- DO NOT use Hungarian Notation in identifier naming
- AVOID using numeric literals in your code
// Bad
Thread.sleep(5000);
// Good
Thread.sleep(STANDARD_WAIT_TIME);
- DO document your code using a documentation comments facility available for the language
- AVOID putting comments inside the body of functions
Comments inside the body of a function are almost always signs that the code needs to be rewritten for more clarity.
- DO NOT check in commented out code
You might be tempted to say that you may need it later, or that it is used for debugging, or it is there for reference. But code that is commented out is not maintained along with the rest of the system. It is not tested. It quickly begins to not match up with the code around it. And before long, even if you were going to uncomment it at some point, you would have to rewrite it anyway. Just delete it and be done with it or uncomment it and put some sort of configuration switch in that will allow it to be tested and therefore maintained.
There have been a lot of misconceptions about exceptions and exception handling over the years. What exceptions do for you is create a separate code path for dealing with error states. This way you can write your mainline code the way you've always wanted, one statement after another assuming that everything works along the way. When something fails, control is automatically passed to the error handler. This is the way that the best programmers have been writing their code for decades, there just wasn't built-in language support for it before.
- DO catch exceptions which you can handle or recover from
- CONSIDER catching exceptions to log them or repackage them as more specific exceptions, but always re-throw
- AVOID simply swallowing exceptions
It is rarely the right thing to do, but there are circumstances where it can be. For example, in some languages parsing data can throw an exception when the data isn't recognized. In limited cases like this one, catching the exception and moving on can be acceptable.
-
DO NOT catch
Exception
(or whatever the generic exception type is in the language in which you are writing code)
Because you should only catch exceptions that you can handle or recover from (unless you're rethrowing them), you should never catch the base exception class. Additionally, by letting exceptions crash the application while you're developing it, you can find out what is causing problems and fix the problems.
See also: Martin Fowler's article Fail Fast
The one exception to the rule of never catching Exception
is writing an exception handler at the outermost scope of the application or thread. In some applications the right thing to do is to catch an exception at the outermost scope to log it just before allowing the application to terminate. And in many programming languages an uncaught exception in a thread crashes the entire application not just the single thread (and rightly so). The right thing to do there is to catch the exception, log it and signal the main thread what happened.