CITS5501 lab 7 (week 8) – reviews  – solutions

Reading

It is strongly suggested you complete the recommended readings for weeks 1-7 before attempting this lab/workshop.

A. Code review

Code review is careful, systematic study of source code by people who are not the original author of the code. It’s analogous to having someone proofread a written essay or assignment before you submit it.

It has two main purposes:

Code review is widely practiced in open source projects like Apache and Mozilla, as well as in industry.

Most companies and large projects have coding style standards (for example, the Google Java Style). These can get pretty detailed, even to the point of specifying whitespace (how deep to indent) and where curly braces and parentheses should go. We won’t get into that much detail, but we stress that

If you’re interested in getting more details about how code review works, then there is more information available in the Pressman textbook, as well as a whole StackExchange site dedicated to code review.

B. Exercise

For this exercise you should split up into small groups of 2-3 people.

For each of the following code samples, each person in a pair or group should take the time to read through the code carefully, bearing in mind the “Code review instructions” at the end of this sheet and recording problems they find (and possibly suggesting concrete improvements). Spend about 5 minutes on this. Then with your partner or group, discuss what problems you have found, and see if you agree or disagree on what they are (5–10 mins).

Then share this with the class as a whole; your facilitator may suggest some more.

Code sample 1 – dayOfYear

public static int dayOfYear(int month, int dayOfMonth, int year) {
    if (month == 2) {
        dayOfMonth += 31;
    } else if (month == 3) {
        dayOfMonth += 59;
    } else if (month == 4) {
        dayOfMonth += 90;
    } else if (month == 5) {
        dayOfMonth += 31 + 28 + 31 + 30;
    } else if (month == 6) {
        dayOfMonth += 31 + 28 + 31 + 30 + 31;
    } else if (month == 7) {
        dayOfMonth += 31 + 28 + 31 + 30 + 31 + 30;
    } else if (month == 8) {
        dayOfMonth += 31 + 28 + 31 + 30 + 31 + 30 + 31;
    } else if (month == 9) {
        dayOfMonth += 31 + 28 + 31 + 30 + 31 + 30 + 31 + 31;
    } else if (month == 10) {
        dayOfMonth += 31 + 28 + 31 + 30 + 31 + 30 + 31 + 31 + 30;
    } else if (month == 11) {
        dayOfMonth += 31 + 28 + 31 + 30 + 31 + 30 + 31 + 31 + 30 + 31;
    } else if (month == 12) {
        dayOfMonth += 31 + 28 + 31 + 30 + 31 + 30 + 31 + 31 + 30 + 31 + 31;
    }
    return dayOfMonth;
}

What problems can you see with this code? What improvements would you suggest?

Does it violate any of the guidelines given in the “Code review instructions”?

Some problems with code sample 1:

There are many possible ways to improve this code, but here’s one (quick) possibility:

/** Given a date, specified by providing a day of the month (bounds
 * depend on the exact month, but should always be between 1 and 31
 * inclusive), a month (from 1 to 12), and a 4-digit year,
 * return what day of the year that date occurs on.
 */
public static int dayOfYear(int dayOfMonth, int month, int year) {
  // For non-leap years: days in Jan-Nov.
  int monthLength = { 31, 28, 31, 30, 31, 30, 31, 31, 30, 31, 31 };
  int result = dayOfMonth;

  // e.g. days in Feb have added to them the no. of days in Jan
  // (array pos 0) added to them, only.
  for int (i = 0; i < month - 1; i++) {
    result += monthLength[i];
  }
  // months past Feb are affected by leap years
  if (isLeapYear(year) && month > 2) {
    result += 1;
  }
  return result;
}

It too could still be improved:

Code sample 2 – leap

Hopefully you spotted one problem with the previous code sample – it gives the wrong answer, when the year is a leap year!

So suppose a colleague writes the following method, intended to determine whether a year is a leap year. Again, look for problems in and potential improvements to this code (5 mins), then discuss these with your partner(s) (5–10 mins).

public static boolean leap(int y) {
    // convert to string so we can access 1st, 2nd digit etc
    String tmp = String.valueOf(y);
    if (tmp.charAt(2) == '1' || tmp.charAt(2) == '3' || tmp.charAt(2) == 5 || 
        tmp.charAt(2) == '7' || tmp.charAt(2) == '9') {
          if (tmp.charAt(3)=='2'||tmp.charAt(3)=='6') return true; /*R1*/
          else
              return false; /*R2*/
    }else{
        if (tmp.charAt(2) == '0' && tmp.charAt(3) == '0') {
            return false; /*R3*/
        }
        if (tmp.charAt(3)=='0'||tmp.charAt(3)=='4'||tmp.charAt(3)=='8')
          return true; /*R4*/
    }
    return false; /*R5*/
}

Some problems with code sample 2:

There are many possible ways to improve this code, but here’s one (quick) possibility, which makes improvements including the following:

/** Returns whether the proleptic Gregorian calendar year
  * <code>year</code> is a leap year.
  *
  * Uses the algorithm described in *Calendrical Calculations* by
  * Reingold & Dershowitz, chapter 3.
  */
public static boolean isLeapYear(int year) {
  if (year % 4 != 0) {
    return false;
  } else if (year % 100 != 0) {
    return true;
  } else if (year % 400 != 0) {
    return false;
  } else {
    return true;
  }
}

If following a well-known algorithm (especially one this short), inline comments probably aren’t necessary (but Javadoc documentation still is).

C. Linters and formatters

In general, it’s expensive to spend the time of humans doing something a computer could do equally well – like formatting code correctly, or spotting errors that can be checked for mechanically. So before a human reviews your code, you should ensure it’s already been checked for basic errors by a computer.

Programs called code formatters, code beautifiers or pretty-printers apply consistent formatting to code. Some apply a single rigid, built-in style (e.g. “all indents are 4 spaces”) but most can be customised to reproduce a style the team prefers. One commonly-used formatter used in Java projects is Spotless, which can be integrated easily with the Microsoft Visual Studio Code editor, or the IntelliJ IDEA IDE. If you use either of those tools to write your Java code, you might like to experiment with it.

Linters analyse your source code looking for common programming errors, poor style, and programming language constructs which should be used with caution. They are performing a simple form of static code analysis (which we’ll talk more about later). Examples of linters for Java include Checkstyle, Spotbugs, and PMD. In your own time, you might like to try running one of these tools over code you have written, to see what sort of problems they can flag.

D. Ant build files and code coverage tools

This exercise is to get you familiar with code coverage tools and some of their weaknesses.

A repository with Java code is provided at https://github.com/cits5501/week-8-lab-coverage-example which contains a simple class for detecting whether a Java string is a palindrome (Palindrome.java), and tests for this class using JUnit.

You can use GitPod to work with this code online. You will need a GitHub account; once you have one set up, follow the link below:

  https://gitpod.io/new/#https://github.com/cits5501/week-8-lab-coverage-example

and select “New workspace”.

After a minute or two, you should get access to an online version of the VS Code editor in which the source code for the repository is available. Take a look at the build.xml file; this file is used by the Apache Ant build automation tool. It defines several “targets” representing tasks that can be run using the build file:

(Some of these tasks you can also do just using VS Code. For instance, open PalindromeTest.java, expand the “outline” and “java projects” panes, and wait for the background tasks to finish. A green triangle should appear next to the start of the PalindromeTest class, indicating that it can be clicked to run the JUnit tests it contains.)

Click the “Terminal” pane, and type ant test to run the JUnit tests. The messages should indicate that four tests were run with no failures.

Now run ant report, and not only will the tests be run, but the paths they take through the code of PalindromeTest will be recorded by a tool called JaCoCo, and a report produced.

You can view this by clicking “Go Live” in the VS Code taskbar (towards the right). This serves up the contents of the repository directory via HTTP, so it can be viewed in the browser. Click “report”, and by clicking through, you should be able to see a coverage report for our tests:

Read the JaCoco documentation on “Coverage Counters”, here, for more information about exactly what is being measured.

Exercises

Solution

The coverage report doesn’t change in any of the above cases.

This suggests we could have a very weak test suite, consisting of just one test case – say, an empty string ("") – and with no assertions, and full coverage would still be reported.

If changes are made to the code, we could easily introduce errors that aren’t picked up by the test suite.

When we look at mutation testing, we will see ways of assessing the quality of our tests which do not depend on measuring coverage as reported by tools like JaCoCo.

As for a missing test case: an empty string (a degenerate or “border” case) would be a good thing to test for, as would single-letter strings, but this is not done.

E. Summary

Code review is a widely-used technique for improving software quality by human inspection. Code review can detect many kinds of problems in code, but in this workshop we looked at code samples which highlightede the following general principles:

These general principles help ensure that code is:

Appendix – Code review instructions

This document describes how and why to perform code reviews.

You can’t learn how to write without learning how to read – and programming is no exception. Code reviewing is widely used in software development, both in industry and in open source projects. Some companies like Google even make it a policy that all code must be reviewed before being committed to version control – this means that for every single line of code in Google’s repository, another software developer has read it, given feedback on it, and signed off on it.

Code review can be seen as a type of static quality assurance technique (it doesn’t require the code to be run), and has multiple benefits:

So code reviewing is not only a practically important skill that you will need in the real world, but also a learning opportunity.

What to look for

In the exercises for this workshop, you can make comments about anything you think is relevant. But some guiding principles are that we want code to be:

So read the code with those principles in mind. What follows are some concrete examples of problems to look for – feel free to add others.

Bugs or potential bugs.

Unclear, messy code.

Misusing (or failing to use) essential design concepts.

Positive comments are also a good thing. Don’t be afraid to make comments about things you really like, for example:

Credits

The code samples, instructions and exercises in sections A–B and E are adapted from MIT’s course 6.005, “Software Construction”, reading 4, “Code Review”, available from the MIT website, which was collaboratively authored with contributiopn by Saman Amarasinghe, Adam Chlipala, Srini Devadas, Michael Ernst, Max Goldman, John Guttag, Daniel Jackson, Rob Miller, Martin Rinard, and Armando Solar-Lezama. This worksheet (as well as the original MIT content) is licensed under the CC BY-SA 4.0 license.