The Personal Software Process: an Independent Study

Chapter 7. Lesson 7: Design and Code Reviews
Define a process for analyzing your PSP data and producing a report. Use this process to produce report R4. You will update this process to produce the final report at the end of the course.
Task 1-- Develop a process for analyzing the data and creating a report on programs 1A through 6A. This should include the results listed below under Task 3. This process must include a planning phase, the task performance phases, and a postmortem phase. Produce and submit a process script and planning forms for enacting this process.

Task 2-- Plan and enact the process defined in Task 1. Use the planning form to record the planned time for this work and track and record the actual time you spend. Submit the planned and actual process data together with the analysis report.

Task 3-- Analyze the data for programs 1 through 6. Spreadsheet analyses are suggested and graphical summaries and presentations are encouraged. At a minimum, produce the following:
Result: Use the GQM method to draw conclusions and set personal improvement goals. Submit the required analyses, tables, and checklists, together with a brief written report. Use graphs wherever possible.

Table 7-1. D23: Percentage of defects injected and removed by phase
Table 7-2. D24: Percentage of Defects Found in Compile
--[Humphrey95]
Without much guidance, I'll be flailing a bit on this, but I'll base this process "script" on Humphrey's format for the PSP scripts.
Table 7-3. Report R4 Development Script
Phase Number | Purpose | To guide the reporting of PSP data |
| Entry Criteria | |
1 | Planning | |
2 | Data Collation | |
3 | Data Analysis | |
4 | Report Generation | |
5 | Postmortem | Complete a report summary with actual time data |
| Exit Criteria | |
LOC data is taken from programs 2A through 6A. The first graph depicts rough-guess estimated LOC vs actual program LOC, with the best-fit linear regression line drawn through the data points. As the graph shows, the estimates are not particularly accurate, and the "best fit" line is a poor approximation.
Rough-guess estimated new/changed LOC vs actual program new/changed LOC; with these four data points, the fit is fairly bad.
A more "calibrated" comparison is the comparison of PROBE- generated estimates vs actual LOC; the "actual" numbers are the same as the last graph, but the estimates are those generated from the full PROBE algorithm. A "perfect fit" line is plotted to show how the PROBE estimates varied from a perfect 1-to-1 match
PROBE estimates vs actual data. The samples are still very scattered and not particularly accurate-looking.
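Since PROBE's machinery isn't shown elsewhere in this report, here is a minimal sketch of the size regression it relies on, assuming the standard least-squares formulation from the PSP; the variable names and sample data are hypothetical, not figures from these programs:

```cpp
#include <cstddef>
#include <iostream>
#include <vector>

// Least-squares fit of actual size against estimated (proxy) size, as PROBE
// does when enough historical data is available: actual ~ beta0 + beta1 * est.
void fitRegression(const std::vector<double>& est, const std::vector<double>& act,
                   double& beta0, double& beta1)
{
    const std::size_t n = est.size();
    double sumX = 0, sumY = 0, sumXY = 0, sumXX = 0;
    for (std::size_t i = 0; i < n; ++i) {
        sumX  += est[i];
        sumY  += act[i];
        sumXY += est[i] * act[i];
        sumXX += est[i] * est[i];
    }
    beta1 = (sumXY - sumX * sumY / n) / (sumXX - sumX * sumX / n);
    beta0 = (sumY - beta1 * sumX) / n;
}

int main()
{
    // Hypothetical history: estimated vs actual new/changed LOC per program.
    std::vector<double> est = {80, 120, 150, 200};
    std::vector<double> act = {110, 140, 180, 260};

    double beta0 = 0, beta1 = 0;
    fitRegression(est, act, beta0, beta1);

    const double nextEstimate = 170;  // proxy-based estimate for the next program
    std::cout << "projected LOC: " << beta0 + beta1 * nextEstimate << "\n";
    return 0;
}
```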
In fact, it doesn't look like we're doing well on accuracy at all, until we take a closer look by applying the "percent error" equation to the numbers.
Percent error in each estimation. Note the downward trend.
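The "percent error" plotted here (and in the time graph below) is assumed to be the usual PSP measure, relative to the estimate rather than to the actual value; a one-function sketch:

```cpp
// Percent error relative to the estimate; positive means the actual value
// exceeded the estimate (an under-estimate), negative means an over-estimate.
double percentError(double estimate, double actual)
{
    return 100.0 * (actual - estimate) / estimate;
}
```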
A look at the time figures reveals some similar trends:
Estimated development time vs actual development time. Aside from an obvious outlier, this has actually been fairly regular.
Percent error in each time estimation; note the similarity to the LOC percent error graph.
...and a similar analysis of the time estimation error reveals an extremely similar curve. This makes sense, since the time estimates are made directly from the size estimates; they should form similar curves. Still, it's good to see that they are similar-- and that our estimation abilities seem to be getting better over time. Note the very large spike in percent error near the beginning of the graphs; this is program 2A, which was significantly larger than predicted and certainly took a great deal of time.
Table 7-4. Percentage of defects injected and removed by phase -- C++
Type | Number Injected (Design) | Number Injected (Code) | Percentage Injected (Design) | Percentage Injected (Code) | Number Removed (Compile) | Number Removed (Test) | Percentage Removed (Compile) | Percentage Removed (Test) |
md (missing design) | 15 | 1 | 10.6 | 0.7 | 1 | 2 | 0.9 | 4.3 |
me (missing error-handling) | 0 | 1 | 0 | 0.7 | 0 | 1 | 0 | 0.7 |
wn (wrong name) | 1 | 19 | 0.7 | 13.5 | 15 | 6 | 10.6 | 4.3 |
mi (missing implementation) | 0 | 1 | 0 | 0.7 | 0 | 1 | 0 | 0.7 |
wa (wrong algorithm) | 4 | 8 | 2.8 | 5.7 | 0 | 13 | 0 | 9.2 |
wc (wrong condition) | 2 | 0 | 1.4 | 0 | 0 | 2 | 0 | 1.4 |
ch | 0 | 1 | 0 | 0.7 | 1 | 0 | 0.7 | 0 |
wt (wrong type) | 0 | 4 | 0 | 2.8 | 3 | 1 | 2.1 | 0.7 |
we (wrong expression) | 2 | 1 | 1.4 | 0.7 | 0 | 3 | 0 | 2.1 |
is (interface specification) | 0 | 1 | 0 | 0.7 | 0 | 1 | 0 | 0.7 |
ic (interface capability) | 17 | 5 | 12.1 | 3.5 | 2 | 4 | 1.4 | 2.8 |
sy (syntax) | 0 | 43 | 0 | 30.5 | 43 | 0 | 30.5 | 0 |
id (interface description) | 1 | 1 | 0.7 | 0.7 | 1 | 0 | 0.7 | 0 |
ma (missing assignment) | 0 | 1 | 0 | 0.7 | 0 | 1 | 0 | 0.7 |
iu (interface use) | 0 | 8 | 0 | 5.7 | 5 | 4 | 3.5 | 2.8 |
Eiffel | | | | | | | | |
mc (missing call) | 0 | 5 | 0 | 5.5 | 4 | 1 | 4.4 | 1.1 |
md (missing design) | 2 | 0 | 2.2 | 0 | 1 | 1 | 1.1 | 1.1 |
wn (wrong name) | 0 | 13 | 0 | 14.3 | 10 | 3 | 11.0 | 3.3 |
mi (missing implementation) | 0 | 2 | 0 | 2.2 | 0 | 2 | 0.0 | 2.2 |
wa (wrong algorithm) | 0 | 10 | 0 | 11 | 1 | 9 | 1.1 | 9.9 |
wc (wrong condition) | 0 | 3 | 0 | 3.3 | 0 | 3 | 0.0 | 3.3 |
sy (syntax) | 0 | 31 | 0 | 34.1 | 31 | 1 | 34.1 | 1.1 |
wt (wrong type) | 0 | 1 | 0 | 1.1 | 1 | 0 | 1.1 | 0 |
is (interface specification) | 0 | 1 | 0 | 1.1 | 1 | 0 | 1.1 | 0 |
ic (interface capability) | 2 | 5 | 2.2 | 5.5 | 5 | 1 | 5.5 | 1.1 |
ma (missing assignment) | 0 | 8 | 0 | 8.8 | 4 | 4 | 4.4 | 4.4 |
iv (invariant) | 0 | 1 | 0 | 1.1 | 0 | 1 | 0 | 1.1 |
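A note on reading the percentages: they appear to be taken against the total defect count for each language (141 defects for C++ and 91 for Eiffel, working backward from the table). For example, the 43 C++ syntax defects injected during coding are 43/141, or roughly 30.5%, of all C++ defects recorded.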
Table 7-5. D24: Percentage of Defects Found in Compile -- C++
Defect Type | Number of defects at compile entry | Number of defects found in compile | Percentage of type found by the compiler |
md | 3 | 1 | 33 |
me | 1 | 0 | 0 |
wn | 21 | 15 | 71 |
mi | 1 | 0 | 0 |
wa | 13 | 0 | 0 |
wc | 2 | 0 | 0 |
ch | 1 | 1 | 100 |
wt | 4 | 3 | 75 |
we | 3 | 0 | 0 |
sy | 43 | 43 | 100 |
ic | 6 | 2 | 33 |
is | 1 | 0 | 0 |
id | 1 | 1 | 100 |
ma | 1 | 0 | 0 |
iu | 9 | 5 | 55 |
Eiffel | | | |
mc | 5 | 4 | 80 |
md | 2 | 1 | 50 |
wn | 13 | 10 | 77 |
mi | 2 | 0 | 0 |
wa | 10 | 1 | 10 |
wc | 3 | 0 | 0 |
sy | 32 | 31 | 97 |
wt | 1 | 1 | 100 |
ic | 6 | 5 | 83 |
is | 1 | 1 | 100 |
ma | 8 | 4 | 50 |
iv | 1 | 0 | 0 |
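The last column is simply the compile-phase removals divided by the defects of that type present at compile entry. For example, the C++ "wn" row: the compiler flagged 15 of the 21 wrong-name defects that survived into compile, and 15/21 is roughly 71%.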
Table 7-6. Defect Fix Times
C++ | | | |
| Defects found in compile | Defects found in test | Total defects found |
Defects injected in design | | | |
Defects injected in code | | | |
Total defects injected | | | |
Eiffel | | | |
| Defects found in compile | Defects found in test | Total defects found |
Defects injected in design | | | |
Defects injected in code | | | |
Total defects injected | | | |
It's interesting to note that defects found in test seem to be substantially more expensive, or at least require longer fix times. The too-simple conclusion is that errors found later in the development cycle "cost more" than errors found earlier. This is probably true, but it's also true that the compiler is catching very simple errors automatically-- errors which couldn't be caught during test, because the program simply won't compile enough to test without them.
The largest categories of design-injected defects are "missing design" and "interface capability" (which is essentially a lesser manifestation of missing design). Less frequent are "wrong name", "wrong algorithm", "wrong expression" (a lesser manifestation of wrong algorithm), and "wrong condition" (a boolean manifestation of wrong expression).
It's tough to know where to start with this. These are all fairly fuzzy things; I'll start with Humphrey's design review checklist on [Humphrey95] page 705, and attempt to expand it as necessary. The problem is, there isn't much necessary-- the "missing design" errors are covered by the "complete" category, and the "wrong algorithm" errors are covered mostly by the "logic" section. Still, I've added a few minor items, shown by emphasis.
Table 7-7. Design Review Checklist
Purpose | To guide you in conducting an effective design review |
General | |
Complete | Ensure that the requirements, specifications, and high-level design are completely covered by the design |
Logic | |
Special Cases | Check all special cases |
Functional Use | |
Names | Verify the names used |
Contracts | Check/create require/ensure/check/invariant contracts for all methods and classes |
Standards | Review the design for conformance to all applicable design standards |
Big items injected in the coding phase are wrong name and syntax, with interface use, wrong algorithm, interface capability, and wrong type finishing up the list. With C++, wrong type is particularly sneaky, because the compiler is good at converting types automatically without any input from the programmer.
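As a small illustration (not taken from the study's programs) of how a wrong-type defect slips past the C++ compiler via implicit conversion:

```cpp
#include <iostream>

int main()
{
    int total = 7;
    int count = 2;
    // Intended a floating-point average, but int/int division truncates
    // first; the implicit int -> double conversion happens afterwards,
    // and the compiler accepts the whole thing without complaint.
    double average = total / count;
    std::cout << average << "\n";  // prints 3, not 3.5
    return 0;
}
```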
Humphrey gives us a C++ code review checklist, so I'll supplement it with emphasized lines (and modifications). I'll try to merge the Eiffel code review checklist with this, noting the obvious places.
Table 7-8. Code Review Checklist
Purpose | To guide an effective code review |
General | |
Complete | Verify that the code covers all the design |
Includes (ignore for Eiffel) | Verify that includes are complete |
Initialization | Check variable and parameter initialization |
Calls | Check function call formats |
Names | Check name spelling and use |
Pointers (C++ only) | Check pointer use |
Output format | Check the output format |
Block closure | Ensure that blocks are properly opened/closed (brace matching in C++, proper closure in Eiffel) |
Logic operators/assignment | |
Line-by-line check | Check every line of code |
Standards/prettify | |
File open/close | Verify that all files are properly declared, opened, and closed |
Special interest items | |
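One classic case the logic-operators/assignment check is meant to catch, again as a hypothetical fragment rather than anything from the study's code, is assignment where comparison was intended:

```cpp
#include <iostream>

int main()
{
    int status = 0;
    const int kError = 1;
    // Meant "status == kError"; the single '=' assigns instead, compiles
    // cleanly (perhaps with a warning), and makes this branch always taken.
    if (status = kError) {
        std::cout << "error path taken\n";
    }
    return 0;
}
```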
Time preparing this report was spent as follows:
Table 7-9. Time Recording Log
Student: | Victor B. Putz | Date: | 000117 |
Program: | R4 |
Start | Stop | Interruption Time (min) | Delta Time (min) | Phase | Comments |
000117 14:07:57 | 000117 14:31:55 | 0 | 23 | estimation accuracy | |
000117 14:31:58 | 000117 14:48:23 | 0 | 16 | defect injection/removal | |
000117 14:49:10 | 000117 14:55:34 | 0 | 6 | defects found by compiler | |
000117 14:55:38 | 000117 15:06:13 | 0 | 10 | defect fix times | |
000117 15:08:17 | 000117 15:41:27 | 0 | 33 | design review checklist | |
000117 15:41:29 | 000117 15:57:34 | 0 | 16 | code review checklist | |
I'm not doing an extensive postmortem on this because I don't feel the time involved would justify the reward. If I continue using the PSP, this section will have to be heavily automated, and will differ so much from the current hand-drawn method that it will no longer be recognizable.