[go: nahoru, domu]

Jump to content

Essential complexity: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
→‎Measure of being close to (or precisely) a structured program: let's use his word, although I think it's not in any dictionary
→‎Measure of the "structureness" of a program: actually, it's not exactly the same
Line 13: Line 13:


== Measure of the "structureness" of a program ==
== Measure of the "structureness" of a program ==
'''Essential complexity''' is also a numeric measure defined by McCabe in his highly-cited, 1976 paper better known for introducing [[cyclomatic complexity]]. McCabe, defined essential complexity as the cyclomatic complexity of the reduced [[control flow graph]] after iteratively replacing (reducing) all [[structured programming]] [[control structure]]s, i.e. those having a single entry point and a single exit point (for example if-then-else and while loops) with placeholder single statements.<ref name="mccabe76">{{cite journal| last=McCabe| date=December 1976| journal=IEEE Transactions on Software Engineering| pages=308–320| title=A Complexity Measure|format=}}</ref>{{rp|317}}<ref>http://www.mccabe.com/pdf/mccabe-nist235r.pdf</ref>{{rp|80}}<!-- note that the original paper has an error in the final formula for ev, but this is corrected the technical report-->
'''Essential complexity''' is also a numeric measure defined by McCabe in his highly cited, 1976 paper better known for introducing [[cyclomatic complexity]]. McCabe, defined essential complexity as the cyclomatic complexity of the reduced [[control flow graph]] after iteratively replacing (reducing) all [[structured programming]] [[control structure]]s, i.e. those having a single entry point and a single exit point (for example if-then-else and while loops) with placeholder single statements.<ref name="mccabe76">{{cite journal| last=McCabe| date=December 1976| journal=IEEE Transactions on Software Engineering| pages=308–320| title=A Complexity Measure|format=}}</ref>{{rp|317}}<ref>http://www.mccabe.com/pdf/mccabe-nist235r.pdf</ref>{{rp|80}}<!-- note that the original paper has an error in the final formula for ev, but this is corrected the technical report-->


McCabe's reduction process is intended to simulate the conceptual replacement of control structures (and actual statements they contain) with subroutine calls, hence the requirement for the control structures to have a single entry and a single exit point.<ref name="mccabe76"/>{{rp|317}} (Nowadays a process like this would fall under the umbrella term of [[refactoring]].) All structured programs evidently have an essential complexity of 1 as defined by McCabe because they can all be iteratively reduced to a single call to a top-level subroutine.<ref name="mccabe76"/>{{rp|318}} As McCabe explains in his paper, his essential complexity metric was designed to provide a measure of how far off this ideal (of being completely structured) a given program was.<ref name="mccabe76"/>{{rp|317}} Thus higher essential complexity numbers, which can only be obtained for non-structured programs, indicate that they are further away from the structured programming ideal.<ref name="mccabe76"/>{{rp|317}}
McCabe's reduction process is intended to simulate the conceptual replacement of control structures (and actual statements they contain) with subroutine calls, hence the requirement for the control structures to have a single entry and a single exit point.<ref name="mccabe76"/>{{rp|317}} (Nowadays a process like this would fall under the umbrella term of [[refactoring]].) All structured programs evidently have an essential complexity of 1 as defined by McCabe because they can all be iteratively reduced to a single call to a top-level subroutine.<ref name="mccabe76"/>{{rp|318}} As McCabe explains in his paper, his essential complexity metric was designed to provide a measure of how far off this ideal (of being completely structured) a given program was.<ref name="mccabe76"/>{{rp|317}} Thus higher essential complexity numbers, which can only be obtained for non-structured programs, indicate that they are further away from the structured programming ideal.<ref name="mccabe76"/>{{rp|317}}
Line 47: Line 47:
</source>
</source>


The notion of CFG reducibility introduced by McCabe for defining essential complexity finds applications in compiler optimization. The areas of the CFG that cannot be reduced (to structured programs) are now called ''improper regions''; these cause additional difficulties in optimizing generated code.<ref name="Muchnick1997">{{cite book|author=Steven S. Muchnick|title=Advanced Compiler Design Implementation|year=1997|publisher=Morgan Kaufmann|isbn=978-1-55860-320-2|pages=196-197}}</ref>
The idea of CFG reducibility by successive collapses of sub-graphs (ultimately to a single node for well-behaved CFGs) is also used in modern compiler optimization. The areas of the CFG that cannot be reduced to "nice" programs, technically called [[natural loop]]s (which have a rather involved definition that we elide in this article) are called ''improper regions'', and these regions end up having a relatively simple definition: multiple-entry strongly connected components of the CFG. (Multiple exits do not cause problems to modern compilers. Thus the simplest improper region is a loop with two entry points.) Improper regions cause additional difficulties in optimizing code.<ref name="Muchnick1997">{{cite book|author=Steven S. Muchnick|title=Advanced Compiler Design Implementation|year=1997|publisher=Morgan Kaufmann|isbn=978-1-55860-320-2|pages=196-197}}</ref>


==See also==
==See also==

Revision as of 04:45, 17 July 2014

Essential complexity has been defined in various ways by various people. One definition is as an antonym to accidental complexity. Another definition is a number attempting to measure how close (or far) a given program is from being a structured program.

Antonym to accidental complexity

Essential complexity refers to a situation where all reasonable solutions to a problem must be complicated (and possibly confusing) because the "simple" solutions would not adequately solve the problem.[citation needed] It stands in contrast to accidental complexity, which arises purely from mismatches in the particular choice of tools and methods applied in the solution.

This term has been used since, at least, the mid-1980s. Turing Award winner Fred Brooks has used this term and its antonym of accidental complexity since the mid-1980s. He has also updated his views in 1995 for an anniversary edition of Mythical Man-Month, chapter 17 "'No Silver Bullet' Refired".[1][2][3] [4]

Measure of the "structureness" of a program

Essential complexity is also a numeric measure defined by McCabe in his highly cited, 1976 paper better known for introducing cyclomatic complexity. McCabe, defined essential complexity as the cyclomatic complexity of the reduced control flow graph after iteratively replacing (reducing) all structured programming control structures, i.e. those having a single entry point and a single exit point (for example if-then-else and while loops) with placeholder single statements.[5]: 317 [6]: 80 

McCabe's reduction process is intended to simulate the conceptual replacement of control structures (and actual statements they contain) with subroutine calls, hence the requirement for the control structures to have a single entry and a single exit point.[5]: 317  (Nowadays a process like this would fall under the umbrella term of refactoring.) All structured programs evidently have an essential complexity of 1 as defined by McCabe because they can all be iteratively reduced to a single call to a top-level subroutine.[5]: 318  As McCabe explains in his paper, his essential complexity metric was designed to provide a measure of how far off this ideal (of being completely structured) a given program was.[5]: 317  Thus higher essential complexity numbers, which can only be obtained for non-structured programs, indicate that they are further away from the structured programming ideal.[5]: 317 

To avoid confusion between various notions of reducibility to structured programs, it's important to note that McCabe's paper briefly discusses and then operates in the context of a 1973 paper by S. Rao Kosaraju, which gave a refinement (or alternative view) of the structured program theorem. The seminal 1966 paper of Böhm and Jacopini showed that all programs can be [re]written using only structured programming constructs, (aka the D structures: sequence, if-then-else, and while-loop), however, in transforming a random program into a structured program additional variables may need to be introduced (and used in the tests) and some code may be duplicated.[7]

In their paper, Böhm and Jacopini conjectured, but did not prove that it was necessary to introduce such additional variables for certain kinds of non-structured programs in order to transform them into structured programs.[8]: 236  An example of program (that we now know) does require such additional variables is a loop with two conditional exits inside it. In order to address the conjecture of Böhm and Jacopini, Kosaraju defined a more restrictive notion of program reduction than the Turing equivalence used by Böhm and Jacopini. Essentially, Kosaraju's notion of reduction imposes, besides the obvious requirement that the two programs must compute the same value (or not finish) given the same inputs, that the two programs must use the same primitive actions and predicates, understood as expressions used in the conditionals. Because of these restrictions, Kosaraju's reduction does not allow the introduction of additional variables; assigning to these variables would create new primitive actions and testing their values would change the predicates used in the conditionals. Using this more restrictive notion of reduction, Kosaraju proved Böhm and Jacopini's conjecture, namely that a loop with two exists cannot be transformed into a structured program without introducing additional variables, but went further and proved that programs containing multi-level breaks (from loops) form an hierarchy, such that one can always find a program with multi-level breaks of depth n that cannot be reduced to a program of multi-level depth less than n, again without introducing additional variables.[8][9]

McCabe notes in his paper that in view of Kosaraju's results, he intended to find a way to capture the essential properties of non-structured programs.[5]: 315  He proceeds by first identifying the control flow graphs corresponding to the smallest non-structured programs, which he uses to formulate a theorem analogous to Kuratowski's theorem, and thereafter introduces his notion of essential complexity in order to give a scale answer ("measure of the structureness of a program" in his words) rather than a yes/no answer to the question of whether a program's control flow graph is structured or not.[5]: 315  Finally, the notion of reduction used by McCabe to shrink the CFG is not the same as Kosaraju's notion of reducing flowcharts. The reduction defined on the CFG not know or care about the program's inputs, it is simply a graph transformation.[10]

For example, the following C program fragment has an essential complexity of 1, because the inner if statement and the for can be reduced, i.e. it is a structured program.

   for(i=0;i<3;i++) {
      if(a[i] == 0) b[i] += 2;
   }

The following C program fragment has an essential complexity of four; its CFG is irreducible. The program finds the first row of z which is all zero and puts that index in i; if there is none, it puts -1 in i.

  for(i=0;i<m;i++) {
    for(j=0;j<n;j++) {
      if(z[i][j] != 0)
        goto non_zero;
    }
    goto found;
non_zero:
  }
  i = -1;
found:

The idea of CFG reducibility by successive collapses of sub-graphs (ultimately to a single node for well-behaved CFGs) is also used in modern compiler optimization. The areas of the CFG that cannot be reduced to "nice" programs, technically called natural loops (which have a rather involved definition that we elide in this article) are called improper regions, and these regions end up having a relatively simple definition: multiple-entry strongly connected components of the CFG. (Multiple exits do not cause problems to modern compilers. Thus the simplest improper region is a loop with two entry points.) Improper regions cause additional difficulties in optimizing code.[11]

See also

References

  1. ^ Brooks, Proc. IFIP
  2. ^ Brooks, IEEE Computer
  3. ^ Brooks, Mythical Man-Month, Silver Bullet Refired
  4. ^ McCabe, Watson (1996). "Structured Testing: A Testing Methodology Using the Cyclomatic Complexity Metric Chapter 10: Essential Complexity".
  5. ^ a b c d e f g McCabe (December 1976). "A Complexity Measure". IEEE Transactions on Software Engineering: 308–320.
  6. ^ http://www.mccabe.com/pdf/mccabe-nist235r.pdf
  7. ^ David Anthony Watt; William Findlay (2004). Programming language design concepts. John Wiley & Sons. p. 228. ISBN 978-0-470-85320-7.
  8. ^ a b J. Computer and System Sciences, 9, 3 (December 1974), doi:10.1016/S0022-0000(74)80043-7
  9. ^ For more modern treatment of the same results see: Kozen, The Böhm–Jacopini Theorem is False, Propositionally
  10. ^ McCabe footnotes the two definitions of on pages 315 and 317.
  11. ^ Steven S. Muchnick (1997). Advanced Compiler Design Implementation. Morgan Kaufmann. pp. 196–197. ISBN 978-1-55860-320-2.

Further reading

For the first sense of the term:

  • Brooks, Fred P. (1986). "No Silver Bullet - Essence and Accident in Software Engineering". Proceedings of the IFIP Tenth World Computing Conference: 1069–1076.
  • Brooks, Fred P. (April 1987). "No Silver Bullet - Essence and Accidents of Software Engineering". IEEE Computer. 20 (4): 10–19. {{cite journal}}: Unknown parameter |authormask= ignored (|author-mask= suggested) (help)
  • Brooks, Fred P. (1995). "Chap. 17". 'No Silver Bullet' Refired (Anniversary Edition with four new chapters ed.). Addison-Wesley. ISBN 0-201-83595-9. {{cite book}}: |work= ignored (help); Unknown parameter |authormask= ignored (|author-mask= suggested) (help)