Performance in the Fibonacci benchmark is almost entirely determined by factors other than function call overhead

In the [readme](https://github.com/bddicken/languages/tree/main/fibonacci#readme) for the Fibonacci benchmark, we see the following requirements for different implementations of the benchmark:
```
ALL IMPLEMENTAITONS MUST...
Have a function that recursively compute a fibonacci number with this naive algorithm
Base case for input 0
Base case for input 1
Must make two recursive calls for each non-base invocation
No result caching, conversion to tail recursion, or iterative solutions.
```

This is because the benchmark `Emphasizes function call overhead, stack pushing / popping, and recursion.`

This is misleading. The recursive calls in the reference implementation *are* tail calls, and `gcc` at optimization levels higher than `-O1` *does* optimize this by eliminating one of function calls. Thus the compiler produces assembly that is roughly equivalent to the following `c`  code:
```c
int32_t fibonacci(int32_t n) {
  int32_t result = 0;
  while (n > 1) {
    result += fibonacci(n - 1);
    n -= 2;
  }
  if (n == 1) result += 1;
  return result;
}
```

The Fortran compiler `gfortran` goes even further. It "unrolls" the recursive calls, and produces assembly that is roughly equivalent to the following `c` code:
```c
int32_t fibonacci(int32_t n) {
  if (n == 0) return 0;
  if (n == 1) return 1;
  if (n == 2) return 1;
  if (n == 3) return 2;
  int32_t a = fibonacci(n - 3);
  int32_t b = fibonacci(n - 4);
  if (n == 4) return a + 2;
  a += b;
  int32_t c = fibonacci(n - 5);
  b += c;
  a += b;
  if (n == 5) return b + 1;
  return a + b + c + fibonacci(n - 6);
}
```

(See the output on [Compiler Explorer](https://godbolt.org/#z:OYLghAFBqd5TKALEBjA9gEwKYFFMCWALugE4A0BIEAZgQDbYB2AhgLbYgDkAjF%2BTXRMiAZVQtGIHgFYBQogFUAztgAKAD24AGfgCsp5eiyagA%2BudTkVjVEQJDqzTAGF09AK5smIAEzknADIETNgAcp4ARtikUgAc5AAO6ErE9kyuHl6%2BicmpQkEh4WxRMTzx1ti2aSJELKREGZ7efhVVQjV1RAVhkdFxVrX1jVktg13BPcV9ZQCUVujupKicXACkPgDMwageOADU6z5KRIToAHRIh6taAILrW0w77vuHx5j0BBEXV7f327vYA4%2BI4nYJEb7A653W5gjY%2BUxEPZ0CJCFioVAECCw%2BGIpgzA4AdgAQlC9nsCDQ9hAmAcNgARVb0vZafGkbBERY0rSMkm3MkUqk0xkMpk8Vnszl7Hg80l7Nkc0g05Go9GYpgAWjFQKJSM%2BKox1PVPhmMt%2BBIZvxhwj2bBYwSpYL2dWAlj2qCQdQAVJ6naRgAA3fGrYmy7EIvbuWl0p0kTHO/2raRE6XSOkmja8m784Rw8OkKPM01ZpFkB05nHkgvSjOVxnOCM88nrEk%2BIlBkN8sly7XC3Uo1iqrHpzNk4MW4sJUhgmgQK4%2BaSYRPOJiHchy4dQsdcOb0bjSfjeLg6cjobj1pQLJaA%2B58chEbTbuYAaxAABZpGd31/pN/v4ZuK%2B/BsCA0haOQh7HqeXD8EoIBgfeR7buQcCwCgGA4PgxBkJQ1B0IwrAcNwt6CMIYgSJwPCvnIwjKGomiIeQ%2Bh%2BEYJggII9SkMYVjYDYdgOBATjDN4PAEv4TCYN0RQlCAcI5CkfHpG4TRSKJSTyWkkm9DEsmtApHRDEpWQidxvHVGMmlTNpoydEJKkDJ0FnSXCcwXosyzcJO6DAJxbA2naK6dvybAJB8GK4kI2CytmRDYMA0QQE%2BwSLvSr74iAIARmuFBNoFboepxtjxfhwrzgAbGlGXOpuuXiPQ9B7HFRCmBgbC2uJpjOp4zBEBAPBrs6Jq5WyLCYBAzprp6%2BLuNVxZkvmvbcpas17Jg6C1qKa6RuqUpRV2c0FvNrZ9vqmIEINy1OCt6AzXtk6Op6WUzRgwj%2BUoN1dmyqCLCk/qAjQ7iPApx0DgaeJytgSjuPQPU0Ode17WCsXxYl4nCqla6I8IWJg%2BlewBctCPCEjpAJUlaMVUi73w%2BSlLUlGC34kQSDMLt1NkpSC2s12PEqAKdPCr2WpMyzuVs0iVZc2SPORaLbMc0yyog2qezbVqzbA2ioMq3sxqS3sl0Urtl3/YDaQa6qM2XZ53nsH5wRIbuXD7uBD4ntwABiZBEJxNKuVeQJwvwCE6DMczMyNfQQDuAFASBYEQfwUEwXBd4PqH5Avu%2Bn4/jnv45/%2BXAbAertJ6niFzChCDwBAaHoMFDDRDhEAtQkDcxKgwCUX4eExaQsEQBErsRMEdQAJ5Efww%2BsKQo8APIRLolQIbeLUcMIs9MPQ48MTgETuMAzgSPQsG8PwOBtcAkg7wQn12L9J/Htg6iVO4MWu4jjvHh8EScTPrg4K7b2BBgKn3IL9UgKIVB0mwBfD4rFy4CCMMAJQAA1Ag2AADus8EjMAntRUQtUKJURIooFQGhXb6D6ixMwFhDCfFgpAOY6AEgKRPuqWeRcTzgKnDgBhUcTJLzSI4cStk%2BqBAmFJaYck8iKUyMJaRClHJSN0mZGyhl5EqPaOZCRWkDDHDUXIvR2jCi6LFPMNyFFo5O2LgxKCex1CxFKuqUqr4GroilK%2BM4PgqSYRIIdDYYog5pzDtgCOMR%2BEvh/GcXOMTfwF0AuQYCoEXa2O4MneCadkKIBrmgOurdGAUCoM3PJbcQDAHYt7YwXcBAMF7v3QeDEp5jzwU0me89F62DwavbqG8t6u13vvQ%2BdUT63nPsYS%2BKxjyEFvgQe%2Brsn4vzfqAj%2Brtv6/1Hv/SZQcpwgNvOAyB2BoGwOCKABBNAkGoPQVgnBh5iLyDIpISi%2BDaLkIYkxQw4y2Jex9nQiIfCmEsLSGwjhiduEEF4fAFyPFBH8UEuogw4iTGWQMGpGRoiFEaR0civqmimD6QaPCnF0K2h4uMZMaSOKxjov0fUJRpQXKXncsaAuzsE5uy4PYxxzjXHlO%2BcYDxXifGED8QHY0QTy4hLCdQKxCSknxxLmkqwKdg6PgziBTxsTNVUUdpwtlpcVXp0dj4GxkFFUGrmOAlIDhXxAA%3D%3D ))

If tail calls are not optimized, then the reference implementation requires `331160280` function calls to calculate `fibonaaci(40)`, while the `fortran` version only requires `313883` function calls. This is calculated as follows:

* If `T_n` is the number of function calls that a naive recursive solution requires to calculate the nth Fibonacci number, then we have that `T_0 = T_1 = 0`, and `T_{n + 2} = 2 + T_{n + 1} + T_n`.
* If `U_n` is the number of function calls that the `fortran` optimization requires to calculate the nth Fibonacci number, then we have that `U_0 = U_1 = U_2 = U_3 = 0`, `U_4 = 2`, `U_5 = 3`, and `U_{n + 6} = 4 + U_n + U_{n + 1} + U_{n + 2} + U_{n + 3}`.

Thus, contrary to its stated goal, using this benchmark to compare `c` and `fortran` does *not* give us an accurate comparison of the overhead that each language introduces when making recursive function calls. Instead the difference in performance is almost *entirely* a result of the compilers' ability to *remove* the function calls altogether.

At the very least, the readme should clarify that while the source code for each implementation is not allowed to eliminate one of the recursive calls, and must make exactly two recursive calls on each iteration, and that these recursive calls must calculate the `(n - 1)th` and `(n - 2)th` Fibonacci numbers, the compiler may still make whatever optimizations it is capable of.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance in the Fibonacci benchmark is almost entirely determined by factors other than function call overhead #358

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Performance in the Fibonacci benchmark is almost entirely determined by factors other than function call overhead #358

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions