In this example, we unrolled the loop 4 times and measured two things: the execution time of the unrolled loop, and the useful flops per unit of execution time. We then scheduled the 4-way unrolled loop and took the same two measurements.
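As a rough illustration of the two transformations, here is a minimal sketch in C. The loop itself is an assumption: the text does not say which loop was measured, so a simple `y[i] += a * x[i]` kernel (2 useful flops per element) stands in for it. All function names are hypothetical. The "scheduled" version only mimics by hand what an instruction scheduler does, grouping the independent loads, multiplies, and adds so that no operation waits directly on the one before it. An optimizing compiler may reorder either version on its own.

```c
#include <stddef.h>

/* Baseline loop (assumed kernel): one multiply and one add per element. */
void axpy_rolled(size_t n, double a, const double *x, double *y) {
    for (size_t i = 0; i < n; i++)
        y[i] += a * x[i];
}

/* 4-way unrolled: four elements per iteration, statements in source order. */
void axpy_unrolled4(size_t n, double a, const double *x, double *y) {
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        y[i]     += a * x[i];
        y[i + 1] += a * x[i + 1];
        y[i + 2] += a * x[i + 2];
        y[i + 3] += a * x[i + 3];
    }
    for (; i < n; i++)          /* remainder when n is not a multiple of 4 */
        y[i] += a * x[i];
}

/* 4-way unrolled and hand-scheduled: all loads first, then the four
 * independent multiplies, then the four adds and stores. */
void axpy_unrolled4_scheduled(size_t n, double a, const double *x, double *y) {
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        double x0 = x[i], x1 = x[i + 1], x2 = x[i + 2], x3 = x[i + 3];
        double y0 = y[i], y1 = y[i + 1], y2 = y[i + 2], y3 = y[i + 3];
        double p0 = a * x0, p1 = a * x1, p2 = a * x2, p3 = a * x3;
        y[i]     = y0 + p0;
        y[i + 1] = y1 + p1;
        y[i + 2] = y2 + p2;
        y[i + 3] = y3 + p3;
    }
    for (; i < n; i++)          /* remainder when n is not a multiple of 4 */
        y[i] += a * x[i];
}

/* Useful flops per second for this kernel: 2 flops per element,
 * divided by the measured execution time in seconds. */
double useful_flops_per_second(size_t n, double seconds) {
    return (2.0 * (double)n) / seconds;
}
```

Only the multiply and add on each element count as useful flops; loop-counter updates and address arithmetic do not. That is why the flop count stays at 2n for all three versions, and any change in flops per unit time comes from the execution time alone.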