draw1: 6M for draw 0,0,100,100 no repl
draw3: 4M for draw 0,0,100,100 no repl
	just read src, dst - 250k
	alpha calculation - 3000k

olddraw: 10M for draw 0, 0, 1000, 1000 no repl all ldepth 3
	44M for draw 0, 0, 1000, 1000 src, mask ldepth 2 dst ldepth 3
draw4: 160M for draw 0, 0, 1000, 1000 no repl all r8g8b8
	src, dst reading: 13-15M each
	alpha calculation loop: 90M
	minimal loop control +20M
	alpha calculation with divides +190M
	alpha calculation with shifts +70M
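
For reference, a rough sketch of what "divides" vs "shifts" could mean for an 8-bit channel. The notes do not record the exact code that was timed, so this is an assumption: it uses the common shift-and-add replacement for a rounded divide by 255. All function names are made up for illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Rounded x/255 using an integer divide. Intended for 0 <= x <= 255*255. */
uint32_t div255_divide(uint32_t x)
{
	return (x + 127) / 255;
}

/*
 * Rounded x/255 using only adds and shifts (no divide).
 * Assumed formula, not taken from the notes; valid for 0 <= x <= 255*255.
 */
uint32_t div255_shift(uint32_t x)
{
	uint32_t t;

	assert(x <= 255u * 255u);	/* range check for the approximation */
	t = x + 128;
	return (t + (t >> 8)) >> 8;
}

/* Blend one channel, alpha in 0..255: (src*alpha + dst*(255-alpha)) / 255. */
uint8_t blend_divide(uint8_t src, uint8_t dst, uint8_t alpha)
{
	uint32_t sum = (uint32_t)src * alpha + (uint32_t)dst * (255u - alpha);

	return (uint8_t)div255_divide(sum);
}

/* Same blend with the divide replaced by shifts. */
uint8_t blend_shift(uint8_t src, uint8_t dst, uint8_t alpha)
{
	uint32_t sum = (uint32_t)src * alpha + (uint32_t)dst * (255u - alpha);

	return (uint8_t)div255_shift(sum);
}

/* Count inputs in 0..255*255 where the two div255 versions disagree. */
unsigned div255_mismatches(void)
{
	unsigned n = 0;
	uint32_t x;

	for (x = 0; x <= 255u * 255u; x++)
		if (div255_divide(x) != div255_shift(x))
			n++;
	return n;
}
```

div255_mismatches() walks every possible product of two 8-bit values, so it can be used to confirm that the shift version gives the same answer as the divide before swapping it into an inner loop.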