Binary Trees............... 936518/s (43Mb) (3x faster than 7.8!)
For reference, simply replacing the blockalloc with malloc/free (something I did to try it out yesterday) gives this:
Binary Trees............... 818800/s (122Mb)
So the new block-alloc is at least fater (and uses significantly less memory than) malloc. :)