That makes complete sense - if you've got something 'needy', as soon as it's queuing up, I imagine it snowballs, too...
10-20 times the core count is crazy, but I guess it's had a lot of development effort into parallelizing it's execution, which of course goes against what your use case is :)