nv50/ir: set number of threads/block for variable local size