radeonsi/gfx10: enable GS fast launch for triangles and strips with NGG culling