From what I remember, the biggest costs are the kernel transition and saving and restoring registers, which you have to pay in any case, and changing the process context isn't a big deal (or isn't a deal at all?). The kernel treats all threads more or less equally.
I'd like to hear from experts: is it BS, or is there really a different cost when a context switch happens between threads of the same process vs. between processes?
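
For what it's worth, here is a rough sketch of how I might try to measure this myself: a one-byte ping-pong over a pair of pipes, once against a peer thread and once against a forked peer process. The names (`ROUNDS`, `echo`, `ping_pong`, `bench_threads`, `bench_processes`) are my own, the round count is arbitrary, and it assumes a POSIX system with `os.fork`. Python's interpreter lock adds its own hand-off overhead on the thread side, so the numbers are only a crude indication, not a clean measurement of the kernel's switch cost.

```python
import os
import threading
import time

ROUNDS = 2000  # arbitrary round count, chosen only for illustration


def echo(read_fd, write_fd, rounds):
    """Peer side: read one byte and write it straight back, `rounds` times."""
    for _ in range(rounds):
        os.write(write_fd, os.read(read_fd, 1))


def ping_pong(read_fd, write_fd, rounds):
    """Driver side: send a byte, block until it is echoed back.

    Returns the average wall-clock time per round trip in seconds.
    """
    start = time.perf_counter()
    for _ in range(rounds):
        os.write(write_fd, b"x")
        os.read(read_fd, 1)
    return (time.perf_counter() - start) / rounds


def _close_all(*fds):
    for fd in fds:
        os.close(fd)


def bench_threads(rounds=ROUNDS):
    """Round-trip time when the peer is a thread in the same process."""
    a_r, a_w = os.pipe()  # driver -> peer
    b_r, b_w = os.pipe()  # peer -> driver
    peer = threading.Thread(target=echo, args=(a_r, b_w, rounds))
    peer.start()
    per_trip = ping_pong(b_r, a_w, rounds)
    peer.join()
    _close_all(a_r, a_w, b_r, b_w)
    return per_trip


def bench_processes(rounds=ROUNDS):
    """Round-trip time when the peer is a separate (forked) process."""
    a_r, a_w = os.pipe()  # driver -> peer
    b_r, b_w = os.pipe()  # peer -> driver
    pid = os.fork()
    if pid == 0:
        # Child process: act as the echo peer, then exit without cleanup.
        try:
            echo(a_r, b_w, rounds)
        finally:
            os._exit(0)
    per_trip = ping_pong(b_r, a_w, rounds)
    os.waitpid(pid, 0)
    _close_all(a_r, a_w, b_r, b_w)
    return per_trip


if __name__ == "__main__":
    print(f"threads:   {bench_threads() * 1e6:.1f} us per round trip")
    print(f"processes: {bench_processes() * 1e6:.1f} us per round trip")
```

On a multi-core machine the two sides may simply run on different cores, so I would probably have to pin the whole thing to one CPU (for example with `taskset -c 0` on Linux) before each round trip really forces switches back and forth. Even then each round trip also includes the pipe read/write syscalls, so only the difference between the two numbers would say anything about thread vs. process switching.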