Ah, that sounds reasonable. We'll no longer require callback_mutex if we
accept races when current attaches to another cpuset here. We'll need
rcu_read_lock() to safely dereference task_cs(current) unless it's
top_cpuset, but that's much better than callback_mutex and spinning on
task_lock(current).