Skip to content

Commit 801afdf

Browse files
Frederic Weisbeckeringomolnar
authored andcommitted
genirq: Fix interrupt threads affinity vs. cpuset isolated partitions
When a cpuset isolated partition is created / updated or destroyed, the interrupt threads are affined blindly to all the non-isolated CPUs. This happens without taking into account the interrupt threads initial affinity that becomes ignored. For example in a system with 8 CPUs, if an interrupt and its kthread are initially affine to CPU 5, creating an isolated partition with only CPU 2 inside will eventually end up affining the interrupt kthread to all CPUs but CPU 2 (that is CPUs 0,1,3-7), losing the kthread preference for CPU 5. Besides the blind re-affining, this doesn't take care of the actual low level interrupt which isn't migrated. As of today the only way to isolate non managed interrupts, along with their kthreads, is to overwrite their affinity separately, for example through /proc/irq/ To avoid doing that manually, future development should focus on updating the interrupt's affinity whenever cpuset isolated partitions are updated. In the meantime, cpuset shouldn't fiddle with interrupt threads directly. To prevent from that, set the PF_NO_SETAFFINITY flag to them. This is done through kthread_bind_mask() by affining them initially to all possible CPUs as at that point the interrupt is not started up which means the affinity of the hard interrupt is not known. The thread will adjust that once it reaches the handler, which is guaranteed to happen after the initial affinity of the hard interrupt is established. Suggested-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Frederic Weisbecker <frederic@kernel.org> Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Ingo Molnar <mingo@kernel.org> Link: https://patch.msgid.link/20251121143500.42111-3-frederic@kernel.org
1 parent 68775ca commit 801afdf

1 file changed

Lines changed: 15 additions & 8 deletions

File tree

kernel/irq/manage.c

Lines changed: 15 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1408,16 +1408,23 @@ setup_irq_thread(struct irqaction *new, unsigned int irq, bool secondary)
14081408
* references an already freed task_struct.
14091409
*/
14101410
new->thread = get_task_struct(t);
1411+
14111412
/*
1412-
* Tell the thread to set its affinity. This is
1413-
* important for shared interrupt handlers as we do
1414-
* not invoke setup_affinity() for the secondary
1415-
* handlers as everything is already set up. Even for
1416-
* interrupts marked with IRQF_NO_BALANCE this is
1417-
* correct as we want the thread to move to the cpu(s)
1418-
* on which the requesting code placed the interrupt.
1413+
* The affinity can not be established yet, but it will be once the
1414+
* interrupt is enabled. Delay and defer the actual setting to the
1415+
* thread itself once it is ready to run. In the meantime, prevent
1416+
* it from ever being re-affined directly by cpuset or
1417+
* housekeeping. The proper way to do it is to re-affine the whole
1418+
* vector.
14191419
*/
1420-
set_bit(IRQTF_AFFINITY, &new->thread_flags);
1420+
kthread_bind_mask(t, cpu_possible_mask);
1421+
1422+
/*
1423+
* Ensure the thread adjusts the affinity once it reaches the
1424+
* thread function.
1425+
*/
1426+
new->thread_flags = BIT(IRQTF_AFFINITY);
1427+
14211428
return 0;
14221429
}
14231430

0 commit comments

Comments
 (0)