Raising IRQL on one core blocks threads on other cores?

Hoogin · September 23, 2020, 1:54pm

While messing around, I thought I would test what happens if I were to raise the IRQL to DISPATCH_LEVEL and run a really long running loop (multiple seconds long). What I would have expected to happen is that no other threads would be able to be scheduled on that one core. What happened instead is that it seemed that no threads were able to run on either core during this period. Why is this? I’m really new to drivers and kernel mode, so please forgive me if I’ve missed something really obvious.

Peter_Viscarola_OSR · September 23, 2020, 4:36pm

What I would have expected to happen is that no other threads would be able to be scheduled on that one core

I agree. If you raise to IRQL DISPATCH_LEVEL then, yes… there will be no preemption on that core.

I can’t explain any other behavior.

Peter

Tim_Roberts · September 23, 2020, 4:38pm

Why do you think no threads could run on the other core? If you were testing this from an application, remember that whatever thread you usurped cannot continue anywhere else. If it was an application’s main thread, then that application is dead until you return.

Hoogin · September 23, 2020, 5:19pm

I mean that not just the application was stuck during the loop, the entire system was seemingly unresponsive as well. I would expect that the other core would still be able to run threads even while the other core had its IRQL raised.

anton_bassov · September 23, 2020, 6:14pm

I can’t explain any other behavior.

AFAIK, Windows designers have eliminated a global dispatcher lock and introduced Linux-like per-CPU runqueues quite a while ago.
Furthermore, I heard that the whole thing got implemented in such way that a thread assigned to CPU A’s runquue would not get re-assigned to run on CPU B until the re-balancing event is scheduled, even if the former one is overloaded and the later one has absolutely nothing to do at the moment.

If this is the case, I can easily see a plausible explanation to the observed behavior. What may have happened here is that core 1’s runqueue was empty while the OP ran his DISPATCH_LEVEL loop on core 0. As a result, core 0 was spinning at elevated IRQL and core 1 had nothing to do at the moment, which created the illusion of the suspended system

Anton Bassov

Peter_Viscarola_OSR · September 23, 2020, 6:18pm

I would expect that the other core would still be able to run threads

That is indeed how it works. That leaves us to a natural question about your testing procedure. How did you implement this test, exactly?

the entire system was seemingly unresponsive as well

Please define “unresponsive” — Did the cursor blink? The mouse move? The display update?

Over all, please say more. I agree with your expectation… I’d like to know why you’re seeing what you’re seeing.

P

Hoogin · September 23, 2020, 10:11pm

How I did it was I had a program that would send data to my driver via WriteFile. The driver receives this, raises the IRQL to DISPATCH_LEVEL, does the loop, then lowers the IRQL back to its original level.

And by unresponsive, I mean that the entire screen appears to be frozen, and the cursor does not move. Moving the mouse during this period does not affect where it ends up after it unfreezes. I also tried running an HTTP server on the system. When it was frozen, the HTTP server was also unresponsive.

Peter_Viscarola_OSR · September 23, 2020, 10:22pm

That is very curious, as it sounds to ME like there are no interrupts being serviced either.

Ah! I have to ask: VM or physical machine?

Peter

Pavel_A1 · September 24, 2020, 12:05am

The driver receives this, raises the IRQL to DISPATCH_LEVEL, does the loop, then lowers the IRQL back to its original level.

What exactly it does in the loop? Remember that many useful APIs cannot be called at DISPATCH.
If you call such APIs, the system won’t necessarily catch you (crash) but it may break.

– pa

Hoogin · September 24, 2020, 1:21am

@“Peter_Viscarola_(OSR)” said:
That is very curious, as it sounds to ME like there are no interrupts being serviced either.

Ah! I have to ask: VM or physical machine?

Peter

this is on a physical machine. It’s got a dual core. Kinda weird I think if interrupts aren’t being serviced.

@Pavel_A said:

The driver receives this, raises the IRQL to DISPATCH_LEVEL, does the loop, then lowers the IRQL back to its original level.

What exactly it does in the loop? Remember that many useful APIs cannot be called at DISPATCH.
If you call such APIs, the system won’t necessarily catch you (crash) but it may break.

– pa

The loop looks like:

int i;
volatile int t;
for(i = 0; i < 1000000000; i++) t++;

Tim_Roberts · September 24, 2020, 3:01am

Dual core, or one core with two hyperthreads? I believe interrupts are handled differently.

Allow me to introduce you to KeStallExecutionProcessor.

Hoogin · September 24, 2020, 11:22am

I entirely neglected to consider if it had hyperthreads. Viewing it in CPU-Z, it reports 2 cores and 2 threads. Task Manager also shows 2 cores, so it seems like it doesn’t use hyperthreads.

And as for KeStallExecutionProcessor, that would have made testing this a lot more elegant.

anton_bassov · September 24, 2020, 12:50pm

it sounds to ME like there are no interrupts being serviced either.

Well, taking into account that Windows interrupt processing is mostly done in DPC’s, rather than ISRs, I am not really surprized
with this part. If interrupts are mostly routed to the core that spins at elevated IRQL, this delay in interrupt processing is perfectly understandable - an ISR queues a DPC to the target core’s DPC queue (a normal priority DPC is queued to the same CPU that has serviced an ISR,right), and this DPC cannot start running until IRQL drops below DISPATCH_LEVEL. Taking into consideration that most threads are IO-driven and wait for the inputs, the whole system gets eventually suspended, and freezes until the data from the hardware arrives

Anton Bassov

Peter_Viscarola_OSR · September 24, 2020, 1:01pm

Mr. Bassov’s observations are correct, but I’m not buying his conclusions. Unless the OP isn’t carefully describing his situation.

Most of us have seen cores get “lost” by a deadlock on a spin lock; I know I certainly have. That’s basically what the OP is describing. The system can seem OK for “a while”… then it eventually descends into a sort of weird, dead, state for the reasons Mr. Bassov lists. But I wouldn’t describe this as looking like “no threads were able to run on either core” or the system being “unresponsive.” This happens in pieces and gradually, yes. But only after some time.

Peter

Hoogin · September 24, 2020, 2:09pm

I don’t suppose that somewhere outside my code a spinlock is acquired, and a thread on the other core is waiting for it? I suppose that type of situation would cause the other core to stay spinning while it is waiting for the first core to release its lock. Please correct me if I’m misunderstanding something.

Mark_Roddy · September 24, 2020, 3:23pm

I don’t know, but if I had this problem I would use windbg to look at what
the fork was going on with all the other ‘cpus’.

Mark Roddy

anton_bassov · September 24, 2020, 5:06pm

Most of us have seen cores get “lost” by a deadlock on a spin lock;

Well, this is a slightly different story, albeit a similar once

In case of a spinlock- related deadlock you are,indeed, just bound to deadlock completely at some point. If a CPU tries
to acquire the target spinlock in this situation, it is just bound to deadlock. Once spinlocks normally get acquired on the code paths that happen to be generally useful, and, hence, get executed on more or less regular basis, all cores are eventually going to get out of play at some point, trying to acquire a lock that will never ever get released.

However, in our (HYPOTHETICAL,of course) , case, we lose that ability to execute DPCs that queued to the DPC queue of the spinning core.
The same DPC cannot get enqueued more than once, can it? Therefore, every time an interrupt gets routed to that target core, we are just bound to “lose” a DPC that ISR queues - they will get placed to the target core’s queue.

then it eventually descends into a sort of weird, dead, state for the reasons Mr. Bassov lists

Fair enough - in case of a spinlock-related deadlock you may start noticing some “funny” effects well before all CPUs are actually deadlocked, and it is going to happen because of “DPC starvation” that I have described above

Another point to consider is that, in order to get the observations that the OP gets (i.e. of a frozen GUI), all we have to do is to “lose” a SINGLE
DPC (i.e. the one that gets queued by the mouseclass driver) this way…

Anton Bassov

Peter_Viscarola_OSR · September 24, 2020, 7:11pm

What a long reply. Concision is a virtue, Mr. Bassov.

In case of a spinlock- related deadlock you are, indeed, just bound to deadlock completely at some point

Well… yes or no. It depends on the paths in which the spin-lock is acquired and what’s happening on the device.

In case of a spinlock-related deadlock you may start noticing some “funny” effects well before all CPUs are actually deadlocked, and it is going to happen because of “DPC starvation” that I have described above

Which, you know, is what I said…

the observations that the OP gets (i.e. of a frozen GUI), all we have to do is to “lose” a SINGLE DPC

Well… yes? More accurately, we need the USB HCD to queue a DPC on the core that is no longer servicing the DPC list (because it’s “otherwise engaged” at IRQL DISPATCH_LEVEL). It’d be bad luck for this to happen right away. And even in this case, you’d still see other UI elements working. The keyboard won’t immediately die.

In summary, as interrupts eventually get routed to the “busy” core, the DPCs that service the “bottom half” of those interrupts will be blocked, and no further DPC will be queued for those device instances for which a DPC is pending, as Mr. Bassov correctly notes.

So, again… the system “eventually descends into a sort of weird, dead, state” – “eventually” here might be a few seconds. But I can’t see this being immediate.

Mr. Roddy’s comment is really the most insightful: Just look at what’s happening with WinDbg, and we can all stop guessing?

Peter

Hoogin · September 24, 2020, 11:33pm

Wow, thank you all for the very insightful replies. As it stands, I’ve been testing this on my local machine, which may not be the best idea. If I can get it working in a Virtual Machine then I think I can take a look with WinDbg what’s going on. Very new to this all, so I apologize for my ignorances.

Peter_Viscarola_OSR · September 25, 2020, 12:10am

Hmmmmm… I’m not sure I’d trust results on a VM.

Get a (physical machine) test system hooked up.

ETA: More cores would be better, as well. Get a test system with 4 or 8 cores and you should definitely see the slower degradation we’d expect. On a 2 core machine, you have a 50% chance of interrupts happening on the “locked” core. Also, get something with GUI elements running (they’ll run until, you know, the graphics card gets hung).

Peter