Interrupt Routing -- was: Question abuot masking interrupt

Jake_Oshins · July 30, 2009, 1:32am

“Joseph M. Newcomer” wrote in message
news:xxxxx@ntdev…
> Note that on a multiprocessor, in general if an interrupt is blocked on
> CPUn because CPUn is running at a certain DIRQL level, the interrupt
> is rerouted by the hardware to interrupt CPUm for m != n, if it is
> interruptible. So it is possible to have as many interrupts running as
> CPUs
> in a fully-symmetric multiprocessor system (the world changes in Vista,
> which supports asymmetric device connections that might interrupt only a
> subset of the CPUs, as specified by the “affinity mask” that specifies the
> CPUs that are allowed to/able to handle the interrupt).

Joe, this hasn’t really been true since the PentiumIII, which had an APIC
bus. The only Pentium4 or later chipset that actually would route
interrupts to a low-priority processor came from ServerWorks, and they’re
gone now (or indistinguishably subsumed by Broadcom.)

Intel’s chipsets just route interrupts to the lowest numbered processor
which is in the target set, and many others followed their example.

So, in practice, interrupts that are targeted at a set of processors will
not be handled simultaneously with interrupts that are targeted at the same
set.

And, as you note, with MSI-X it’s possible to target specific processors
with associated messages. This technique has been been so useful that
effectively all high-end networking and storage hardware already uses it.

–
Jake Oshins
(former interrupt guy, author of the interrupt routing code in Windows)
Windows Kernel Group

This post implies no warranties and confers no rights.

--------------------------------------------------------------

From: xxxxx@lists.osr.com
[mailto:xxxxx@lists.osr.com] On Behalf Of Skywing
Sent: Friday, July 24, 2009 12:06 PM
To: Windows System Software Devs Interest List
Subject: RE: [ntdev] Question abuot masking interrupt

That is the intention of the word “masked” in this particular instance.

- S

From: sivakumar thulasimani
Sent: Friday, July 24, 2009 01:55
To: Windows System Software Devs Interest List
Subject: Re: [ntdev] Question abuot masking interrupt
Being in any (X) IRQL does not mask any interrupts that are handled at a
lower (Y) IRQL. All the system does is it makes sure that your code which is
running at X IRQL will continue to run till it finished its function or till
it receives another interrupt which is handled by at even higher IRQL. The
interrupts for any lower (Y) IRQL are still “regisiterd” ( dont know the
exact technical word here, so am using my own) and will be handled when the
IRQL is reduced to the appropriate level. Hope that clears your doubt

rtshiva
2009/7/24
I found this on wiki, seems answer my previous question:
However, it is fairly easy for an edge triggered interrupt to be missed -
for example if interrupts have to be masked for a period - and unless there
is some type of hardware latch that records the event it is impossible to
recover. Such problems caused many “lockups” in early computer hardware
because the processor did not know it was expected to do something. More
modern hardware often has one or more interrupt status registers that latch
the interrupt requests; well written edge-driven interrupt software often
checks such registers to ensure events are not missed.

-----Original Message-----
From: xxxxx@lists.osr.com
[mailto:xxxxx@lists.osr.com] On Behalf Of
xxxxx@viatech.com.cn
Sent: Friday, July 24, 2009 4:05 PM
To: Windows System Software Devs Interest List
Subject: RE: [ntdev] Question abuot masking interrupt

Another question:
If we mask the interrupt at and less than the interrupt currently servicing,
Is there any possibility that EDGE triggered lower prioprity interrupt
LOST? (seems level trigged interrupt will not be lost).

Thanks.
HW

-----Original Message-----
From: xxxxx@lists.osr.com
[mailto:xxxxx@lists.osr.com] On Behalf Of
xxxxx@viatech.com.cn
Sent: Friday, July 24, 2009 3:52 PM
To: Windows System Software Devs Interest List
Subject: [ntdev] Question abuot masking interrupt

Hello everyone.
I am a newbie in windows kernel and is now reading “windows internals” by
Mark E. Russinovich, David A. Solomon.
I have a question about masking interrupt.
In chapter 3:
The books saids:
/*Quote begin
interrupts from a source with an IRQL above the current level interrupt the
processor, whereas interrupts from sources with IRQLs equal to or below the
current level are masked until an executing thread lowers the IRQL.
Quote end */

Here is my question:
Why bother masking the interrupts whose priority is lower than current
interrupt level?
The lower priority interrupt can’t preempt high priority interrupt per se.
Does masking make any difference?

Can anyone help me on this question? Thanks!

Best regards,
HW

Email secured by Check Point at OSR.COM

—
NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

Email secured by Check Point at OSR.COM

—
NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

Email secured by Check Point at OSR.COM

—
NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

Email secured by Check Point at OSR.COM

= — NTDEV is sponsored by OSR For our schedule of WDF, WDM, debugging and
other seminars visit: http://www.osr.com/seminars To unsubscribe, visit the
List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

—
NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

–
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

Jake_Oshins · July 30, 2009, 1:46am

Max, interrupt affinity was in NT long before Vista. But the contract was
fundamentally broken.

Old contract:

You don’t get any influence over the bits that are set in your affinity mask
as it is handed to you by the HAL (or, later, the PnP manager from the HAL.)
You do, however, get to remove any bits which were set, as long as you leave
at least one bit set when you call IoConnectInterrupt.

New contract (Vista and later):

You do get to state a policy preference before the PnP manager (and not the
HAL) chooses your affinity mask. You must call IoConnectInterrupt[Ex] with
the values that you were handed. Your policy preference will be honored
unless you are physically sharing an interrupt line with a device that is
already connected and running and which expressed a different policy
preference (or none at all.)

I thought long and hard about whether I could change that contract, and
usually the answer would have been no. But, since the end result of the old
contract was a machine deadlock if any driver which was sharing interrupts
actually did remove bits from its affinity mask, I felt justified in
changing it. (If you don’t accept my very brief explanation here, please
search the archives. I’ve covered this topic more than a few times.)

–
Jake Oshins
Hyper-V I/O Architect
Windows Kernel Group

This post implies no warranties and confers no rights.

“Maxim S. Shatskih” wrote in message
news:xxxxx@ntdev…
>>multiprocessor system (the world changes in Vista, which supports
>>asymmetric device connections
>>that might interrupt only a subset of the CPUs, as specified by the
>>“affinity mask” that specifies the
>>CPUs that are allowed to/able to handle the interrupt).
>
> I think interrupt affinity was always in NT long before Vista.
>
> –
> Maxim S. Shatskih
> Windows DDK MVP
> xxxxx@storagecraft.com
> http://www.storagecraft.com

Mark_Roddy · July 30, 2009, 8:02am

Indeed you are correct, I grossly over-simplified the relationship
between IRQ and priority.

I am dubious about the merits of a general purpose OS that provides a
configurable interrupt priority scheme. My devices would always want
the highest priority available. Conflicts would continue to exist and
programmers would be deluded into thinking that they could configure
their way around latency problems by adjusting interrupt priority.

As MP has become pervasive to the point where 4-way systems are
commodity, a platform guarantee that interrupts are always spread
across processors seems to be a better way of dealing with isr/dpc
latency.

Mark Roddy

On Wed, Jul 29, 2009 at 5:29 PM, wrote:
>> Only that just hasn’t been true for a long time for device interrupt levels, which are just
>> more or less arbitrarily assigned based on pci wiring on the board and not on any importance
>> of a device that happens to be plugged into one slot or another.
>
>
> ?It depends on interrupt controller on the target motherboard. If it is “good old PIC”, then interrupt priority is ,indeed, implied by IRQ number, so that your statement about priority depending “on pci wiring on the board” applies. However, IOAPIC breaks this dependency completely - it allows you to map IRQ to any vector above 32. Once priority is associated with vector, rather than IRQ, on APIC-based system, you can assign any priority to ?a given IRQ.
>
> In any case, Joe’s statement about device priority is completely false - if you want to assign a certain priority to all devices of a given class, you will have to implement a custom HAL that I mentioned in my previous post (and deal with the dilemma of handling the situation when devices of ?different classes happen to signal interrupts via the same pin)…
>
> Anton Bassov
>
>
> —
> NTDEV is sponsored by OSR
>
> For our schedule of WDF, WDM, debugging and other seminars visit:
> http://www.osr.com/seminars
>
> To unsubscribe, visit the List Server section of OSR Online at http://www.osronline.com/page.cfm?name=ListServer
>

OSR_Community_User · July 30, 2009, 10:29am

Thanks for the clarification. Sad that in the default case all interrupts
get funneled through a single processor. Surprising that we haven’t figured
out how to do this right after 40 years of multiprocessor development.

When you refer to “target set” are you referring to the hardware
architecture or can this be controlled by the use of the affinity mask?
joe

-----Original Message-----
From: xxxxx@lists.osr.com
[mailto:xxxxx@lists.osr.com] On Behalf Of Jake Oshins
Sent: Thursday, July 30, 2009 1:32 AM
To: Windows System Software Devs Interest List
Subject: [ntdev] Interrupt Routing – was: Question abuot masking interrupt

“Joseph M. Newcomer” wrote in message
news:xxxxx@ntdev…
> Note that on a multiprocessor, in general if an interrupt is blocked on
> CPUn because CPUn is running at a certain DIRQL level, the interrupt
> is rerouted by the hardware to interrupt CPUm for m != n, if it is
> interruptible. So it is possible to have as many interrupts running as
> CPUs
> in a fully-symmetric multiprocessor system (the world changes in Vista,
> which supports asymmetric device connections that might interrupt only a
> subset of the CPUs, as specified by the “affinity mask” that specifies the
> CPUs that are allowed to/able to handle the interrupt).

Joe, this hasn’t really been true since the PentiumIII, which had an APIC
bus. The only Pentium4 or later chipset that actually would route
interrupts to a low-priority processor came from ServerWorks, and they’re
gone now (or indistinguishably subsumed by Broadcom.)

Intel’s chipsets just route interrupts to the lowest numbered processor
which is in the target set, and many others followed their example.

So, in practice, interrupts that are targeted at a set of processors will
not be handled simultaneously with interrupts that are targeted at the same
set.

And, as you note, with MSI-X it’s possible to target specific processors
with associated messages. This technique has been been so useful that
effectively all high-end networking and storage hardware already uses it.

–
Jake Oshins
(former interrupt guy, author of the interrupt routing code in Windows)
Windows Kernel Group

This post implies no warranties and confers no rights.

--------------------------------------------------------------

From: xxxxx@lists.osr.com
[mailto:xxxxx@lists.osr.com] On Behalf Of Skywing
Sent: Friday, July 24, 2009 12:06 PM
To: Windows System Software Devs Interest List
Subject: RE: [ntdev] Question abuot masking interrupt

That is the intention of the word “masked” in this particular instance.

- S

From: sivakumar thulasimani
Sent: Friday, July 24, 2009 01:55
To: Windows System Software Devs Interest List
Subject: Re: [ntdev] Question abuot masking interrupt
Being in any (X) IRQL does not mask any interrupts that are handled at a
lower (Y) IRQL. All the system does is it makes sure that your code which is

running at X IRQL will continue to run till it finished its function or till

it receives another interrupt which is handled by at even higher IRQL. The
interrupts for any lower (Y) IRQL are still “regisiterd” ( dont know the
exact technical word here, so am using my own) and will be handled when the
IRQL is reduced to the appropriate level. Hope that clears your doubt

rtshiva
2009/7/24
I found this on wiki, seems answer my previous question:
However, it is fairly easy for an edge triggered interrupt to be missed -
for example if interrupts have to be masked for a period - and unless there
is some type of hardware latch that records the event it is impossible to
recover. Such problems caused many “lockups” in early computer hardware
because the processor did not know it was expected to do something. More
modern hardware often has one or more interrupt status registers that latch
the interrupt requests; well written edge-driven interrupt software often
checks such registers to ensure events are not missed.

-----Original Message-----
From: xxxxx@lists.osr.com
[mailto:xxxxx@lists.osr.com] On Behalf Of
xxxxx@viatech.com.cn
Sent: Friday, July 24, 2009 4:05 PM
To: Windows System Software Devs Interest List
Subject: RE: [ntdev] Question abuot masking interrupt

Another question:
If we mask the interrupt at and less than the interrupt currently servicing,

Is there any possibility that EDGE triggered lower prioprity interrupt
LOST? (seems level trigged interrupt will not be lost).

Thanks.
HW

-----Original Message-----
From: xxxxx@lists.osr.com
[mailto:xxxxx@lists.osr.com] On Behalf Of
xxxxx@viatech.com.cn
Sent: Friday, July 24, 2009 3:52 PM
To: Windows System Software Devs Interest List
Subject: [ntdev] Question abuot masking interrupt

Hello everyone.
I am a newbie in windows kernel and is now reading “windows internals” by
Mark E. Russinovich, David A. Solomon.
I have a question about masking interrupt.
In chapter 3:
The books saids:
/*Quote begin
interrupts from a source with an IRQL above the current level interrupt the
processor, whereas interrupts from sources with IRQLs equal to or below the
current level are masked until an executing thread lowers the IRQL.
Quote end */

Here is my question:
Why bother masking the interrupts whose priority is lower than current
interrupt level?
The lower priority interrupt can’t preempt high priority interrupt per se.
Does masking make any difference?

Can anyone help me on this question? Thanks!

Best regards,
HW

Email secured by Check Point at OSR.COM

—
NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

Email secured by Check Point at OSR.COM

—
NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

Email secured by Check Point at OSR.COM

—
NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

Email secured by Check Point at OSR.COM

= — NTDEV is sponsored by OSR For our schedule of WDF, WDM, debugging and
other seminars visit: http://www.osr.com/seminars To unsubscribe, visit the
List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

—
NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

–
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

—
NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

–
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

anton_bassov · July 30, 2009, 10:34am

> I am dubious about the merits of a general purpose OS that provides a configurable

interrupt priority scheme. My devices would always want the highest priority available.

…which, in turn, raises questions about the reasoning behind prioritizing hardware interrupts to one another, in the first place. To begin with, the same device may interrupt for different reasons (for example, send completion and data arrival on NIC), and sometimes these reasons may, from the logical point of view, imply different priorities of interrupts handling. I think it would be much more reasonable to treat all hardware interrupts (apart from timer, of course) as equals while allowing a wide range of priorities of software interrupts that ISRs defer the work to, and enforcing the requirement for ISRs to do as little work as possible (i.e. check the reason for interrupt, program device registers to stop in from interrupting and request software interrupt that does the further processing)…

Anton Bassov

OSR_Community_User · July 30, 2009, 10:37am

Interrupt affinity was there from the beginning; there was even an
asymmetric 386 processor (from Compaq) that I understand required this. But
it was essentially meaningless until Vista. The affinity mask that was
returned was always the system processor mask.
joe

-----Original Message-----
From: xxxxx@lists.osr.com
[mailto:xxxxx@lists.osr.com] On Behalf Of Maxim S. Shatskih
Sent: Wednesday, July 29, 2009 5:37 PM
To: Windows System Software Devs Interest List
Subject: Re:[ntdev] Question abuot masking interrupt

multiprocessor system (the world changes in Vista, which supports
asymmetric device connections
that might interrupt only a subset of the CPUs, as specified by the
“affinity mask” that specifies the
CPUs that are allowed to/able to handle the interrupt).

I think interrupt affinity was always in NT long before Vista.

–
Maxim S. Shatskih
Windows DDK MVP
xxxxx@storagecraft.com
http://www.storagecraft.com

NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

–
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

OSR_Community_User · July 30, 2009, 10:41am

I’d hardly call it “completely false” if the only solution is to write a
custom HAL!

Is it true or false in an out-of-the-box Windows installation that setting
device priority is entirely fixed by the time the device driver writer sees
it?
joe

-----Original Message-----
From: xxxxx@lists.osr.com
[mailto:xxxxx@lists.osr.com] On Behalf Of
xxxxx@hotmail.com
Sent: Wednesday, July 29, 2009 5:30 PM
To: Windows System Software Devs Interest List
Subject: RE:[ntdev] Question abuot masking interrupt

Only that just hasn’t been true for a long time for device interrupt
levels, which are just
more or less arbitrarily assigned based on pci wiring on the board and not
on any importance
of a device that happens to be plugged into one slot or another.

It depends on interrupt controller on the target motherboard. If it is
“good old PIC”, then interrupt priority is ,indeed, implied by IRQ number,
so that your statement about priority depending “on pci wiring on the board”
applies. However, IOAPIC breaks this dependency completely - it allows you
to map IRQ to any vector above 32. Once priority is associated with vector,
rather than IRQ, on APIC-based system, you can assign any priority to a
given IRQ.

In any case, Joe’s statement about device priority is completely false - if
you want to assign a certain priority to all devices of a given class, you
will have to implement a custom HAL that I mentioned in my previous post
(and deal with the dilemma of handling the situation when devices of
different classes happen to signal interrupts via the same pin)…

Anton Bassov

NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=ListServer

–
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

Don_Burn_1 · July 30, 2009, 10:43am

Especially since the HAL kit is no longer available so there are no custom
HAL’s for current OS’es.

–
Don Burn (MVP, Windows DKD)
Windows Filesystem and Driver Consulting
Website: http://www.windrvr.com
Blog: http://msmvps.com/blogs/WinDrvr

“Joseph M. Newcomer” wrote in message
news:xxxxx@ntdev…
> I’d hardly call it “completely false” if the only solution is to write a
> custom HAL!
>
> Is it true or false in an out-of-the-box Windows installation that setting
> device priority is entirely fixed by the time the device driver writer
> sees
> it?
> joe
>
> -----Original Message-----
> From: xxxxx@lists.osr.com
> [mailto:xxxxx@lists.osr.com] On Behalf Of
> xxxxx@hotmail.com
> Sent: Wednesday, July 29, 2009 5:30 PM
> To: Windows System Software Devs Interest List
> Subject: RE:[ntdev] Question abuot masking interrupt
>
>> Only that just hasn’t been true for a long time for device interrupt
> levels, which are just
>> more or less arbitrarily assigned based on pci wiring on the board and
>> not
> on any importance
>> of a device that happens to be plugged into one slot or another.
>
>
> It depends on interrupt controller on the target motherboard. If it is
> “good old PIC”, then interrupt priority is ,indeed, implied by IRQ number,
> so that your statement about priority depending “on pci wiring on the
> board”
> applies. However, IOAPIC breaks this dependency completely - it allows you
> to map IRQ to any vector above 32. Once priority is associated with
> vector,
> rather than IRQ, on APIC-based system, you can assign any priority to a
> given IRQ.
>
> In any case, Joe’s statement about device priority is completely false -
> if
> you want to assign a certain priority to all devices of a given class, you
> will have to implement a custom HAL that I mentioned in my previous post
> (and deal with the dilemma of handling the situation when devices of
> different classes happen to signal interrupts via the same pin)…
>
> Anton Bassov
>
>
> —
> NTDEV is sponsored by OSR
>
> For our schedule of WDF, WDM, debugging and other seminars visit:
> http://www.osr.com/seminars
>
> To unsubscribe, visit the List Server section of OSR Online at
> http://www.osronline.com/page.cfm?name=ListServer
>
> –
> This message has been scanned for viruses and
> dangerous content by MailScanner, and is
> believed to be clean.
>
>
>
> Information from ESET NOD32 Antivirus, version of virus
> signature database 4291 (20090730)
>
> The message was checked by ESET NOD32 Antivirus.
>
> http://www.eset.com
>
>
>

Information from ESET NOD32 Antivirus, version of virus signature database 4291 (20090730)

The message was checked by ESET NOD32 Antivirus.

http://www.eset.com

OSR_Community_User · July 30, 2009, 11:04am

If it were based on the PCI wiring, then presumably changing the slot =
into
which the card is plugged should change the interrupt priority;
experimentation on one motherboard indicated that this did not occur. =
The
board always appeared to have the same interrupt level no matter what =
slot
it was plugged into. The PCI BIOS and/or Windows always seemed to =
assign
the same priority value. Alas, it was a realtime board with tight
constraints on interrupt latency, and they never could get it to work =
right.
My own observation was that since (as they had told me) the engineer =
based
the design on his ability to reprogram the APIC on a bare (well, MS-DOS)
x86, he had erroneously assumed that this was going to be universally
possible on all machine architectures, for all operating systems. The
problem was solved by a board redesign with an onboard FIFO, but I =
didn’t
hear about this for a couple more years. That’s why I was curious as =
to
whether there really was any method available to reassign interrupt
priorities. =20

I have found far too many hardware designers assume that what they can =
do on
a bare or MS-DOS x86 is used as their guideline for doing minimum-cost
design, letting “the software” (whatever THAT means!) solve the rest of =
the
problems. The design I just described is only one of several that I =
have
seen over the last decade or so that suffered from this particular =
disease
of hardware designers. Ed Dekker has even more frightening stories =
about
hardware designers. I’m sure many of the other participants in this =
group
do, also.

This is not a new phenomenon; I saw the same problems on the PDP-11. I
remember having to open “priority windows” in the middle of one device
driver so the (lower-priority, real-time) tape controller could detect =
the
BOT mark and stop the reels before the tape spun off. Of course, this =
meant
I could get recursive interrupts. ARGH! (Funny, hardware designers in =
2009
seem to make the same design errors that were made in 1975…doesn’t =
anyone
LEARN? Doesn’t anyone TEACH principles of Bad Hardware Design?)
joe

-----Original Message-----
From: xxxxx@lists.osr.com
[mailto:xxxxx@lists.osr.com] On Behalf Of Mark Roddy
Sent: Wednesday, July 29, 2009 5:00 PM
To: Windows System Software Devs Interest List
Subject: Re: [ntdev] Question abuot masking interrupt

Only that just hasn’t been true for a long time for device interrupt
levels, which are just more or less arbitrarily assigned based on pci
wiring on the board and not on any importance of a device that happens
to be plugged into one slot or another.

Mark Roddy

On Wed, Jul 29, 2009 at 12:01 PM, Joseph M.
Newcomer wrote:
> Generally, the idea is that the reason the interrupt is at a =
=93higher
> priority=94 is because it is more important than the lower-priority
> interrupts.=A0 The consequence of allowing lower-priority interrupts =
is that
> an unimportant device that is interrupting frequently can consume all =
the
> CPU cycles, thus delaying the response to the more important device.
>
>
>
> This is a variant of what is called the =93priority inversion=94 =
problem,
where
> a low-priority thread sets a lock, then is preempted by higher =
priority
> threads, but then a high-priority thread tries to acquire the lock, =
and
ends
> up being blocked for an indeterminate and perhaps indefinitely long =
time
by
> the lower-priority thread. =A0The result of this is that the =
high-priority
> thread effectively gets cycles only when the lower-priority thread =
runs,
> thus effectively making it a low-priority thread (hence the term =
=93priority
> inversion=94).
>
>
>
> I have encountered several situations in which the priority assigned =
by
the
> system BIOS was inappropriate for the device. =A0Changing priorities =
is a
> difficult, or perhaps impossible, task.
>
>
>
> Note that on a multiprocessor, in general if an interrupt is blocked =
on
CPUn
> because CPUn is running at a certain DIRQL level, the interrupt is
rerouted
> by the hardware to interrupt CPUm for m !=3D n, if it is =
interruptible. =A0So
it
> is possible to have as many interrupts running as CPUs in a
fully-symmetric
> multiprocessor system (the world changes in Vista, which supports
asymmetric
> device connections that might interrupt only a subset of the CPUs, as
> specified by the =93affinity mask=94 that specifies the CPUs that are =
allowed
> to/able to handle the interrupt).
>
>
>
> For PCI, interrupts are level-triggered, so as long as the device =
holds
the
> interrupt line low, the interrupt is held pending.=A0 There is some
analogous
> mechanism for handling message-based interrupts (only Vista and beyond
have
> support for message-based interrupts).=A0 It is not clear that there =
was
ever
> a situation in which edge-triggered interrupts could be lost =
(certainly
not
> when I was doing MS-DOS device drivers for ISA cards! =A0I had to =
quite
often
> deal with arbitrarily-long-delayed edge-triggered interrupts!) =A0It =
is
> unlikely that something that worked back in the days of ISA would be =
lost
in
> modern architectures when such a failure could be disastrous.
>
>
>
> Note also that in order to prevent running a given ISR concurrently or
> sequentially-before-it-has-completed, a combination of CPU masking of
> this-or-lower interrupts combined with the interrupt spin lock in the
> KINTERRUPT object is used.
>
>
>
> Personally, I would be interested if someone knows how to change =
interrupt
> priorities on a given device.
>
> =
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=
=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0
joe
>
>
>
>
>
>
>
> From: xxxxx@lists.osr.com
> [mailto:xxxxx@lists.osr.com] On Behalf Of Skywing
> Sent: Friday, July 24, 2009 12:06 PM
>
> To: Windows System Software Devs Interest List
> Subject: RE: [ntdev] Question abuot masking interrupt
>
>
>
> That is the intention of the word “masked” in this particular =
instance.
>
> - S
>
>
>
>
> From: sivakumar thulasimani
> Sent: Friday, July 24, 2009 01:55
> To: Windows System Software Devs Interest List
> Subject: Re: [ntdev] Question abuot masking interrupt
>
> Being in any (X) IRQL does not mask any interrupts that are handled at =
a
> lower (Y)=A0IRQL. All the system does is it makes sure that your code =
which
is
> running at X IRQL will continue to run till it finished its function =
or
till
> it receives another interrupt which is handled by=A0at even higher =
IRQL. The
> interrupts for any lower (Y) IRQL are still “regisiterd” ( dont know =
the
> exact technical word here, so am using my own) and will be handled =
when
the
> IRQL is reduced to the appropriate level. Hope that clears your doubt
>
>
>
> rtshiva
>
> 2009/7/24
>
> I found this on wiki, seems answer my previous question:
> However, it is fairly easy for an edge triggered interrupt to be =
missed -
> for example if interrupts have to be masked for a period - and unless
there
> is some type of hardware latch that records the event it is impossible =
to
> recover. Such problems caused many “lockups” in early computer =
hardware
> because the processor did not know it was expected to do something. =
More
> modern hardware often has one or more interrupt status registers that
latch
> the interrupt requests; well written edge-driven interrupt software =
often
> checks such registers to ensure events are not missed.
>
> -----Original Message-----
> From: xxxxx@lists.osr.com
> [mailto:xxxxx@lists.osr.com] On Behalf Of
> xxxxx@viatech.com.cn
> Sent: Friday, July 24, 2009 4:05 PM
> To: Windows System Software Devs Interest List
>
> Subject: RE: [ntdev] Question abuot masking interrupt
>
> Another question:
> If we mask the interrupt at and less than the interrupt currently
servicing,
> Is there any possibility that EDGE triggered lower prioprity interrupt
> LOST? (seems level trigged interrupt will not be lost).
>
> Thanks.
> HW
>
> -----Original Message-----
> From: xxxxx@lists.osr.com
> [mailto:xxxxx@lists.osr.com] On Behalf Of
> xxxxx@viatech.com.cn
> Sent: Friday, July 24, 2009 3:52 PM
> To: Windows System Software Devs Interest List
> Subject: [ntdev] Question abuot masking interrupt
>
> Hello everyone.
> I am a newbie in windows kernel and is now reading “windows internals” =
by
> Mark E. Russinovich, David A. Solomon.
> I have a question about masking interrupt.
> In chapter 3:
> The books saids:
> /*Quote begin
> interrupts from a source with an IRQL above the current level =
interrupt
the
> processor, whereas interrupts from sources with IRQLs equal to or =
below
the
> current level are masked until an executing thread lowers the IRQL.
> Quote end */
>
> Here is my question:
> Why bother masking the interrupts whose priority is lower than current
> interrupt level?
> The lower priority interrupt can’t preempt high priority interrupt per =
se.
> Does masking make any difference?
>
> Can anyone help me on this question? Thanks!
>
> Best regards,
> HW
>
> Email secured by Check Point at OSR.COM
>
> —
> NTDEV is sponsored by OSR
>
> For our schedule of WDF, WDM, debugging and other seminars visit:
> http://www.osr.com/seminars
>
> To unsubscribe, visit the List Server section of OSR Online at
> http://www.osronline.com/page.cfm?name=3DListServer
>
>
> Email secured by Check Point at OSR.COM
>
> —
> NTDEV is sponsored by OSR
>
> For our schedule of WDF, WDM, debugging and other seminars visit:
> http://www.osr.com/seminars
>
> To unsubscribe, visit the List Server section of OSR Online at
> http://www.osronline.com/page.cfm?name=3DListServer
>
>
> Email secured by Check Point at OSR.COM
>
> —
> NTDEV is sponsored by OSR
>
> For our schedule of WDF, WDM, debugging and other seminars visit:
> http://www.osr.com/seminars
>
> To unsubscribe, visit the List Server section of OSR Online at
> http://www.osronline.com/page.cfm?name=3DListServer
>
>
> Email secured by Check Point at OSR.COM
>
> =3D — NTDEV is sponsored by OSR For our schedule of WDF, WDM, =
debugging
and
> other seminars visit: http://www.osr.com/seminars To unsubscribe, =
visit
the
> List Server section of OSR Online at
> http://www.osronline.com/page.cfm?name=3DListServer
>
> —
> NTDEV is sponsored by OSR
>
> For our schedule of WDF, WDM, debugging and other seminars visit:
> http://www.osr.com/seminars
>
> To unsubscribe, visit the List Server section of OSR Online at
> http://www.osronline.com/page.cfm?name=3DListServer
>
> —
> NTDEV is sponsored by OSR
>
> For our schedule of WDF, WDM, debugging and other seminars visit:
> http://www.osr.com/seminars
>
> To unsubscribe, visit the List Server section of OSR Online at
> http://www.osronline.com/page.cfm?name=3DListServer
> –
> This message has been scanned for viruses and
> dangerous content by MailScanner, and is
> believed to be clean.

—
NTDEV is sponsored by OSR

For our schedule of WDF, WDM, debugging and other seminars visit:=20
http://www.osr.com/seminars

To unsubscribe, visit the List Server section of OSR Online at
http://www.osronline.com/page.cfm?name=3DListServer

–=20
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

anton_bassov · July 30, 2009, 11:43am

> Is it true or false in an out-of-the-box Windows installation that setting device priority is entirely

fixed by the time the device driver writer sees it?

This one is true - this part gets established during HAL initialization, which, IIRC, is the very first thing kernel does upon initialization. However, device drivers (including boot ones) come into the play at much later stage when kernel is already up and running…

However, interrupt priority has nothing to do with the importance of a device itself - HAL cannot assign
hardware interrupt priority to any given device . The only thing it can assign priority to is IOAPIC pin ( I assume APIC HAL - on PIC -based machine HAL does not have any discretion even here). For PCI devices it may also try to assign a routing group to the device so that all devices of a given group signal interrupts via the same pin, and, hence, have the same interrupt priority. However, again, it does not always have a discretion here - some devices may be just physically wired together and hence be bound to signal interrupts via the same pin.

Therefore, if you want to be able to assign priority to a device you have to write custom HAL that implements interrupt priorities purely in a software. With such approach you can go as far as assigning different priorities to devices that share the same pin (although, once you cannot discover which particular device has actually interrupted without invoking all appropriate ISRs, it does not really seem to make any practical sense to do so)

Anton Bassov

anton_bassov · July 30, 2009, 12:08pm

> If it were based on the PCI wiring, then presumably changing the slot into which the card is

plugged should change the interrupt priority;

…but only on PIC-based machine…

experimentation on one motherboard indicated that this did not occur. The board always appeared
to have the same interrupt level no matter what slot it was plugged into.

This is because the OS can assign a priority to IOAPIC pin. . However, what you have described it is not necessarily going to be the case even on APIC-based machine if different devices are physically bound to share the same pin (unless device is MSI-capable and the OS decides to take advantage of its MSI capability, rather than making it signal interrupts via a pin - in such case it will be able to get dedicated vector, and, hence, priority)…

Anton Bassov

Daniel_Terhell · July 30, 2009, 12:56pm

“Joseph M. Newcomer” wrote in message
news:xxxxx@ntdev…
> When you refer to “target set” are you referring to the hardware
> architecture or can this be controlled by the use of the affinity mask?
> joe
>

I 'm asserting that what he is referring to is the ProcessorEnableMask
affinity that you specify when calling IoConnectInterrupt which causes
HalEnable(System)Interrupt to be called for each processor.
HalDisable(System)Interrupt would give you some control to decide what
interrupts were connected to a specific CPU but these functions seem no
longer documented since they changed name and switched to a new format.

//Daniel

Peter_Viscarola_OSR · July 30, 2009, 1:02pm

Totally. +1

This seems so obvious to me, that I figure it MUST have been tried and proven unacceptable for some reason. I’d love to know some of the back-story, if somebody in the know wants to provide it…

Peter
OSR

anton_bassov · July 30, 2009, 4:36pm

> I 'm asserting that what he is referring to is the ProcessorEnableMask affinity that you specify

when calling IoConnectInterrupt which causes HalEnable(System)Interrupt to be
called for each processor.

Please note that it is IOAPIC’s redirection table entry that specifies CPUs that may get interrupted by a given
source and not the other way around - otherwise you could get into a situation when the same interrupt source maps to different vectors on different CPUs, which implies the same source could have different priorities for different CPUs…

Anton Bassov

anton_bassov · July 30, 2009, 4:39pm

Peter,

This seems so obvious to me, that I figure it MUST have been tried and proven unacceptable
for some reason.

Does not APIC bus arbitration protocol ensure that interrupt gets dispensed to the least busy CPU among the ones that are allowed to be interrupted by a given source???

Anton Bassov

Daniel_Terhell · July 30, 2009, 5:40pm

Sorry, wrong usage of ‘assert’. It was more like I wanted to see if this
throws an exception or not.

Is it not that the I/O APIC (which belongs to the chipset) directs
interrupts to the local APICs based on their IDTs ? Or is it that the I/O
APIC is programmed separately in this manner ? Unfortunately there is not
much information on the I/O APIC or this redirection table in the public
Intel manuals, they say you need to contact them manually.

//Daniel

wrote in message news:xxxxx@ntdev…
>> I 'm asserting that what he is referring to is the ProcessorEnableMask
>> affinity that you specify
>> when calling IoConnectInterrupt which causes HalEnable(System)Interrupt
>> to be
>> called for each processor.
>
> Please note that it is IOAPIC’s redirection table entry that specifies
> CPUs that may get interrupted by a given
> source and not the other way around - otherwise you could get into a
> situation when the same interrupt source maps to different vectors on
> different CPUs, which implies the same source could have different
> priorities for different CPUs…
>
>
> Anton Bassov
>

anton_bassov · July 30, 2009, 6:57pm

Daniel,

Unfortunately there is not much information on the I/O APIC or this redirection table in the
public Intel manuals,

In fact, they just have a separate manual for IOAPIC, so that they don’t go into too much details about it in
their 3-volume developer’s manuals. Please find a link to IOAPIC manual below - this doc goes into all details of IOAPIC, including even pin layout.

they say you need to contact them manually.

Luckily, it is not as bad as that - they’ve got quite a few docs in a public domain. Just enter 'IOAPIC" into a search box on Intel site, and the very first link that you will get is http://www.intel.com/design/chipsets/datashts/290566.htm

If you want to discover the mapping of a particular device to particular IOAPIC pin, then you have to read BIOS -related docs as well. The link below may be quite helpful
http://www.intel.com/design/archives/processors/pro/docs/242016.htm.

Although, according to Jake, Windows does not even look at tables that above mentioned doc describes, and, instead, goes right to ACPI ones, I believe it may still helpful - after all, in terms of complexity, layout of these tables does not go anywhere close to ACPI ones, while there is a good chance that info
found in these tables may be valid even on machines with ACPI HAL…

If you want more than that, then you can download ACPI Specs. However, I must warn you in advance that, unlike Intel manuals, this is not the easiest read one would imagine…

Anton Bassov

OSR_Community_User · July 30, 2009, 7:14pm

The IOAPIC has a mode called lowest priority delivery, where the chipset chooses the right destination from a set of processors based on the interrupt priority of the various processors. The problem is that interrupt priority changes a lot. Like a whole lot. Acquiring spinlocks changes it, timers and other DPCs change it, and of course interrupts change it. And the information about priority is kept in the processor, but the chipset needs to act on it. The latency and overhead of transmitting this information meant that it was basically always stale. And so many systems gave up trying.

Some chipset/processor combinations forward to just one processor. Some round robin or hash based on vector number. All of these options are simpler than perfect lowest priority, but have negative effects in some workloads. But they exist, and new processors/chipsets are continuing to do this differently, so there is still a desire to get the distribution right.

Dave

James_Harper · July 30, 2009, 9:21pm

>

As MP has become pervasive to the point where 4-way systems are
commodity, a platform guarantee that interrupts are always spread
across processors seems to be a better way of dealing with isr/dpc
latency.

Totally. +1

This seems so obvious to me, that I figure it MUST have been tried and
proven
unacceptable for some reason. I’d love to know some of the
back-story, if
somebody in the know wants to provide it…

Having a read of the Linux mailing list archives where this is discussed
is an interesting (but lengthy) thing to do. Some of the more specific
discussions are obviously not relevant to windows but still interesting.

James

Jake_Oshins · July 31, 2009, 2:23am

Simply put, no it doesn’t.

First of all, none of us have seen a machine with an APIC bus in recent
memory. Second, when the APIC bus did exist, it guaranteed only that the
processor with the lowest TPR value got interrupted. But NT idles
processors (for various reasons) at DISPATCH_LEVEL. So the least busy
processor gets interrupted well after processors doing useful work at
PASSIVE_LEVEL.

–
Jake Oshins
Hyper-V I/O Architect
Windows Kernel Group

This post implies no warranties and confers no rights.

wrote in message news:xxxxx@ntdev…
> Peter,
>
>> This seems so obvious to me, that I figure it MUST have been tried and
>> proven unacceptable
>> for some reason.
>
> Does not APIC bus arbitration protocol ensure that interrupt gets
> dispensed to the least busy CPU among the ones that are allowed to be
> interrupted by a given source???
>
> Anton Bassov
>