So far, I have not found anything Media Foundation can do that DirectShow can’t. The mechanism for creating a fake video camera in DShow is well-understood. Why not just do it in DShow?
Clearly, it is possible to write a Media Source or Source Reader object that produces a camera-like video stream, and the magic that gets a DirectShow filter recognized as a video camera is mostly registry entries, so doing the same thing in Media Foundation is probably possible too.