Search code examples
videocomvideo-capturedirectshowwindows-10-desktop

Project and build structure for Microsoft DirectShow based virtual webcam application on Window 10


I am trying to create simplest virtual webcam application which can display image file on my local filesystem.

After initial research on stackoverflow links and seeing OBS Studio source code I got some idea how can I achieve this.

  1. I would need to use Microsoft DirectShow.

  2. I would need to develop one source filter that would work as capture filter using IBaseFilter

  3. I would need to develop another source filter that would work as output filter or virtual webcam filter. I would need to compile this filter as .dll file and will need to register using regsvr32.exe
    As given on https://learn.microsoft.com/en-us/windows/win32/directshow/building-directshow-filters

  4. I would need to create Filter Graph and Capture Filter Graph using CoCreateInstance like

    hr = CoCreateInstance(CLSID_FilterGraph, NULL, CLSCTX_INPROC_SERVER, IID_IFilterGraph, (void **)&graph);

    hr = CoCreateInstance(CLSID_CaptureGraphBuilder2, NULL, CLSCTX_INPROC_SERVER, IID_ICaptureGraphBuilder2, (void **)&builder);

  5. Then I would need to add these filters to Filter Graph

  6. Then I would set Filter Graph to Capture Filter Graph like hr = builder->SetFiltergraph(graph);

Here is my confusion now:
After these steps I am not sure if I have to wrap these Graph Filters and Capture Graph Filter in one application which would be having main method and compile it to get .exe file or I need to compile as another .dll file.

Or How should I wrap these steps to create final application?


Solution

  • I want to create simplest virtual webcam application which can output any image or video to virtual camera. That virtual camera should be visible as video device in online meetings like Google meet or zoom.

    There is no support for virtual web cameras in Windows as a unified API and what you are trying to achieve is, generally speaking, possible but far more complicated than a question of setup.

    The task can be decomposed into three parts, and you will be able to find past StackOverflow questions that elaborate all of the three (some references are given below).

    First, you need to resolve the problem of integration of a virtual camera into third party software. Per the statement I started from, the OS API offers no way for a generic virtual camera interface in terms of OS extensibility point that enables third party application "see" a new camera device.

    A popular way to inject a fake camera device into applications is virtual DirectShow video source (and respectively Vivek's VCam source code).

    The diagram from Registering a network video stream as a virtual camera describes the APIs used by applications to work with cameras and illustrates limitations of virtual DirectShow cameras, specifically why they are not visible by every video-enabled application in Windows.

    See also questions Virtual Driver Cam not recognized by browser and DirectShow filter is not shown as input capture device.

    All in all, to develop a virtual webcam for all and any application in Windows you would need to develop a driver, something few are ready to deal with.

    Newer Media Foundation API offers nothing to help with functionality of virtual webcam.

    Second, you need to define a method of injection of video frames into whatever virtual camera you develop. There is no need to use DirectShow or Media Foundation because in the end of the day all you need is to submit video frames to the back end of your virtual camera implementation and you are free to use any convenient method.

    Use of DirectShow for this task make sense overall, but you don't need to. If you are not familiar with the API and you are starting with basics of creation of a filter graph, then it is quite likely that it is easier to go with a non-DirectShow solution. If you need to mix a real webcam image into your feed, you can capture it with Media Foundation in particular. If you plan to use GPU services of sorts, Media Foundation would be a better API to use again. DirectShow still remains good option as API to build your pipeline on.

    Third, there is often a question of interprocess communication to connect the virtual camera implementation and the source of the video. In some cases it is not necessary, but more often it is just overlooked.

    A virtual DirectShow camera (or virtual Media Foundation camera if you, for example, will be detouring) is running in context of camera consuming process, and cameras in general might be accessed from multiple applications including simultaneously. Quite so often you expect to produce video from another [single] application, including the case of application of unlatching bitness/architecture, so you are to take care of the challenge of passing data between the processes. If you are in an attempt to develop a driver for virtual camera you will have the same task too.

    I mentioned aspects of this in MSDN question there: How to implement a "source filter" for splitting camera video based on Vivek's vcam?, then there Read USB camera's input edit and send the output to a virtual camera on Windows and also there How to create Directshow filter?.

    All in all, it is not a question of project setup. Instead, it is a set of quite sophisticated problems to solve (which are doable though, and we see examples of this).


    For virtual cameras in Windows 11, see also footnote in this answer.