Multi-Threading in Vulkan

original url: https://community.arm.com/graphics/b/blog/posts/multi-threading-in-vulkan

In my previous blog post I explained some of the key concepts of Vulkan and how we implemented them in our internal graphics engine. In this post I will go into a bit more detail about how we implemented multi-threading and some of the caveats to watch out for.

Quick background

Vulkan was created from the ground up to be thread-friendly and there's a huge amount of details in the spec relating to thread-safety and the consequences of function calls. In OpenGL, for instance, the driver might have a number of background threads working while waiting for API calls from the application. In Vulkan, this responsibility has moved up to the application level, so it's now up to you to ensure correct and efficient multi-threading behavior. This is a good thing since the application often has better visibility of what it wants to achieve.

Command pools

In Vulkan command buffers are allocated from command pools. Typically you pin command pools to a thread and only use this thread when writing to command buffers allocated from its command pool. Otherwise you need to externally synchronize access between the command buffer and the command pool which adds overhead.

Multi-Threading in Vulkan

For graphics use-cases you also typically pin a command pool per frame. This has the nice side-effect that you can simply reset the entire command pool once the work for the frame is completed. You can also reset individual command buffers, but it's often more efficient to just reset the entire command pool.

Coordinating work

In OpenGL, work is executed implicitly behind the scenes. In Vulkan this is explicit where the application submits command buffers to queues for execution.

Multi-Threading in Vulkan

Vulkan has the following synchronization primitives:

Semaphores - used to synchronize work across queues or across coarse-grained submissions to a single queue
Events and barriers - used to synchronize work within a command buffer or a sequence of command buffers submitted to a single queue
Fences - used to synchronize work between the device and the host

Queues have simple sync primitives for ordering the execution of command buffers. You can basically tell the driver to wait for a specific event before processing the submitted work and you can also get a signal for when the submitted work is completed. This synchronization is really important when it comes to submitting and synchronizing work to the swap chain. The following diagram shows how work can be recorded and submitted to the device queue for execution before we finally tell the device to present our frame to the display.

Multi-Threading in Vulkan

In the above sequence there is no overlap of work between different frames. Therefore, even though we're recording work to command buffers in multiple threads, we still have a certain amount of time where the CPU threads sit idle waiting for a signal in order to start work on the next frame.

Multi-Threading in Vulkan

This is much better. Here we start recording work for the next frame immediately after submitting the current frame to the device queue. All synchronization here is done using semaphores. vkAcquireNextImageKHR will signal a semaphore once the swap chain image is ready, vkQueueSubmit will wait for this semaphore before processing any of the commands and will signal another semaphore once the submitted commands are completed. Finally, vkQueuePresentKHR will present the image to the display, but it will wait for the signaled semaphore from vkQueueSubmit before doing so.

Summary

In this blog post I have given a brief overview of how to get overlap between CPU threads that record commands into command buffers over multiple frames. For our own internal implementation we found this really useful as it allowed us to start preparing work for the next frame very early on, ensuring the GPU is kept busy.

Multi-Threading in Vulkan

Quick background

Command pools

Coordinating work

Summary

繼續閱讀

Vulkan學習（五）: Command buffers & Rendering & PresentationFramebuffersCommand buffersRendering and presentationCode

Vulkan填坑學習Day26-1—組合圖像取樣器Vulkan 組合圖像取樣器一、更新描述符

[轉] Redefining the shading languages ecosystem with SPIR-V

Vulkan demo運作

Vulkan 基本原理

Vulkan_基于查詢池的遮擋剔除

【Vulkan】學習筆記2——執行個體(instance)、驗證層(validation layers)

Vulkan圖檔紋理使用方法

win10 vs2019 編譯vsg vsgXchange vsgExamples

vulkan記憶體配置設定類在參考下的嘗試實作（草稿

Vulkan Tutorial 3 建立邏輯裝置

lvepipeline.cpp lvepipeline.h

Vulkan Cascade Shadow Map的故事

Vulkan Barriers

Vulkan規範：第七章 7.1