-
Notifications
You must be signed in to change notification settings - Fork 362
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
vmaCreatePool failing at startup on some Nvidia gpu + driver combos #390
Comments
If I can't rely on allocating any amount of memory from a heap that reports 210 mb, then I'm not quite sure how to build an app. I realize these are heap totals, and not free memory. I realize that we're not the only app using the gpu, but having a basic allocation like this fail at startup is hard to resolve. Worse case, I can probably fixup code to fallback to HOST instead instead DEVICE_LOCAL memory if an OOM error occurs at startup. |
Regarding ReBAR: This is not something you either have or don't have. If supported by the GPU + motherboard + rest of the system, you can enable/disable it in the BIOS/UEFI at system startup, so you should be able to test it yourself without ReBAR enabled. Regarding available memory: Vulkan does provide a way to query for available budget per memory heap - see extension VK_EXT_memory_budget, which is wrapped in VMA by function If you are pretty certain that is error code is returned from Vulkan, then this problem is not a bug in VMA - it is out of scope of this project and I think it should rather be discussed in a repository about Vulkan. Although VMA works cross-platform and cross-GPU-vendor, I work at AMD, so for issues specific to Nvidia I recommend to reach out to one of their Developer Technology Engineers. Hopefully they can provide more information about what is happening on their platform. I think that when ReBAR is not enabled and you see only the classic 256 MB BAR memory ( You may find these articles useful: |
Ah, the budget thing is useful. I'm initializing that for VMA, but have never looked at those numbers. It's never clear if I need to init it, since it's folded into 1.1 already. I'll try to add those budgets to my logs. I'll also try to see if there's a control for ReBAR. I didn't see one in the HP bios last I looked, but may not have a consistent name. I'll just try to summarize any findings in this issue. So nothing to act upon until I get to the bottom of all this. And I may just fallback to HOST memory instead of DEVICE_LOCAL for that pool if OOM occurs. |
I'm back revisiting this. Seems to mostly occur on Win11. Like it's locked down all of memory type 4, and then VMA can't allocate our 4MB uniform pool. It's unclear how to map memory type 4 to one of the heaps, but I assume it's just the heaps in their sequential order. In all your examples, the ordinal starts at 0 on each heap. So that makes it hard to relate what VMA is looking up for a COHERENT + UNIFORM_BIT lookup. And I think I want memory type 2 if that fails, but not sure how to sway vmaFindMemoryTypeIndexForBufferInfo to pick 2 as the fallback.
This is just one startup failure of many. But most of the Nvidia cards work properly. AMD and Intel don't hit this.
Found memory type 4 for uniform pool No fallback, so app quits. We can't use the split device/host buffer uniforms, since our code doesn't handle the copy from device to host on uniforms. I messed up the memory_budget test trying to fix Android, so I don't have that reported above. This isn't my card or setup, it's from a user. On our next release, I'll finally have that info on some of these crashes. |
Digging into the pool allocation. VMA finds type 2 and type 4, but picks 4 since its cost is 0 vs. 1. But since this heap type is full, that doesn’t work and VMA pool allocation fails. So I think I need to set |= VMA_MEMORY_USAGE_PREFER_HOST so that it picks type 2 on the failure case. That seems to work, and falls back to type 2. Note this is all on 1.3 drivers, and so is using the budget and buffer memory requirements function. |
Doesn't the do-while loop in VmaAllocator_T::AllocateMemory() try out type=2 if type=4 fails though? You shouldn't need to explicitly set PREFER_HOST |
So our users have Nvidia 2070, 3060, 4090 gpu with low coherent memory (210 mb) failing to allocate a single 4mb uniform buffer pool. This isn't on all gpu/driver combos. And many of these like my 3070 have resizable bar. All of our other allocations are cpu or gpu only allocations. My 3070 doesn't exhibit this problem but has 8 gb of coherent memory.
The pool is one of the first things we allocate at startup. No buffers, no images, or render targets. So this OOM makes no sense to me. The only thing I can think is that the command buffers may also be allocated out of this memory. Wish there was a way to identify heap usage in Vulkan.
Found memory type 4 for uniform pool
[E] Error: ERROR_OUT_OF_DEVICE_MEMORY at call vmaCreatePool in CreateDevice, exiting app..
I reviewed the vmaCreatePool code. The only OOM error would be from vkAllocateMemory. But we barely request any memory from the COHERENT heap. There are maybe a total of this 4mb + 1mb of other allocations. I can't make this fallback to another memory type, or other parts of our engine fail with asserts.
Here are the stats on the card and heap. This driver is
Win10
GpuDevices NVIDIA GeForce RTX 2070 with Max-Q Design (0x10de:0x1f50)
DriverVersion:545.92.0.0 (0x88570000)
MemHeap0 size:7.83 gb, flags:DEVICE_LOCAL(0x1)
MemHeap0 flags:DEVICE_LOCAL(0x1)
MemHeap1 size:15.94 gb, flags:(0x0)
MemHeap1 flags:(0x0)
MemHeap1 flags:HOST_VISIBLE,HOST_COHERENT(0x6)
MemHeap1 flags:HOST_VISIBLE,HOST_COHERENT,HOST_CACHED(0xe)
MemHeap2 size:0.21 gb, flags:DEVICE_LOCAL(0x1)
MemHeap2 flags:DEVICE_LOCAL,HOST_VISIBLE,HOST_COHERENT(0x7) <- type 4
MemHeaps Local:8.04 Coherent:0.21 NonLocal:15.94
The text was updated successfully, but these errors were encountered: