Commit graph

6644 commits

Author SHA1 Message Date
netborg
fd68a08572 [dxvk] Improve handling of cpu-frametime related stutters for low-latency frame pacing
Stutters less this way because we increase the sensitivity to mark frames as outliers, so that they don't get used for predicting the next frame. The actual "optimal" threshold is still to be fine-tuned, but this one worked really well.
2025-02-28 13:53:43 +01:00
netborg
988dcdc08c [dxvk] Improve low-latency frame pacing when gpuStart gets signaled earlier than expected 2025-02-28 13:53:42 +01:00
netborg
91c6793b55 [dxvk] Disallow flush for beginLatencyTracking's emitCs
Not sure if this does anything, but better be safe to correctly track when the first succeeding Cs will get executed.
2025-02-28 08:51:31 +01:00
netborg
0d018451fd [dxvk] Optimize flush heuristic for low-latency
Possibly can be optimized more, but just changing these numbers already had a huge effect, especially for games having a small number of submissions to begin with.
2025-02-28 08:05:46 +01:00
netborg
bee72c27c8 [dxvk] Improve robustness of frame pacer initialization
In d3d9 there were situations where the first frameId was 22, although in d3d11 it always started at 17. This did cause issues especially when waiting for fences which didn't get signaled for these frameIds.
2025-02-25 20:14:43 +01:00
netborg
3720d3e0e8 [dxvk] Removed redundant frameId tracking in frame pacer 2025-02-24 13:44:26 +01:00
netborg
869cf25f7f [dxvk] Optimize GPU start measurement for low-latency frame pacing
In practice, this change affects oversubscribed threading situations where waking up the "dxvk-queue" thread potentially can cause delays in the 100s of microseconds. For a lot of situations this change isn't affecting measurements in a meaningful way. Possibly affects AMD where vkQueueSubmit execution time is non-zero.
2025-02-24 13:44:26 +01:00
netborg
c802bdf42e [dxvk] Remove dxvk.lowLatencyAllowCpuFramesOverlap config variable 2025-02-24 13:44:26 +01:00
netborg
a13b821f95 [dxvk] Allow for fps limit < 50 fps in low-latency mode 2025-02-24 13:44:17 +01:00
netborg
0f3cbd6f66 [dxvk] Improve low-latency frame pacing for cpu-limited frames 2025-02-22 18:21:02 +01:00
netborg-afps
b59a1b9083
Merge branch 'doitsujin:master' into low-latency-framepacing 2025-02-21 14:47:36 +01:00
Robin Kertels
94b48c1633 [d3d9] Slightly clean up sampler slot handling 2025-02-21 13:25:23 +01:00
Robin Kertels
f7d56886c5 [d3d9] Fix sampler slot correction not respecting dmap texture 2025-02-21 13:25:23 +01:00
Philip Rebohle
0691a7fc46 [dxvk] De-duplicate drawIndirectCount implementations
No functional change, just some code cleanup.
2025-02-20 13:30:31 +01:00
Philip Rebohle
a135e01f89 [dxvk] Unroll merged indirect draws as necessary 2025-02-20 13:30:31 +01:00
Philip Rebohle
1d8fb818fc [dxvk] Check whether pipeline has potentially hazardous stores 2025-02-20 13:30:31 +01:00
Philip Rebohle
6f7a468174 [dxvk] Fix global render pass barrier
Only need to deal with common write-after-read scenarios, we can
ignore writes since those will add extra barriers anyway. Also
move this work out of the somewhat hot pipeline bind function.
2025-02-20 13:30:31 +01:00
Philip Rebohle
dd1ca4ce59 [dxvk] Track order-invariant access ops in barrier tracker
This elides barriers between draws or dispatches if we can prove
order-invariance through atomic operations.
2025-02-20 13:30:31 +01:00
Philip Rebohle
c475960754 [dxvk] Pass store op around for barrier tracking 2025-02-20 13:30:31 +01:00
Philip Rebohle
e01a6eec3e [dxbc] Track bindings with order-invariant atomic stores 2025-02-20 13:30:31 +01:00
Philip Rebohle
d94e3633dc [dxvk] Introduce concept of order-invariant atomic stores 2025-02-20 13:30:31 +01:00
Philip Rebohle
8c7da07085 [util] Fix enum declaration 2025-02-20 13:30:31 +01:00
WinterSnowfall
22052106d8 [d3d9] Relax logging level on validateGammaRamp 2025-02-20 00:01:10 +01:00
WinterSnowfall
3716d48c89 [d3d8/9] Use numeric_limits globally 2025-02-20 00:01:10 +01:00
WinterSnowfall
317607e192 [d3d8/9] Prevent device child ref underruns on release 2025-02-20 00:01:10 +01:00
Philip Rebohle
19361c962c [dxvk] Add debug region for barrier control 2025-02-19 19:34:52 +01:00
Philip Rebohle
636669e1a5 [dxvk] Improve handling of nested debug regions 2025-02-19 19:34:52 +01:00
Philip Rebohle
b03d457ffb [dxvk,d3d11] Improve explicit UAV overlap behaviour
If the app explicitly enables UAV overlap, don't synchronize
back-to-back read-modify-write operations to the same UAV either.
2025-02-19 19:34:52 +01:00
Philip Rebohle
a2c9c0f740 [d3d11] Use resource cookies for draw buffer tracking
Avoids keeping draw buffers alive when the app stops using indirect
draws. Unlikely to have caused issues in practice, but draw buffers
are not part of the API state to begin with.
2025-02-19 19:34:52 +01:00
Philip Rebohle
07f7ccdc96 [dxvk,d3d11] Fix draw buffer tracking for DrawAuto
Not like anybody uses this feature, but we need to both check for
hazards and make sure the SO counter actually gets tracked. Use
the existing draw buffer mechanism for this.
2025-02-19 19:34:52 +01:00
Philip Rebohle
d37a13847a [dxvk] Rename and repurpose ignoreGraphicsBarriers option
Less nuclear approach that hopefully works just as well in practice.
2025-02-19 19:34:52 +01:00
Philip Rebohle
18e5c12b6d [dxvk] Fix resource hazard checks
Turns out we've had broken write-after-read checks for a while.
2025-02-19 19:34:52 +01:00
Philip Rebohle
48d145fff6 [dxvk] Change debug color for pipelines with side effects 2025-02-19 19:34:52 +01:00
Philip Rebohle
04d2609a91 [dxvk] Clean up shader resource hazard checking 2025-02-19 19:34:52 +01:00
Philip Rebohle
3bbae86ec9 [dxvk] Rework compute barrier tracking 2025-02-19 19:34:52 +01:00
Philip Rebohle
53b076be61 [dxvk] Rework graphics barrier tracking
Avoids having to insert redundant barriers when the app does UAV rendering.
2025-02-19 19:34:52 +01:00
Philip Rebohle
009f8ee356 [dxvk] Emit barriers when updating shader resources 2025-02-19 19:34:52 +01:00
Philip Rebohle
be9391ded5 [dxvk] Track transform feedback buffer access 2025-02-19 19:34:52 +01:00
Philip Rebohle
96337f11d4 [dxvk] Track vertex buffer access 2025-02-19 19:34:52 +01:00
Philip Rebohle
24b58e5858 [dxvk] Track index buffer access 2025-02-19 19:34:52 +01:00
Philip Rebohle
21eb682b39 [dxvk] Track indirect draw buffer access 2025-02-19 19:34:52 +01:00
Philip Rebohle
a7c1e7a2a0 [dxvk] Add resource flag to track graphics pipeline side effects 2025-02-19 19:34:52 +01:00
Philip Rebohle
978d7cb65b [dxvk] Add more convenience methods to track buffer barriers 2025-02-19 19:34:52 +01:00
netborg
9ed43a60a3 Revert "[dxvk] Fix lack of forward progress guarantee in presenter"
This reverts commit efeb15edbd.
2025-02-19 18:23:18 +01:00
netborg
2c26eb2118 [hud] Add render latency 2025-02-19 18:23:18 +01:00
netborg
a8bd6f069e [dxvk] Integrate frame pacing 2025-02-19 18:23:18 +01:00
netborg
01ccd1e776 [dxvk] Add low-latency frame pacing 2025-02-19 18:23:18 +01:00
netborg
13ae0d218b [dxvk] Add config variables for frame pacing 2025-02-19 18:22:56 +01:00
Philip Rebohle
23067c48c7 [dxvk] Clean up CS chunk allocation
This also makes it more robust w.r.t. alignment.
2025-02-18 15:37:11 +01:00
Philip Rebohle
4c8ee300b5 [util] Add more optimal popcnt implementation 2025-02-18 15:28:41 +01:00