mirror of synced 2025-03-06 20:59:54 +01:00
linux/drivers/gpu/drm/amd/amdkfd
Harish Kasiviswanathan d7e8ddc392 drm/amdkfd: Reset GPU on queue preemption failure
Currently, with F32 HWS, a GPU reset is triggered only when unmap queue fails.

However, if a compute queue doesn't respond to a preemption request in time,
unmap will return without any error. In this case, only the preemption error
is logged and a reset is not triggered. Call GPU reset in this case also.

Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Harish Kasiviswanathan <Harish.Kasiviswanathan@amd.com>
Reviewed-by: Mukul Joshi <mukul.joshi@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2024-03-27 01:44:53 -04:00
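
The control flow described in the commit message above can be sketched as a small standalone C model. This is only an illustration under stated assumptions, not the kernel implementation: `struct sim_gpu`, `sim_unmap_queues()`, `sim_request_reset()` and `sim_unmap_and_check()` are hypothetical names invented for this sketch and do not appear in kfd_device_queue_manager.c.

```c
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical stand-in for device state; not a kernel structure. */
struct sim_gpu {
    bool unmap_fails;      /* the unmap request itself returns an error */
    bool queue_preempted;  /* the queue acknowledged preemption in time */
    int  reset_count;      /* how many times a reset was requested */
};

/* Hypothetical unmap: fails only if the simulated device says so. */
static int sim_unmap_queues(const struct sim_gpu *gpu)
{
    return gpu->unmap_fails ? -1 : 0;
}

/* Hypothetical reset request: just counts invocations. */
static void sim_request_reset(struct sim_gpu *gpu)
{
    gpu->reset_count++;
}

/*
 * Unmap, then check preemption status. A reset is requested in both
 * failure cases: when unmap returns an error, and when unmap returns
 * success but the queue did not respond to the preemption request.
 */
static int sim_unmap_and_check(struct sim_gpu *gpu)
{
    int ret = sim_unmap_queues(gpu);

    if (ret) {
        sim_request_reset(gpu);
        return ret;
    }

    if (!gpu->queue_preempted) {
        /* Before the change, this case was only logged. */
        fprintf(stderr, "queue preemption failed, requesting reset\n");
        sim_request_reset(gpu);
        return -2;
    }

    return 0;
}
```

Calling `sim_unmap_and_check()` on a device whose `queue_preempted` flag is false returns a negative value and increments `reset_count`, even though the simulated unmap itself reported success.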
..
cik_event_interrupt.c drm/amdkfd: add send exception operation 2023-06-09 12:36:01 -04:00
cik_int.h drm/amdkfd: Clean up reference of radeon 2018-07-11 22:33:08 -04:00
cik_regs.h drm/amdkfd: Delete a duplicate statement in set_pasid_vmid_mapping() 2018-11-05 14:21:13 -05:00
cwsr_trap_handler.h drm/amdkfd: Use SQC when TCP would fail in gfx10.1 context save 2024-03-06 15:24:50 -05:00
cwsr_trap_handler_gfx8.asm drm/amdkfd: Add aldebaran trap handler support 2021-03-10 00:02:24 -05:00
cwsr_trap_handler_gfx9.asm drm/amdkfd: Use S_ENDPGM_SAVED in trap handler 2024-01-29 15:38:20 -05:00
cwsr_trap_handler_gfx10.asm drm/amdkfd: Use SQC when TCP would fail in gfx10.1 context save 2024-03-06 15:24:50 -05:00
Kconfig drm/amdkfd: drop IOMMUv2 support 2023-08-11 14:47:25 -04:00
kfd_chardev.c drm/amdkfd: Check cgroup when returning DMABuf info 2024-03-22 15:47:56 -04:00
kfd_crat.c drm/amdkfd: add KFD support for GC 11.5.1 2024-02-22 10:27:10 -05:00
kfd_crat.h drm/amdkfd: Add cache line sizes to KFD topology 2024-02-07 12:25:51 -05:00
kfd_debug.c drm/amdkfd: Fix variable dereferenced before NULL check in 'kfd_dbg_trap_device_snapshot()' 2024-01-15 18:35:37 -05:00
kfd_debug.h drm/amdkfd: fix and enable ttmp setup for gfx11 2023-07-27 14:59:29 -04:00
kfd_debugfs.c drm/amdkfd: Introduce kfd_node struct (v5) 2023-06-09 09:42:27 -04:00
kfd_device.c drm/amdkfd: Increase the size of the memory reserved for the TBA 2024-03-06 15:24:49 -05:00
kfd_device_queue_manager.c drm/amdkfd: Reset GPU on queue preemption failure 2024-03-27 01:44:53 -04:00
kfd_device_queue_manager.h drm/amdkfd: rename device_queue_manager_init_v10_navi10() 2023-08-11 14:47:39 -04:00
kfd_device_queue_manager_cik.c drm/amdkfd: drop IOMMUv2 support 2023-08-11 14:47:25 -04:00
kfd_device_queue_manager_v9.c drm/amdkfd: drop IOMMUv2 support 2023-08-11 14:47:25 -04:00
kfd_device_queue_manager_v10.c drm/amdkfd: rename device_queue_manager_init_v10_navi10() 2023-08-11 14:47:39 -04:00
kfd_device_queue_manager_v11.c drm/amdkfd: Add KFD support for soc21 v3 2022-05-04 10:43:54 -04:00
kfd_device_queue_manager_vi.c drm/amdkfd: drop IOMMUv2 support 2023-08-11 14:47:25 -04:00
kfd_doorbell.c drm/amdkfd: get doorbell's absolute offset based on the db_size 2023-10-09 17:02:34 -04:00
kfd_events.c drm/amdgpu: Add log info for umc_v12_0 2024-01-22 17:13:25 -05:00
kfd_events.h drm/amdkfd: add event_age tracking when receiving interrupt 2023-06-15 11:37:55 -04:00
kfd_flat_memory.c drm/amdkfd: Relocate TBA/TMA to opposite side of VM hole 2024-02-16 15:41:50 -05:00
kfd_int_process_v9.c drm/amdkfd: range check cp bad op exception interrupts 2024-03-22 15:54:42 -04:00
kfd_int_process_v10.c drm/amdkfd: range check cp bad op exception interrupts 2024-03-22 15:54:42 -04:00
kfd_int_process_v11.c drm/amdkfd: range check cp bad op exception interrupts 2024-03-22 15:54:42 -04:00
kfd_interrupt.c drm/amdkfd: Cleanup workqueue during module unload 2024-03-22 15:54:48 -04:00
kfd_kernel_queue.c drm/amdkfd: Skip packet submission on fatal error 2024-02-26 11:14:31 -05:00
kfd_kernel_queue.h drm/amdkfd: Skip packet submission on fatal error 2024-02-26 11:14:31 -05:00
kfd_migrate.c drm/amdkfd: Correct partial migration virtual addr 2024-01-29 15:36:47 -05:00
kfd_migrate.h drm/amdkfd: Use partial migrations/mapping for GPU/CPU page faults in SVM 2023-12-06 15:22:32 -05:00
kfd_module.c drm/amdkfd: Fixed kfd_process cleanup on module exit. 2023-03-15 18:45:26 -04:00
kfd_mqd_manager.c drm/amdkfd: Check preemption status on all XCDs 2024-03-20 13:38:12 -04:00
kfd_mqd_manager.h drm/amdkfd: Check preemption status on all XCDs 2024-03-20 13:38:12 -04:00
kfd_mqd_manager_cik.c drm/amdkfd: Check preemption status on all XCDs 2024-03-20 13:38:12 -04:00
kfd_mqd_manager_v9.c drm/amdkfd: Check preemption status on all XCDs 2024-03-20 13:38:12 -04:00
kfd_mqd_manager_v10.c drm/amdkfd: Check preemption status on all XCDs 2024-03-20 13:38:12 -04:00
kfd_mqd_manager_v11.c drm/amdkfd: Check preemption status on all XCDs 2024-03-20 13:38:12 -04:00
kfd_mqd_manager_vi.c drm/amdkfd: Check preemption status on all XCDs 2024-03-20 13:38:12 -04:00
kfd_packet_manager.c drm/amdkfd: Skip packet submission on fatal error 2024-02-26 11:14:31 -05:00
kfd_packet_manager_v9.c drm/amdgpu: Use function for IP version check 2023-09-20 12:23:28 -04:00
kfd_packet_manager_vi.c drm/amdgpu: add configurable grace period for unmap queues 2023-06-09 12:35:31 -04:00
kfd_pasid.c drm/amdkfd: update SPDX license header 2022-02-14 15:08:40 -05:00
kfd_pm4_headers.h drm/amdkfd: update SPDX license header 2022-02-14 15:08:40 -05:00
kfd_pm4_headers_ai.h drm/amdkfd: prepare map process for single process debug devices 2023-06-09 12:35:36 -04:00
kfd_pm4_headers_aldebaran.h drm/amdkfd: update SPDX license header 2022-02-14 15:08:40 -05:00
kfd_pm4_headers_vi.h drm/amdkfd: Fix leftover errors and warnings 2022-02-14 15:08:40 -05:00
kfd_pm4_opcodes.h drm/amdkfd: update SPDX license header 2022-02-14 15:08:40 -05:00
kfd_priv.h drm/amdkfd: fix TLB flush after unmap for GFX9.4.2 2024-03-22 15:51:49 -04:00
kfd_process.c drm/amdkfd: Fix sparse __rcu annotation warnings 2024-01-09 15:44:13 -05:00
kfd_process_queue_manager.c drm/amdkfd: update SIMD distribution algo for GFXIP 9.4.2 onwards 2024-02-14 17:15:26 -05:00
kfd_queue.c drm/amdkfd: update SPDX license header 2022-02-14 15:08:40 -05:00
kfd_smi_events.c drm/amdgpu: change vm->task_info handling 2024-03-04 15:59:08 -05:00
kfd_smi_events.h drm/amdkfd: Update SMI events for GFX9.4.3 2023-06-09 09:46:31 -04:00
kfd_svm.c drm/amdgpu: Auto-validate DMABuf imports in compute VMs 2024-01-15 18:35:35 -05:00
kfd_svm.h drm/amdkfd: Use partial migrations/mapping for GPU/CPU page faults in SVM 2023-12-06 15:22:32 -05:00
kfd_topology.c drm/amdkfd: Add partition id field to location_id 2024-02-26 11:15:38 -05:00
kfd_topology.h drm/amdkfd: Update cache info reporting for GFX v9.4.3 2023-09-11 18:17:20 -04:00
Makefile drm/amdkfd: drop IOMMUv2 support 2023-08-11 14:47:25 -04:00
soc15_int.h drm/amdkfd: Update interrupt handling for GFX 9.4.3 2023-06-30 13:11:35 -04:00