drm/scheduler: signal scheduled fence when kill job
author Lin.Cao <lincao12@amd.com>
Thu, 15 May 2025 02:07:13 +0000 (10:07 +0800)
committer Christian König <christian.koenig@amd.com>
Thu, 22 May 2025 14:54:40 +0000 (16:54 +0200)
commit 471db2c2d4f80ee94225a1ef246e4f5011733e50
tree 655ea99bdb7492d81e7d953bd14857eda322b6e7
parent 6692dbc15e5ed40a3aa037aced65d7b8826c58cd

When an entity belonging to application B is killed, drm_sched_entity_kill()
removes all of that entity's jobs through
drm_sched_entity_kill_jobs_work(). If a job from application A depends on
the scheduled fence of one of application B's jobs, and that fence is not
signaled while the entity is being killed, application A's dependency is
never cleared.

This leads to application A hanging indefinitely while waiting for a
dependency that will never be resolved. Fix this issue by ensuring that
scheduled fences are properly signaled when an entity is killed, allowing
dependent applications to continue execution.
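
The behaviour described above can be illustrated with a small userspace
model. This is only a sketch under stated assumptions, not the kernel code:
every model_* name, the MODEL_ERR_KILLED value, and the assumption that the
kill path also signals a finished fence are hypothetical and exist purely
for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MODEL_ERR_KILLED (-1) /* hypothetical error code for a killed job */

/* Hypothetical stand-in for a fence: a signaled flag plus an error code. */
struct model_fence {
	bool signaled;
	int error;
};

/* Hypothetical stand-in for a job owned by the entity being killed. */
struct model_job {
	struct model_fence scheduled; /* what application A's job waits on */
	struct model_fence finished;  /* assumed to be signaled on kill */
};

/* Signal a fence once; later calls leave the first result untouched. */
static void model_fence_signal(struct model_fence *fence, int error)
{
	if (fence->signaled)
		return;
	fence->error = error;
	fence->signaled = true;
}

/*
 * Model of the kill path. With signal_scheduled == false the scheduled
 * fence stays unsignaled (the problem described above); with true it is
 * signaled before the finished fence (the behaviour the fix asks for).
 */
static void model_kill_job(struct model_job *job, bool signal_scheduled)
{
	assert(job != NULL);
	if (signal_scheduled)
		model_fence_signal(&job->scheduled, 0);
	model_fence_signal(&job->finished, MODEL_ERR_KILLED);
}

/* Application A can proceed only once B's scheduled fence has signaled. */
static bool model_dependency_cleared(const struct model_job *dep)
{
	return dep->scheduled.signaled;
}
```

Calling model_kill_job() with signal_scheduled set to false leaves
model_dependency_cleared() returning false forever, which corresponds to
application A hanging; with true, the dependency is cleared as soon as the
job is killed.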

Signed-off-by: Lin.Cao <lincao12@amd.com>
Reviewed-by: Philipp Stanner <phasta@kernel.org>
Signed-off-by: Christian König <christian.koenig@amd.com>
Link: https://lore.kernel.org/r/20250515020713.1110476-1-lincao12@amd.com
drivers/gpu/drm/scheduler/sched_entity.c