drm/amdgpu: Do core dump immediately when job tmo
authorTrigger Huang <Trigger.Huang@amd.com>
Mon, 19 Aug 2024 08:04:52 +0000 (16:04 +0800)
committerAlex Deucher <alexander.deucher@amd.com>
Thu, 29 Aug 2024 17:39:00 +0000 (13:39 -0400)
commitc67db6a6a6be4bb1db1b0fd5b24040d68e461cb1
treef3382b825641f2b28cf5fc6f5e6c4e07f780e525
parent6122f5c72e38a88eda13c7168e2ebbd3bd80b681
drm/amdgpu: Do core dump immediately when job tmo

Do the coredump immediately after a job timeout to get a closer
representation of GPU's error status.

V2: This will skip printing vram_lost as the GPU reset is not
happened yet (Alex)

V3: Unconditionally call the core dump as we care about all the reset
functions(soft-recovery and queue reset and full adapter reset, Alex)

V4: Do the dump after adev->job_hang = true (Sunil)

Signed-off-by: Trigger Huang <Trigger.Huang@amd.com>
Acked-by: Sunil Khatri <sunil.khatri@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/amdgpu_job.c