drm/amdgpu: Direct ret in ras_reset_err_cnt on VF
authorEllen Pan <yunru.pan@amd.com>
Fri, 11 Apr 2025 02:12:24 +0000 (22:12 -0400)
committerAlex Deucher <alexander.deucher@amd.com>
Fri, 11 Apr 2025 21:01:00 +0000 (17:01 -0400)
With adding sriov_vf check, we directly return EOPNOTSUPP in
ras_reset_error_count as we should not do anything on VF to reset RAS error
count.

This also fixes the issue that loading guest driver causes register
violations.

Reviewed-by: Ahmad Rehman <Ahmad.Rehman@amd.com>
Signed-off-by: Ellen Pan <yunru.pan@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c

index 5bb7673fd28eed19be4fc37fbd3eb160dba43b0d..17f0911ee7e917d9d557f4b7505345e89249ebb3 100644 (file)
@@ -1498,6 +1498,9 @@ int amdgpu_ras_reset_error_count(struct amdgpu_device *adev,
            !amdgpu_ras_get_aca_debug_mode(adev))
                return -EOPNOTSUPP;
 
+       if (amdgpu_sriov_vf(adev))
+               return -EOPNOTSUPP;
+
        /* skip ras error reset in gpu reset */
        if ((amdgpu_in_reset(adev) || amdgpu_ras_in_recovery(adev)) &&
            ((smu_funcs && smu_funcs->set_debug_mode) ||