diff options
| author | Nicolai Haehnle <nhaehnle@gmail.com> | 2018-06-21 13:36:08 +0000 | 
|---|---|---|
| committer | Nicolai Haehnle <nhaehnle@gmail.com> | 2018-06-21 13:36:08 +0000 | 
| commit | b3a9b685132a19d3b5252cca6d0b5569c848451c (patch) | |
| tree | 9529b737efcdc6d87fa81f79f0b173aa62554fcf /llvm/test/CodeGen/AMDGPU | |
| parent | f2674319018151e00f603ebab34ab2155f669394 (diff) | |
| download | bcm5719-llvm-b3a9b685132a19d3b5252cca6d0b5569c848451c.tar.gz bcm5719-llvm-b3a9b685132a19d3b5252cca6d0b5569c848451c.zip  | |
AMDGPU: Add implicit def of SCC to kill and indirect pseudos
Summary:
Kill instructions sometimes do use SCC in unusual circumstances, when
v_cmpx cannot be used due to the operands that are involved.
Additionally, even if SCC was never defined by the expansion, kill pseudos
could previously occur between an s_cmp and an s_cbranch_scc, which breaks
the SCC liveness tracking when the pseudo is expanded to split the basic
block. While it would be possible to explicitly mark the SCC as live-in for
the successor basic block, it's simpler to just mark the pseudo as using SCC,
so that such a sequence is never emitted by instruction selection in the
first place.
A similar issue affects indirect source/dest pseudos in principle, although
I haven't been able to come up with a test case where it actually matters
(this affects instruction selection, so a MIR test can't be used).
Fixes: dEQP-GLES3.functional.shaders.discard.dynamic_loop_always
Change-Id: Ica8d82ecff1a763b892a1112cf1b06c948863a4f
Reviewers: arsenm, rampitec
Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits
Differential Revision: https://reviews.llvm.org/D47761
llvm-svn: 335223
Diffstat (limited to 'llvm/test/CodeGen/AMDGPU')
| -rw-r--r-- | llvm/test/CodeGen/AMDGPU/insert-skips-kill-uncond.mir | 2 | ||||
| -rw-r--r-- | llvm/test/CodeGen/AMDGPU/llvm.amdgcn.kill.ll | 20 | 
2 files changed, 21 insertions, 1 deletions
diff --git a/llvm/test/CodeGen/AMDGPU/insert-skips-kill-uncond.mir b/llvm/test/CodeGen/AMDGPU/insert-skips-kill-uncond.mir index 8f034c37efc..c842d849580 100644 --- a/llvm/test/CodeGen/AMDGPU/insert-skips-kill-uncond.mir +++ b/llvm/test/CodeGen/AMDGPU/insert-skips-kill-uncond.mir @@ -33,7 +33,7 @@ body: |    bb.1:      successors: %bb.2      $vgpr0 = V_MOV_B32_e32 0, implicit $exec -    SI_KILL_F32_COND_IMM_TERMINATOR $vgpr0, 0, 3, implicit-def $exec, implicit-def $vcc, implicit $exec +    SI_KILL_F32_COND_IMM_TERMINATOR $vgpr0, 0, 3, implicit-def $exec, implicit-def $vcc, implicit-def $scc, implicit $exec      S_BRANCH %bb.2    bb.2: diff --git a/llvm/test/CodeGen/AMDGPU/llvm.amdgcn.kill.ll b/llvm/test/CodeGen/AMDGPU/llvm.amdgcn.kill.ll index fabed795c56..bd8684de8c1 100644 --- a/llvm/test/CodeGen/AMDGPU/llvm.amdgcn.kill.ll +++ b/llvm/test/CodeGen/AMDGPU/llvm.amdgcn.kill.ll @@ -251,6 +251,26 @@ define amdgpu_ps void @test_non_inline_imm_sgpr(float inreg %a) #0 {    ret void  } +; SI-LABEL: {{^}}test_scc_liveness: +; SI: v_cmp +; SI: s_and_b64 exec +; SI: s_cmp +; SI: s_cbranch_scc +define amdgpu_ps void @test_scc_liveness() #0 { +main_body: +  br label %loop3 + +loop3:                                            ; preds = %loop3, %main_body +  %tmp = phi i32 [ 0, %main_body ], [ %tmp5, %loop3 ] +  %tmp1 = icmp sgt i32 %tmp, 0 +  call void @llvm.amdgcn.kill(i1 %tmp1) #1 +  %tmp5 = add i32 %tmp, 1 +  br i1 %tmp1, label %endloop15, label %loop3 + +endloop15:                                        ; preds = %loop3 +  ret void +} +  declare void @llvm.amdgcn.kill(i1) #0  declare void @llvm.amdgcn.exp.f32(i32, i32, float, float, float, float, i1, i1) #0  declare i1 @llvm.amdgcn.wqm.vote(i1)  | 

