Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | [AMDGPU] Turn on the new waitcnt insertion pass. Adjust tests. | Mark Searles | 2017-06-02 | 1 | -2/+1 |
| | | | | | | | | | -enable-si-insert-waitcnts=1 becomes the default -enable-si-insert-waitcnts=0 to use old pass Differential Revision: https://reviews.llvm.org/D33730 llvm-svn: 304551 | ||||
* | AMDGPU: Unify divergent function exits. | Matt Arsenault | 2017-03-24 | 1 | -6/+11 |
| | | | | | | | | | | StructurizeCFG can't handle cases with multiple returns creating regions with multiple exits. Create a copy of UnifyFunctionExitNodes that only unifies exit nodes that skips exit nodes with uniform branch sources. llvm-svn: 298729 | ||||
* | Enable FeatureFlatForGlobal on Volcanic Islands | Matt Arsenault | 2017-01-24 | 1 | -1/+1 |
| | | | | | | | | | | | This switches to the workaround that HSA defaults to for the mesa path. This should be applied to the 4.0 branch. Patch by Vedran Miletić <vedran@miletic.net> llvm-svn: 292982 | ||||
* | [AMDGPU] Fix multiple vreg definitions in si-lower-control-flow | Stanislav Mekhanoshin | 2016-11-22 | 1 | -1/+1 |
| | | | | | | Differential Revision: https://reviews.llvm.org/D26939 llvm-svn: 287608 | ||||
* | AMDGPU: Fix use-after-free in SIOptimizeExecMasking | Nicolai Haehnle | 2016-10-07 | 1 | -0/+39 |
Summary: There was a bug with sequences like s_mov_b64 s[0:1], exec s_and_b64 s[2:3]<def>, s[0:1], s[2:3]<kill> ... s_mov_b64_term exec, s[2:3] because s[2:3] was defined and used in the same instruction, ending up with SaveExecInst inside OtherUseInsts. Note that the test case also exposes an unrelated bug. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98028 Reviewers: tstellarAMD, arsenm Subscribers: kzhuravl, wdng, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D25306 llvm-svn: 283528 |