| Commit message (Collapse) | Author | Age | Files | Lines |
| |
|
|
| |
llvm-svn: 327772
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
| |
Summary: This is a follow-on patch of https://reviews.llvm.org/D44210
Author: FarhanaAleen
Reviewed By: msearles
Subscribers: llvm-commits, AMDGPU
Differential Revision: https://reviews.llvm.org/D44319
llvm-svn: 327726
|
| |
|
|
|
|
|
|
|
|
|
| |
opcodes
See bug 36751: https://bugs.llvm.org/show_bug.cgi?id=36751
Differential Revision: https://reviews.llvm.org/D44529
Reviewers: artem.tamazov, arsenm
llvm-svn: 327723
|
| |
|
|
|
|
|
|
| |
of a single basic block loop. mergeInputScoreBrackets() does this for us; update it so that it processes the single bb's score bracket when processing the single bb's preds. It is, after all, a pred of itself, so it's score bracket is needed.
Differential Revision: https://reviews.llvm.org/D44434
llvm-svn: 327583
|
| |
|
|
|
|
|
|
|
|
| |
Get rid of the "; mem:" suffix and use the one we use in MIR: ":: (load 2)".
rdar://38163529
Differential Revision: https://reviews.llvm.org/D42377
llvm-svn: 327580
|
| |
|
|
|
|
|
|
|
|
| |
Since the enqueued kernels have internal linkage, their names may be dropped.
In this case, give them unique names __amdgpu_enqueued_kernel or
__amdgpu_enqueued_kernel.n where n is a sequential number starting from 1.
Differential Revision: https://reviews.llvm.org/D44322
llvm-svn: 327291
|
| |
|
|
|
|
|
|
|
| |
See bug 36252: https://bugs.llvm.org/show_bug.cgi?id=36252
Differential Revision: https://reviews.llvm.org/D43874
Reviewers: artem.tamazov, arsenm
llvm-svn: 327278
|
| |
|
|
| |
llvm-svn: 327269
|
| |
|
|
| |
llvm-svn: 327268
|
| |
|
|
| |
llvm-svn: 327267
|
| |
|
|
| |
llvm-svn: 327211
|
| |
|
|
| |
llvm-svn: 327209
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
address-space.
Summary: Starting from GCN 2nd generation, ISA supports ds_read_b128 on top of ds_read_b64.
This patch supports ds_read_b128 instruction pattern and generation of this instruction.
In the vectorizer, this patch also widen the vector length so that vectorizer generates
128 bit loads for local address-space which gets translated to ds_read_b128.
Since the performance benefit is not clear; compiler generates ds_read_b128 under -amdgpu-ds128.
Author: FarhanaAleen
Reviewed By: rampitec, arsenm
Subscribers: llvm-commits, AMDGPU
Differential Revision: https://reviews.llvm.org/D44210
llvm-svn: 327153
|
| |
|
|
| |
llvm-svn: 327147
|
| |
|
|
|
|
|
|
| |
GFX9 should select opsel version.
Differential Revision: https://reviews.llvm.org/D44279
llvm-svn: 327106
|
| |
|
|
| |
llvm-svn: 327066
|
| |
|
|
|
|
|
| |
This will likely need to be changed again for anything more than:
fmul undef, undef -> undef
llvm-svn: 327034
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Summary: GCN ISA supports instructions that can read 16 consecutive dwords from memory through the scalar data cache;
loadstoreVectorizer should take advantage of the wider vector length and pack 16/8 elements of dwords/quadwords.
Author: FarhanaAleen
Reviewed By: rampitec
Subscribers: llvm-commits, AMDGPU
Differential Revision: https://reviews.llvm.org/D44179
llvm-svn: 326910
|
| |
|
|
|
|
| |
This reverts commit ce988cc100dc65e7c6c727aff31ceb99231cab03.
llvm-svn: 326907
|
| |
|
|
| |
llvm-svn: 326904
|
| |
|
|
|
|
|
|
|
|
| |
One addrspacecast disappeared in clang emitted IR for
block invoke function due to adoption of the new
addr space mapping.
Differential Revision: https://reviews.llvm.org/D43785
llvm-svn: 326806
|
| |
|
|
| |
llvm-svn: 326715
|
| |
|
|
|
|
|
| |
As far as I can tell legalization of weird sizes for the
output type isn't implemented.
llvm-svn: 326714
|
| |
|
|
|
|
|
|
| |
dependent instruction selection.
Differential revision: https://reviews.llvm.org/D35267
llvm-svn: 326703
|
| |
|
|
| |
llvm-svn: 326589
|
| |
|
|
| |
llvm-svn: 326588
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326587
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326586
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326567
|
| |
|
|
|
|
|
|
|
| |
i16 capable ASICs do not support i16 operands for this instruction.
Add tablegen pattern to merge chained i16 additions.
Differential Revision: https://reviews.llvm.org/D43985
llvm-svn: 326535
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326534
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326533
|
| |
|
|
| |
llvm-svn: 326532
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326526
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326525
|
| |
|
|
| |
llvm-svn: 326524
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326523
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326490
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326489
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326482
|
| |
|
|
| |
llvm-svn: 326481
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326479
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326477
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326472
|
| |
|
|
| |
llvm-svn: 326471
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326470
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326468
|
| |
|
|
| |
llvm-svn: 326466
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326465
|
| |
|
|
|
|
| |
Patch by Tom Stellard
llvm-svn: 326464
|