| Commit message (Collapse) | Author | Age | Files | Lines |
| |
|
|
|
|
|
|
|
|
| |
ARMv8.2-A adds a new ID register, ID_A64MMFR2_EL1, which behaves in the
same way as ID_A64MMFR0_EL1 and ID_A64MMFR1_EL1. It is a required part
of ARMv8.2-A, so no additional subtarget features are required.
Differential Revision: http://reviews.llvm.org/D15017
llvm-svn: 254155
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
| |
This adds subtarget features for ARMv8.2-A, which builds on (and
requires the features from) ARMv8.1-A. Most assembler-visible features
of ARMv8.2-A are system instructions, and are all required parts of the
architecture, so just depend on the HasV8_2aOps subtarget feature. There
is also one large, optional feature, which adds 16-bit floating point
versions of all existing floating-point instructions (VFP and SIMD),
this is represented by the FeatureFullFP16 subtarget feature.
Differential Revision: http://reviews.llvm.org/D15013
llvm-svn: 254154
|
| |
|
|
|
|
|
|
| |
be an indirect call.
Fixes the crasher in PR25651 and related crashers using the same pattern.
llvm-svn: 254145
|
| |
|
|
| |
llvm-svn: 254142
|
| |
|
|
|
|
|
|
|
|
| |
generated for _mm_losd_s{s,d}() intrinsics and used in scalar FMAs generated
for FMA intrinsics _mm_f{madd,msub,nmadd,nmsub}_s{s,d}().
Reviewer: David Kreitzer
Differential Revision: http://reviews.llvm.org/D14762
llvm-svn: 254140
|
| |
|
|
|
|
| |
some of the type checks in the isel matching tables.
llvm-svn: 254139
|
| |
|
|
| |
llvm-svn: 254132
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
| |
Summary:
This returns a pointer to the dispatch packet, which can be used to load
information about the kernel dispach.
Reviewers: arsenm
Subscribers: arsenm, llvm-commits
Differential Revision: http://reviews.llvm.org/D14898
llvm-svn: 254116
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This is one of the many steps to commonize value profiling support between profile
runtime and compiler/llvm tools.
After this change, profiler runtime now can share the same C APIs to do VP
serialization/deseriazation with LLVM host tools (and produces value data
in identical format between indexed and raw profile).
It is not yet enabled in profiler runtime yet.
Also added a unit test case to test runtime profile data serialization/deserialization
interfaces implemented using common closure code.
llvm-svn: 254110
|
| |
|
|
|
|
| |
Fixes PR25588.
llvm-svn: 254109
|
| |
|
|
| |
llvm-svn: 254106
|
| |
|
|
| |
llvm-svn: 254104
|
| |
|
|
| |
llvm-svn: 254100
|
| |
|
|
|
|
|
|
| |
This is a temporary fix to address ICE on 2005-10-21-longlonggtu.ll.
The proper fix will be to use A2_tfrsi, but it will need more work to
teach all users of A2_tfrsi to also expect a floating-point operand.
llvm-svn: 254099
|
| |
|
|
| |
llvm-svn: 254096
|
| |
|
|
|
|
|
|
|
|
| |
v2: added more tests, moved the SALU->VALU conversion to a separate function
It looks like it's not possible to get subregisters in the S_ABS lowering
code, and I don't feel like guessing without testing what the correct code
would look like.
llvm-svn: 254095
|
| |
|
|
|
|
|
|
|
| |
If virtual registers are created late, mappings to WebAssembly
registers need to be added explicitly. This patch adds a function
to do so and teaches WebAssemblyPeephole to use it. This fixes
an out-of-bounds access on the WARegs vector.
llvm-svn: 254094
|
| |
|
|
| |
llvm-svn: 254093
|
| |
|
|
| |
llvm-svn: 254091
|
| |
|
|
| |
llvm-svn: 254089
|
| |
|
|
|
|
|
| |
They can be loaded and stored, so count them as legal. This is
mostly to fix a number of common cases for load/store merging.
llvm-svn: 254086
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
| |
Summary:
Many target lowerings copy-paste the code to test SDValues for known constants.
This code can instead be shared in SelectionDAG.cpp, and reused in the targets.
Reviewers: MatzeB, andreadb, tstellarAMD
Subscribers: arsenm, jyknight, llvm-commits
Differential Revision: http://reviews.llvm.org/D14945
llvm-svn: 254085
|
| |
|
|
|
|
|
|
|
|
| |
Instead of trying to move ARGUMENT instructions back up to the top after
they've been scheduled or sunk down, use a fake physical register to
create a liveness constraint that prevents ARGUMENT instructions from
moving down in the first place. This is still not entirely ideal, however
it is more robust than letting them move and moving them back.
llvm-svn: 254084
|
| |
|
|
| |
llvm-svn: 254080
|
| |
|
|
| |
llvm-svn: 254079
|
| |
|
|
| |
llvm-svn: 254076
|
| |
|
|
| |
llvm-svn: 254075
|
| |
|
|
| |
llvm-svn: 254073
|
| |
|
|
| |
llvm-svn: 254071
|
| |
|
|
| |
llvm-svn: 254069
|
| |
|
|
|
|
|
|
|
| |
The e500mc does not actually support the mfocrf instruction; update the
processor definitions to reflect that fact.
Patch by Tom Rix (with some test-case cleanup by me).
llvm-svn: 254064
|
| |
|
|
|
|
|
|
|
|
| |
to a simple type when lowering a truncating store of a vector type. In this
case for an EVT we'll return Expand as we should in all of the cases anyhow.
The testcase triggered at the one in VectorLegalizer::LegalizeOp, inspection
found the rest.
llvm-svn: 254061
|
| |
|
|
|
|
|
|
|
| |
It was wrong order of operands (from intrinsic to DAG node).
I added more strict type specification for instruction selection.
Differential Revision: http://reviews.llvm.org/D14942
llvm-svn: 254059
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
methods
1. Convert serialization methods using InstrProfRecord as source into C (impl)
interfaces using Closure.
2. Reimplement InstrProfRecord serialization method to use new C interface
as dummy wrapper.
Now it is ready to implement wrapper for runtime value profile data.
(The new code need better source location -- but not changed in this patch to
minimize diffs. )
llvm-svn: 254057
|
| |
|
|
| |
llvm-svn: 254056
|
| |
|
|
| |
llvm-svn: 254055
|
| |
|
|
|
|
|
|
|
|
| |
that for unnamed globals.
This fixes parsing of forward references to unnamed aliases.
While here, remove an unnecessary isa check.
llvm-svn: 254054
|
| |
|
|
| |
llvm-svn: 254047
|
| |
|
|
|
|
|
|
|
|
| |
Reviewers: majnemer
Subscribers: llvm-commits
Differential Revision: http://reviews.llvm.org/D14857
llvm-svn: 254046
|
| |
|
|
| |
llvm-svn: 254045
|
| |
|
|
|
|
|
|
|
| |
This caused PR25607 and also caused Chromium to crash on start-up.
(Also had to update test/CodeGen/X86/avx-splat.ll, which was committed
after shrink wrapping was enabled.)
llvm-svn: 254044
|
| |
|
|
|
|
| |
test/CodeGen/X86/avg.ll.
llvm-svn: 254043
|
| |
|
|
|
|
| |
Revert the above revision due to multiple issues.
llvm-svn: 254040
|
| |
|
|
|
|
|
| |
This allows cleaner implementation and merging retrieving/mapping in
one pass.
llvm-svn: 254038
|
| |
|
|
|
|
|
|
| |
Add a simple initial heuristic to control importing based on the number
of instructions recorded in the function's summary. Add option to
control the limit, and test using option.
llvm-svn: 254036
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
When the original binary is executed and sampled, the resulting profile
contains information on the original inline stack. We currently follow
the original inline plan if we notice that the inlined callsite has more
than 0 samples to it.
A better way is to determine whether the callsite is actually worth
inlining. If the callsite accumulates a small fraction of the samples
spent in the parent function, then we don't want to bother inlining it
(as it means that the callsite is actually cold).
This patch introduces a threshold expressed in percentage of samples
in relation to the parent function. If the callsite uses less than N%
of the total samples used by its parent, the original inline decision is
not re-applied.
I've set the threshold to the very arbitrary value of 5%. I'm yet to do
any actual experiments to see what's a good value. I wanted to separate
the basic mechanism from the tuning.
llvm-svn: 254034
|
| |
|
|
|
|
| |
Fix buildbot failure for x86_64-darwin due to r254021
llvm-svn: 254028
|
| |
|
|
|
|
|
|
|
|
|
|
| |
This patch implements a minimum spanning tree (MST) based instrumentation for
PGO. The use of MST guarantees minimum number of CFG edges getting
instrumented. An addition optimization is to instrument the less executed
edges to further reduce the instrumentation overhead. The patch contains both the
instrumentation and the use of the profile to set the branch weights.
Differential Revision: http://reviews.llvm.org/D12781
llvm-svn: 254021
|
| |
|
|
| |
llvm-svn: 254020
|
| |
|
|
|
|
|
|
|
|
|
| |
r253918 had refactored expressions like "A - B.Address + C" to "A -
B.getAddressWithOffset(C)". This is incorrect, since the latter really
computes "A - B.Address - C".
None of the tests I can run locally on x86 broke due to this bug, but it
is the current suspect for breakage on the AArch64 buildbots.
llvm-svn: 254017
|