path: root/llvm
Commit message | Author | Age | Files | Lines
...
* [IR][X86] Fix llvm version number in comments in AutoUpgrade. Forgot the ↵Craig Topper2017-02-241-13/+13
| | | | | | next release is 5.0 not 4.1 llvm-svn: 296092
* [AVX-512] Remove lzcnt intrinsics and autoupgrade them to generic ctlz ↵Craig Topper2017-02-248-96/+185
| | | | | | | | intrinsics with select. Clang has been emitting ctlz intrinsics for a while now. llvm-svn: 296091
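The upgrade described above can be pictured with a scalar model. This is a minimal sketch, not the AutoUpgrade code: `ctlz32`, `Vec16` and `maskedLzcnt` are hypothetical names, and the 16-lane, 32-bit layout is an assumption chosen for illustration. Each lane takes the generic count-leading-zeros result when its mask bit is set and the pass-through value otherwise.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Count leading zeros of a 32-bit value. Returns 32 for a zero input,
// i.e. the form of ctlz where zero is a defined input.
inline unsigned ctlz32(uint32_t X) {
  unsigned N = 0;
  for (uint32_t Bit = 0x80000000u; Bit != 0 && (X & Bit) == 0; Bit >>= 1)
    ++N;
  return N;
}

// Hypothetical 16 x i32 vector used only for this illustration.
using Vec16 = std::array<uint32_t, 16>;

// Scalar model of a masked lzcnt expressed as ctlz followed by a select:
// lane I is ctlz(Src[I]) when mask bit I is set, else PassThru[I].
inline Vec16 maskedLzcnt(const Vec16 &Src, const Vec16 &PassThru,
                         uint16_t Mask) {
  Vec16 Out{};
  for (std::size_t I = 0; I < Out.size(); ++I) {
    uint32_t Counted = ctlz32(Src[I]);                   // generic ctlz
    Out[I] = ((Mask >> I) & 1) ? Counted : PassThru[I];  // select on the mask
  }
  return Out;
}
```

Calling `maskedLzcnt(Src, PassThru, 0x0005)` computes the count only for lanes 0 and 2 and copies the remaining lanes from `PassThru`.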
* [AVX-512] Move lzcnt and conflict intrinsic tests to avx512cd intrinsic test ↵Craig Topper2017-02-242-79/+77
| | | | | | file since that's their feature. llvm-svn: 296090
* [AVX-512] Use update_llc_test_checks.py to generate a test.Craig Topper2017-02-241-6/+13
| | | | llvm-svn: 296089
* [Fuchsia] Use thread-pointer ABI slots for stack-protector and safe-stackPetr Hosek2017-02-2410-50/+96
| | | | | | | | | | | | The Fuchsia ABI defines slots from the thread pointer where the stack-guard value for stack-protector, and the unsafe stack pointer for safe-stack, are stored. This parallels the Android ABI support. Patch by Roland McGrath Differential Revision: https://reviews.llvm.org/D30237 llvm-svn: 296081
* Add some testcases for bitfields with illegal widths.Eli Friedman2017-02-243-0/+379
| | | | | | | | clang will generate IR like this for input using packed bitfields; very simple semantically, but it's a bit tricky to actually generate good code. llvm-svn: 296080
* Fix old testcase for dead store to match the original intent.Eli Friedman2017-02-241-9/+17
| | | | | | | The x86 backend has a special case for load+xor+store, which isn't really what this is trying to test. llvm-svn: 296077
* Fix an iterator invalidation bug when simplifying LIC user.Xin Tong2017-02-241-1/+7
| | | | | | | | LoopUnswitch/simplify-with-nonvalness.ll is the test case for this. The LIC has 2 users and deleting the 1st user when it can be simplified invalidated the iterator for the 2nd user. llvm-svn: 296069
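The class of bug is easy to reproduce outside LLVM. The sketch below is an analogy in standard C++, not the LoopUnswitch code: a `std::list<int>` stands in for the use list, and even values stand in for users that can be simplified and deleted. The point it shows is that the iterator must be advanced through the value returned by `erase()` rather than reused after the deletion.

```cpp
#include <list>

// Hypothetical stand-in for a use list: each int is a "user", and even
// values play the role of users that can be simplified away.
inline int removeSimplifiable(std::list<int> &Users) {
  int Removed = 0;
  for (auto It = Users.begin(); It != Users.end();) {
    if (*It % 2 == 0) {
      // erase() returns the next valid iterator. Touching the erased
      // iterator again would be the invalidation bug.
      It = Users.erase(It);
      ++Removed;
    } else {
      ++It;
    }
  }
  return Removed;
}
```

With two adjacent deletable users, a loop that erased the first and then incremented the stale iterator would have undefined behavior; the version above visits the second user safely.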
* [LazyMachineBFI] Add testcaseAdam Nemet2017-02-241-0/+63
| | | | | | | | | | | This is based on Justin's testcase and checks that BFI is not populated when hotness is off. This patch is meant to be applied on top of Justin's patch to enable Machine opt-remarks in the AsmPrinter (http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20170130/426595.html) Differential Revision: https://reviews.llvm.org/D29837 llvm-svn: 296065
* Revert r296060 to pacify bots.Michael Kuperstein2017-02-245-435/+13
| | | | llvm-svn: 296064
* OptDiag: Add test for r296053Justin Bogner2017-02-241-0/+15
| | | | | | Forgot to commit this with the change. llvm-svn: 296061
* [CGP] Split some critical edges coming out of indirect branchesMichael Kuperstein2017-02-245-13/+435
| | | | | | | | | | | | | | | | | | | | | | Splitting critical edges when one of the source edges is an indirectbr is hard in general (because it requires changing the memory the indirectbr reads). But if a block only has a single indirectbr predecessor (which is the common case), we can simulate splitting that edge by splitting the destination block, and retargeting the *direct* branches. This is motivated by the use of computed gotos in Python 2.7: PyEval_EvalFrame() ends up using an indirect branch with ~100 successors, and passing a constant to each of those. Since MachineSink can't break indirect critical edges on demand (and doing this in MIR doesn't look feasible), this causes us to emit about 100 defs of registers containing constants, all placed in the predecessor block, even though only one of those constants is used in each successor. So, at each computed goto, we needlessly spill about 100 constants to the stack. The end result is that a clang-compiled Python interpreter can be about 2.5x slower on a simple Python reduction loop than a gcc-compiled interpreter. Differential Revision: https://reviews.llvm.org/D29916 llvm-svn: 296060
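The idea of splitting the destination instead of the edge can be sketched on a toy CFG. This is only an illustration under stated assumptions, not the CodeGenPrepare implementation: `Block`, `CFG`, `predsOf` and `splitDestination` are hypothetical names, blocks are identified by strings, and PHI handling is left out entirely. The destination keeps its name (so the indirectbr's targets stay unchanged), its contents move to a new `.body` block, and every direct branch is retargeted to that body.

```cpp
#include <algorithm>
#include <map>
#include <string>
#include <vector>

// Toy CFG node: successor names plus a flag for an indirectbr terminator.
struct Block {
  std::vector<std::string> Succs;
  bool EndsInIndirectBr = false;
};
using CFG = std::map<std::string, Block>;

// Names of all blocks that branch to Name, in sorted order.
inline std::vector<std::string> predsOf(const CFG &G, const std::string &Name) {
  std::vector<std::string> Preds;
  for (const auto &KV : G)
    if (std::count(KV.second.Succs.begin(), KV.second.Succs.end(), Name))
      Preds.push_back(KV.first);
  return Preds;
}

// If Dest has exactly one indirectbr predecessor, split Dest into
// Dest -> Dest.body and retarget all direct branches to Dest.body.
// Afterwards Dest has a single predecessor, so the indirect edge is no
// longer critical. Returns false if the precondition does not hold.
inline bool splitDestination(CFG &G, const std::string &Dest) {
  if (G.find(Dest) == G.end())
    return false;
  int IndirectPreds = 0;
  for (const std::string &P : predsOf(G, Dest))
    if (G.at(P).EndsInIndirectBr)
      ++IndirectPreds;
  if (IndirectPreds != 1)
    return false;

  const std::string Body = Dest + ".body";  // assumed to be an unused name
  Block BodyBlock = G.at(Dest);             // body inherits Dest's terminator
  G.at(Dest).Succs = {Body};                // Dest now just falls into the body
  G.at(Dest).EndsInIndirectBr = false;
  G[Body] = BodyBlock;

  // Retarget direct branches only; the indirectbr keeps pointing at Dest.
  for (auto &KV : G)
    if (!KV.second.EndsInIndirectBr && KV.first != Dest)
      std::replace(KV.second.Succs.begin(), KV.second.Succs.end(), Dest, Body);
  return true;
}
```

In a dispatch loop where a block `op1` is reached both from the indirectbr and from a direct fallthrough, `splitDestination(G, "op1")` leaves `op1` with the indirectbr as its only predecessor, which gives a later pass a place to sink per-successor constants.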
* [GlobalISel] Use the same name for all remarks.Ahmed Bougacha2017-02-241-2/+2
| | | | | | While there, switch to the explicit ctor. llvm-svn: 296059
* [GlobalISel] Use the DISubprogram for translation failure remarks.Ahmed Bougacha2017-02-242-3/+5
| | | | | | | Justin added support for DISubprogram locs in r295531 and r296052. Use that instead of no-loc for constants and arguments. llvm-svn: 296058
* [GlobalISel] Remove now-unnecessary variable. NFC.Ahmed Bougacha2017-02-241-15/+14
| | | | | | | Since r296047, we're able to return early on failures. Don't track whether we succeeded. llvm-svn: 296057
* Fix unit tests after r296049.Adrian McCarthy2017-02-241-1/+1
| | | | llvm-svn: 296055
* OptDiag: Summarize the instruction count in asm-printerJustin Bogner2017-02-242-1/+17
| | | | | | | Add an optimization remark to asm-printer that summarizes the number of instructions emitted per function. llvm-svn: 296053
* OptDiag: Use DiagnosticLocation in MachineOptimizationRemarksJustin Bogner2017-02-241-11/+13
| | | | | | | DiagnosticInfo switched from DebugLoc to DiagnosticLocation in r295519, update these subclasses to match. llvm-svn: 296052
* [msan] Fix instrumentation of array allocas.Evgeniy Stepanov2017-02-242-6/+66
| | | | | | | Before this, MSan poisoned exactly one element of any array alloca, even if the number of elements was zero. llvm-svn: 296050
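The size computation at issue can be shown in isolation. This is a sketch of the arithmetic only, not the MSan instrumentation pass; both function names are hypothetical. The old behavior poisons one element's worth of shadow regardless of the count, while the fixed behavior scales by the number of elements.

```cpp
#include <cstddef>

// Old behavior as described in the commit message: always one element,
// even when the array alloca has zero or many elements.
inline std::size_t poisonSizeOld(std::size_t ElemSize, std::size_t /*NumElems*/) {
  return ElemSize;
}

// Fixed behavior: the poisoned region covers the whole array.
inline std::size_t poisonSizeFixed(std::size_t ElemSize, std::size_t NumElems) {
  return ElemSize * NumElems;
}
```

For a 4-byte element type, a zero-length alloca gives 4 bytes under the old rule and 0 under the fixed one; a 10-element alloca gives 4 and 40 respectively.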
* Implement some methods for NativeRawSymbolAdrian McCarthy2017-02-247-10/+31
| | | | | | | | | This makes it possible to call IPDBSession::getGlobalScope with a NativeSession and then query it for some basic fields from the PDB's InfoStream. Note that the symbols now have non-const references back to the Session so that NativeRawSymbol can access the PDBFile through the Session. Differential Revision: https://reviews.llvm.org/D30314 llvm-svn: 296049
* [GlobalISel] Don't translate other blocks when one failed.Ahmed Bougacha2017-02-231-35/+33
| | | | | | | | | | We were stopping the translation of the parent block when the translation of an instruction failed, but we were still trying to translate the other blocks of the parent function. Don't do that. llvm-svn: 296047
* [GlobalISel] Finalize translated function on scope exit. NFC.Ahmed Bougacha2017-02-231-3/+4
| | | | | | | This is the compromise between having a per-function IRTranslator and manually managing the per-function state. llvm-svn: 296046
* fix 80-column violationAdrian Prantl2017-02-231-1/+2
| | | | llvm-svn: 296045
* Delete outdated comment. NFCXin Tong2017-02-231-2/+1
| | | | llvm-svn: 296043
* LoopUnswitch - Simplify based on known not to be a constant.Xin Tong2017-02-232-3/+102
| | | | | | | | | | | | | | Summary: In case we do not know what the condition is in an unswitched loop, but we know it is definitely NOT a known constant, we can perform simplifications based on this information. Reviewers: sanjoy, hfinkel, chenli, efriedma Reviewed By: efriedma Subscribers: david2050, llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D28968 llvm-svn: 296041
* [OptDiag] Comment about the legacy status of emitOptimizationRemark*Adam Nemet2017-02-231-24/+37
| | | | | | functions llvm-svn: 296039
* [OptDiag] Remove hotness parameter from legacy remark ctorsAdam Nemet2017-02-231-21/+13
| | | | | | Anything using hotness should be using ORE. llvm-svn: 296038
* [OptDiag] Hide legacy remark ctorsAdam Nemet2017-02-232-66/+104
| | | | | | | These are only used when emitting remarks without ORE directly using the free functions emitOptimizationRemark*. llvm-svn: 296037
* [ADT] Fix zip iterator interface.Bryant Wong2017-02-232-27/+131
| | | | | | | | | | | | This commit provides `zip_{first,shortest}` with the standard member types and methods expected of iterators (e.g., `difference_type`), in order for zip to be used with other adaptors, such as `make_filter_range`. Support for reverse iteration has also been added. Differential Revision: https://reviews.llvm.org/D30246 llvm-svn: 296036
* [IR] Add an Instruction::dropPoisonGeneratingFlags helperSanjoy Das2017-02-233-0/+88
| | | | | | | | | | | | | | | | Summary: The helper will be used in a later change. This change itself is NFC since the only user of this new function is its unit test. Reviewers: majnemer, efriedma Reviewed By: efriedma Subscribers: efriedma, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D30184 llvm-svn: 296035
* [NVPTX] Added support for .f16x2 instructions.Artem Belevich2017-02-2315-153/+2343
| | | | | | | | | | | | | This patch enables support for .f16x2 operations. Added new register type Float16x2. Added support for .f16x2 instructions. Added handling of vectorized loads/stores of v2f16 values. Differential Revision: https://reviews.llvm.org/D30057 Differential Revision: https://reviews.llvm.org/D30310 llvm-svn: 296032
* ARM: make sure FastISel bails on f64 operations for Cortex-M4.Tim Northover2017-02-232-8/+75
| | | | | | | | | | | FastISel wasn't checking the isFPOnlySP subtarget feature before emitting double-precision operations, so it got completely invalid CodeGen for doubles on Cortex-M4F. The normal ISel testing wasn't spectacular either so I added a second RUN line to improve that while I was in the area. llvm-svn: 296031
* Revert r282872 "CVP. Turn marking adds as no wrap on by default"Hans Wennborg2017-02-232-1/+6
| | | | | | | | | | While not CVP's fault, this caused miscompiles (PR31181). Reverting until those are resolved. (This also reverts the follow-ups r288154 and r288161 which removed the flag.) llvm-svn: 296030
* [llvm-cov] Strip redundant path components from filenames (fix PR31982)Vedant Kumar2017-02-233-21/+57
| | | | | | | | | Instead of stripping the longest common prefix off of the filenames in a report, strip out the longest chain of redundant path components. This fixes the case in PR31982, where there are two files with the same prefix, and stripping out the LCP makes things less intelligible. llvm-svn: 296029
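The difference between a character-wise longest common prefix and a component-wise one is easiest to see on an example. The following is one plausible reading of "redundant path components", not the llvm-cov implementation: `splitPath`, `commonLeadingComponents` and `stripRedundantComponents` are hypothetical helpers, `/` is assumed as the only separator, and the file name is never stripped.

```cpp
#include <cstddef>
#include <sstream>
#include <string>
#include <vector>

// Split a '/'-separated path into its non-empty components.
inline std::vector<std::string> splitPath(const std::string &Path) {
  std::vector<std::string> Parts;
  std::stringstream SS(Path);
  std::string Part;
  while (std::getline(SS, Part, '/'))
    if (!Part.empty())
      Parts.push_back(Part);
  return Parts;
}

// Number of leading components shared by every path, never counting the
// last component (the file name) of any path.
inline std::size_t commonLeadingComponents(
    const std::vector<std::vector<std::string>> &All) {
  if (All.size() <= 1)
    return 0;  // a single path has nothing redundant
  for (std::size_t N = 0;; ++N)
    for (const auto &Parts : All)
      if (N + 1 >= Parts.size() || Parts[N] != All[0][N])
        return N;
}

// Drop the shared leading components from every path.
inline std::vector<std::string> stripRedundantComponents(
    const std::vector<std::string> &Paths) {
  std::vector<std::vector<std::string>> All;
  for (const auto &P : Paths)
    All.push_back(splitPath(P));
  const std::size_t N = commonLeadingComponents(All);

  std::vector<std::string> Out;
  for (const auto &Parts : All) {
    std::string Joined;
    for (std::size_t I = N; I < Parts.size(); ++I) {
      if (!Joined.empty())
        Joined += '/';
      Joined += Parts[I];
    }
    Out.push_back(Joined);
  }
  return Out;
}
```

For the made-up inputs `/tmp/foo/bar.c` and `/tmp/foo/baz.c`, a character-wise prefix would cut after `/tmp/foo/ba` and leave `r.c` and `z.c`, whereas the component-wise version leaves `bar.c` and `baz.c`.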
* Add call branch annotation for ICP promoted direct call in SamplePGO mode.Dehao Chen2017-02-238-24/+69
| | | | | | | | | | | | | | Summary: SamplePGO uses the branch_weight annotation to represent callsite hotness. When ICP promotes an indirect call to a direct call, we need to make sure the direct call is annotated with branch_weight in SamplePGO mode, so that the downstream function inliner can use the hot callsite heuristic. Reviewers: davidxl, eraman, xur Reviewed By: davidxl, xur Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D30282 llvm-svn: 296028
* [Hexagon] Handle saturations in Hexagon bit trackerKrzysztof Parzyszek2017-02-232-0/+71
| | | | llvm-svn: 296026
* [Hexagon] Allow setting register in BitVal without storing into mapKrzysztof Parzyszek2017-02-232-6/+13
| | | | | | | | | | | In the bit tracker, references to other bit values in which the register is 0 are prohibited. This means that generating self-referential register cells like { w:32 [0-15]:s[0-15] [16-31]:s[15] } is impossible. In order to get a self-referential cell, it had to be stored into a map and then reloaded from it. To avoid this step, add a function that will set the register to a given value without going through the map. llvm-svn: 296025
* [AMDGPU] Shut the warning "getRegUnitWeight hides overload...". NFC.Stanislav Mekhanoshin2017-02-231-0/+2
| | | | | | | Clang issues a warning about the hidden overload. Hiding it was intended, so add "using AMDGPUGenRegisterInfo::getRegUnitWeight;" to mute the warning. llvm-svn: 296021
* [ORE] Remove ORE.emit{{.+}} functionsAdam Nemet2017-02-232-194/+0
| | | | | | | Last use was killed in my previous patch. The preferred way is now to construct the remark, pipe things to it and pass it to ORE.emit. llvm-svn: 296019
* CodeGen: MachineBlockPlacement: Rename member to more general name. NFC.Kyle Butt2017-02-231-13/+11
| | | | | | | | | Rename ComputedTrellisEdges to ComputedEdges to allow for other methods of pre-computing edges. Differential Revision: https://reviews.llvm.org/D30308 llvm-svn: 296018
* [LAA] Remove unused LoopAccessReportAdam Nemet2017-02-233-64/+0
| | | | | | | The need for this was removed when I converted everything to use the opt-remark classes directly with the streaming interface. llvm-svn: 296017
* [LV] Remove unused VectorizationReportAdam Nemet2017-02-231-15/+0
| | | | | | | The need for this was removed when I converted everything to use the opt-remark classes directly with the streaming interface. llvm-svn: 296016
* Disable TLS for stack protector on Android API<17.Evgeniy Stepanov2017-02-235-28/+64
| | | | | | The TLS slot did not exist back then. llvm-svn: 296014
* [GlobalISel] Emit opt remarks on isel fallbacks.Ahmed Bougacha2017-02-239-83/+146
| | | | | | | | | | | | | | | Having more fine-grained information on the specific construct that caused us to fall back is valuable for large-scale data collection. We still have the fallback warning, which is also used for FastISel. We still need to remove the fallback warning, and teach FastISel to also emit remarks (it currently has a combination of the warning, stats, and debug prints: the remarks could unify all three). The abort-on-fallback path could also be better handled using remarks: one could imagine a "-Rpass-error", analogous to "-Werror", which would promote missed/failed remarks to errors. It's not clear whether that would be useful for other remarks though, so we're not there yet. llvm-svn: 296013
* [CodeGen] Teach opt remarks how to print MI instructions.Ahmed Bougacha2017-02-232-0/+22
| | | | | | This will be used with GISel opt remarks. llvm-svn: 296012
* [CodeGen] Print MI without a newline when skipping debugloc. NFC.Ahmed Bougacha2017-02-232-4/+12
| | | | | | | This matches the behavior for skip-operands. While there, document it. This is a follow-up to r296007. llvm-svn: 296011
* [CodeGen] Use const MBBs in the opt remark diagnostics. NFC.Ahmed Bougacha2017-02-231-5/+6
| | | | llvm-svn: 296010
* Correct register pressure calculation in presence of subregsStanislav Mekhanoshin2017-02-239-30/+153
| | | | | | | | | | If a subreg is used in an instruction, it is counted as a whole superreg for the purpose of register pressure calculation. This patch corrects that improper register pressure calculation by examining the operand's lane mask. Differential Revision: https://reviews.llvm.org/D29835 llvm-svn: 296009
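A toy model makes the over-counting concrete. This is a sketch under loud assumptions, not the RegisterPressure code: a super-register is assumed to have four lanes of unit weight each, and both function names are hypothetical. The first function mirrors the behavior being corrected (any subreg use is charged as the full super-register); the second charges only the lanes named in the mask.

```cpp
#include <bitset>
#include <cstdint>

// Hypothetical: every super-register has four lanes of weight 1.
constexpr unsigned LanesPerSuperReg = 4;

// Behavior being corrected: any use, even of one lane, counts as the
// whole super-register.
inline unsigned pressureWholeReg(uint32_t LaneMask) {
  return LaneMask != 0 ? LanesPerSuperReg : 0;
}

// Lane-mask-aware accounting: only the lanes actually used contribute.
inline unsigned pressureByLanes(uint32_t LaneMask) {
  return static_cast<unsigned>(std::bitset<32>(LaneMask).count());
}
```

A use of a single lane (mask `0x1`) is charged 4 by the first function and 1 by the second; a use of all four lanes (mask `0xF`) is charged 4 by both.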
* [ORE] Use const CodeRegions in the remark diagnostics. NFC.Ahmed Bougacha2017-02-232-14/+17
| | | | llvm-svn: 296008
* [CodeGen] Add a way to SkipDebugLoc in MachineInstr::print(). NFC.Ahmed Bougacha2017-02-232-5/+7
| | | | llvm-svn: 296007