bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[Utils] Avoid a hash table lookup in salvageDI, NFC	Vedant Kumar	2018-02-22	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \|	According to the current coverage report salvageDebugInfo() is called 5.12 million times during testing and almost always returns early. The early return depends on LocalAsMetadata::getIfExists returning null, which involves a DenseMap lookup in an LLVMContextImpl. We can probably speed this up by simply checking the IsUsedByMD bit in Value. llvm-svn: 325738
*	[X86][MMX] Generlize MMX_MOVD64rr combines to accept v4i16/v8i8 build ↵	Simon Pilgrim	2018-02-21	1	-7/+17
\| \| \| \| \| \| \| \|	vectors as well as v2i32 Also handle both cases where the lower 32-bits of the MMX is undef or zero extended. llvm-svn: 325736
*	bpf: disable DwarfUsesRelocationsAcrossSections	Yonghong Song	2018-02-21	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The pahole does not work with BPF backend properly: -bash-4.2$ cat test.c struct test_t { int a; int b; }; int test(struct test_t s) { return s->a; } -bash-4.2$ clang -g -O2 -target bpf -c test.c -bash-4.2$ pahole test.o struct clang version 7.0.0 (trunk 325446) (llvm/trunk 325464) { clang version 7.0.0 (trunk 325446) (llvm/trunk 325464) clang version 7.0.0 (trunk 325446) (llvm/trunk 325464); / 0 4 / clang version 7.0.0 (trunk 325446) (llvm/trunk 325464) clang version 7.0.0 (trunk 325446) (llvm/trunk 325464); / 4 4 / / size: 8, cachelines: 1, members: 2 / / last cacheline: 8 bytes / }; -bash-4.2$ The reason is that BPF backend is not yet implemented in elfutils backend https://github.com/threatstack/elfutils/tree/master/backends and pahole depends on elfutils for dwarf parsing and resolving relocation. More specifically, the unsupported relocation in .debug_info for type/member name against symbol table caused the incorrect result above. The following is the raw .rel.debug_info for the above example, Hex dump of section '.rel.debug_info': 0x00000000 06000000 00000000 0a000000 0b000000 ................ 0x00000010 0c000000 00000000 0a000000 01000000 ................ 0x00000020 12000000 00000000 0a000000 02000000 ................ 0x00000030 16000000 00000000 0a000000 0e000000 ................ 0x00000040 1a000000 00000000 0a000000 03000000 ................ ----------------- -------- -------- reloc location type symtab index Hex dump of section '.debug_info': 0x00000000 7b000000 04000000 00000801 00000000 {............... 0x00000010 0c000000 00000000 00000000 00000000 ................ 0x00000020 00000000 00001000 00000200 00000000 ................ Based on "type", the proper value will be extracted from symbol table and filled in .debug_info so later on .debug_info can be properly resolved against debug strings. There are two ways to fix this problem. One is to fix elfutils by adding BPF support which is desirable. This could take a long time and won't work with already deployed pahole. For a short term workaround, we can disable dwarf cross-section relation which specifically avoids debug_info and symbol table cross relocation. This should help any dwarf-related tool which has not implement BPF specific relocations yet. Now .rel.debug_info does not have any relocation for symbol table and .debug_info itself contains necessary relocation information by itself. Hex dump of section '.debug_info': 0x00000000 7b000000 04000000 00000801 00000000 {............... 0x00000010 0c003700 00000000 00003e00 00000000 ..7.......>..... 0x00000020 00000000 00001000 00000200 00000000 ................ location 0xc has 0, 0x12 has 0x37, 0x1a has 0x3e in place which will be used in relocation resolution. Here, the values of 0, 0x37 and 0x3e are offset in .debug_str section. Please note the difference between two above .debug_info dumps. With the fix, pahole works properly with BPF backend: -bash-4.2$ clang -O2 -g -target bpf -c test.c -bash-4.2$ pahole test.o struct test_t { int a; / 0 4 / int b; / 4 4 / / size: 8, cachelines: 1, members: 2 / / last cacheline: 8 bytes */ }; Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 325735
*	Resubmit r325107 (case folding DJB hash)	Pavel Labath	2018-02-21	3	-2/+821
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The issue was that the has function was generating different results depending on the signedness of char on the host platform. This commit fixes the issue by explicitly using an unsigned char type to prevent sign extension and adds some extra tests. The original commit message was: This patch implements a variant of the DJB hash function which folds the input according to the algorithm in the Dwarf 5 specification (Section 6.1.1.4.5), which in turn references the Unicode Standard (Section 5.18, "Case Mappings"). To achieve this, I have added a llvm::sys::unicode::foldCharSimple function, which performs this mapping. The implementation of this function was generated from the CaseMatching.txt file from the Unicode spec using a python script (which is also included in this patch). The script tries to optimize the function by coalescing adjecant mappings with the same shift and stride (terms I made up). Theoretically, it could be made a bit smarter and merge adjecant blocks that were interrupted by only one or two characters with exceptional mapping, but this would save only a couple of branches, while it would greatly complicate the implementation, so I deemed it was not worth it. Since we assume that the vast majority of the input characters will be US-ASCII, the folding hash function has a fast-path for handling these, and only whips out the full decode+fold+encode logic if we encounter a character outside of this range. It might be possible to implement the folding directly on utf8 sequences, but this would also bring a lot of complexity for the few cases where we will actually need to process non-ascii characters. Reviewers: JDevlieghere, aprantl, probinson, dblaikie Subscribers: mgorny, hintonda, echristo, clayborg, vleschuk, llvm-commits Differential Revision: https://reviews.llvm.org/D42740 llvm-svn: 325732
*	[Hexagon] Add TargetRegisterInfo::getPointerRegClass() override	Tobias Edler von Koch	2018-02-21	2	-0/+9
\| \| \| \|	llvm-svn: 325731
*	[InstCombine] add and use Create*FMF functions; NFC	Sanjay Patel	2018-02-21	1	-15/+7
\| \| \| \|	llvm-svn: 325730
*	[ORC] Switch to shared_ptr ownership for SymbolSources in VSOs.	Lang Hames	2018-02-21	1	-48/+59
\| \| \| \| \| \| \|	This makes it easy to free a SymbolSource (and any related resources) when the last reference in a VSO is dropped. llvm-svn: 325727
*	[ORC] Switch RTDyldObjectLinkingLayer to take a unique_ptr<MemoryBuffer> rather	Lang Hames	2018-02-21	2	-23/+15
\| \| \| \| \| \| \| \| \| \|	than a shared ObjectFile/MemoryBuffer pair. There's no need to pre-parse the buffer into an ObjectFile before passing it down to the linking layer, and moving the parsing into the linking layer allows us remove the parsing code at each call site. llvm-svn: 325725
*	Revert "[IRMover] Implement name based structure type mapping"	Rafael Espindola	2018-02-21	1	-14/+7
\| \| \| \| \| \| \| \|	This reverts commit r325686. There was a misunderstanding and this has not been approved yet. llvm-svn: 325715
*	[hwasan] Fix inline instrumentation.	Evgeniy Stepanov	2018-02-21	1	-5/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch changes hwasan inline instrumentation: Fixes address untagging for shadow address calculation (use 0xFF instead of 0x00 for the top byte). Emits brk instruction instead of hlt for the kernel and user space. Use 0x900 instead of 0x100 for brk immediate (0x100 - 0x800 are unavailable in the kernel). Fixes and adds appropriate tests. Patch by Andrey Konovalov. Differential Revision: https://reviews.llvm.org/D43135 llvm-svn: 325711
*	Handle IMAGE_REL_AMD64_ADDR32NB in RuntimeDyldCOFF	Frederich Munch	2018-02-21	1	-21/+94
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: IMAGE_REL_AMD64_ADDR32NB relocations are currently set to zero in all cases. This patch sets the relocation to the correct value when possible and shows an error when not. Reviewers: enderby, lhames, compnerd Reviewed By: compnerd Subscribers: LepelTsmok, compnerd, martell, llvm-commits Differential Revision: https://reviews.llvm.org/D30709 llvm-svn: 325700
*	[Hexagon] Return true in enableMultipleCopyHints().	Jonas Paulsson	2018-02-21	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	Enable multiple COPY hints to eliminate more COPYs during register allocation. Note that this is something all targets should do, see https://reviews.llvm.org/D38128. Review: Krzysztof Parzyszek llvm-svn: 325697
*	[X86] LowerBITCAST - pull out repeated calls to getOperand(0). NFCI.	Simon Pilgrim	2018-02-21	1	-8/+7
\| \| \| \|	llvm-svn: 325695
*	[Sparc] Include __tls_get_addr in symbol table for TLS calls to it	Jonas Devlieghere	2018-02-21	1	-2/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Global Dynamic and Local Dynamic call relocations only implicitly reference __tls_get_addr; there is no connection in the ELF file between the relocations and the symbol other than the specification for the relocations' semantics. However, it still needs to be in the symbol table despite the lack of explicit references to the symbol table entry, since it needs to be bound at link time for these relocations, otherwise any objects will fail to link. For details, see https://sourceware.org/bugzilla/show_bug.cgi?id=22832. Path by: James Clarke (jrtc27) Differential revision: https://reviews.llvm.org/D43271 llvm-svn: 325688
*	[SCEV] Temporarily disable loop versioning for the purpose	Silviu Baranga	2018-02-21	1	-0/+7
\| \| \| \| \| \| \| \| \| \|	of turning SCEVUnknowns of PHIs into AddRecExprs. This feature is now hidden behind the -scev-version-unknown flag. Fixes PR36032 and PR35432. llvm-svn: 325687
*	[IRMover] Implement name based structure type mapping	Eugene Leviant	2018-02-21	1	-7/+14
\| \| \| \| \| \|	Differential revision: https://reviews.llvm.org/D43199 llvm-svn: 325686
*	AMDGPU: Do not combine loads/store across physreg defs	Nicolai Haehnle	2018-02-21	1	-1/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Since this pass operates on machine SSA form, this should only really affect M0 in practice. Fixes various piglit variable-indexing/vs-varying-array-mat4-index-* Change-Id: Ib2a1dc3a8d7b08225a8da49a86f533faa0986aa8 Fixes: r317751 ("AMDGPU: Merge S_BUFFER_LOAD_DWORD_IMM into x2, x4") Reviewers: arsenm, mareko, rampitec Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D40343 llvm-svn: 325677
*	[AMDGPU][MC] Added lds support for MUBUF instructions	Dmitry Preobrazhensky	2018-02-21	4	-54/+168
\| \| \| \| \| \| \| \| \|	See bug 28234: https://bugs.llvm.org/show_bug.cgi?id=28234 Differential Revision: https://reviews.llvm.org/D43472 Reviewers: vpykhtin, artem.tamazov, arsenm llvm-svn: 325676
*	[BDCE] Salvage debug info from dying insts	Vedant Kumar	2018-02-21	1	-0/+2
\| \| \| \| \| \| \| \|	This results in 15 additional unique source variables in a stage2 build of FileCheck (at '-Os -g'), with a negligible increase in the size of the .debug_loc section. llvm-svn: 325660
*	[X86] Disable CLWB for Cannon Lake	Craig Topper	2018-02-21	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \|	Cannon Lake does not support CLWB, therefore it does not include all features listed under SKX anymore. Instead, enumerate all SKX features with the exception of CLWB. Patch by Gabor Buella Differential Revision: https://reviews.llvm.org/D43380 llvm-svn: 325654
*	[mips] Spectre variant two mitigation for MIPSR2	Simon Dardis	2018-02-21	14	-38/+206
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch provides mitigation for CVE-2017-5715, Spectre variant two, which affects the P5600 and P6600. It implements the LLVM part of -mindirect-jump=hazard. It is _not_ enabled by default for the P5600. The migitation strategy suggested by MIPS for these processors is to use hazard barrier instructions. 'jalr.hb' and 'jr.hb' are hazard barrier variants of the 'jalr' and 'jr' instructions respectively. These instructions impede the execution of instruction stream until architecturally defined hazards (changes to the instruction stream, privileged registers which may affect execution) are cleared. These instructions in MIPS' designs are not speculated past. These instructions are used with the attribute +use-indirect-jump-hazard when branching indirectly and for indirect function calls. These instructions are defined by the MIPS32R2 ISA, so this mitigation method is not compatible with processors which implement an earlier revision of the MIPS ISA. Performance benchmarking of this option with -fpic and lld using -z hazardplt shows a difference of overall 10%~ time increase for the LLVM testsuite. Certain benchmarks such as methcall show a substantially larger increase in time due to their nature. Reviewers: atanasyan, zoran.jovanovic Differential Revision: https://reviews.llvm.org/D43486 llvm-svn: 325653
*	[InstCombine] C / -X --> -C / X	Sanjay Patel	2018-02-21	1	-8/+17
\| \| \| \| \| \| \| \| \|	We already do this in DAGCombiner, but it should also be good to eliminate the fsub use in IR. This is similar to rL325648. llvm-svn: 325649
*	[InstCombine] -X / C --> X / -C for FP	Sanjay Patel	2018-02-20	1	-5/+12
\| \| \| \| \| \| \|	We already do this in DAGCombiner, but it should also be good to eliminate the fsub use in IR. llvm-svn: 325648
*	Revert "[AMDGPU] Increased vector length for global/constant loads."	Konstantin Zhuravlyov	2018-02-20	2	-34/+2
\| \| \| \| \| \| \| \| \| \|	https://reviews.llvm.org/rL325518 It breaks following OpenCL conformance tests: - Basic - parameter_types - Basic - vload_private llvm-svn: 325643
*	[DSE] Don't DSE stores that subsequent memmove calls read from	Sanjoy Das	2018-02-20	1	-16/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: We used to remove the first memmove in cases like this: memmove(p, p+2, 8); memmove(p, p+2, 8); which is incorrect. Fix this by changing isPossibleSelfRead to what was most likely the intended behavior. Historical note: the buggy code was added in https://reviews.llvm.org/rL120974 to address PR8728. Reviewers: rsmith Subscribers: mcrosier, llvm-commits, jlebar Differential Revision: https://reviews.llvm.org/D43425 llvm-svn: 325641
*	[PBQP] Fix PR33038 by pruning empty intervals in initializeGraph.	Lang Hames	2018-02-20	1	-11/+27
\| \| \| \| \| \| \| \|	Spilling may cause previously non-empty intervals (both for the spilled vreg and others) to become empty. Moving the pruning into initializeGraph catches these cases and fixes PR33038. llvm-svn: 325632
*	[MemoryBuiltins] Check nobuiltin status when identifying calls to free.	Benjamin Kramer	2018-02-20	1	-10/+8
\| \| \| \| \| \| \| \|	This is usually not a problem because this code's main purpose is eliminating unused new/delete pairs. We got deletes of nullptr or nobuiltin deletes of builtin new wrong though. llvm-svn: 325630
*	[InstCombine] remove unneeded operand swap: NFCI	Sanjay Patel	2018-02-20	1	-3/+0
\| \| \| \| \| \| \|	FMul is commutative, so complexity-based canonicalization should always take care of the swap via SimplifyAssociativeOrCommutative(). llvm-svn: 325628
*	[SelectionDAG] Support known true/false SimplifySetCC cases for comparing ↵	Craig Topper	2018-02-20	1	-58/+87
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	against vector splats of constants. This is split off from D42948 and includes just the cases that constant fold to true or false. It also includes some refactoring to keep predicate checks together. This supports things like (setcc uge X, 0) -> true Differential Revision: https://reviews.llvm.org/D43489 llvm-svn: 325627
*	[AArch64] Refactor instructions using SIMD immediates	Evandro Menezes	2018-02-20	1	-368/+281
\| \| \| \| \| \| \| \| \| \| \|	Get rid of icky goto loops and make the code easier to maintain. Otherwise, NFC. Restore r324903 and fix PR36369. Differentail revision: https://reviews.llvm.org/D43364 llvm-svn: 325621
*	[LTO] Remove unused Path parameter to AddBufferFn	Teresa Johnson	2018-02-20	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: With D43396, no clients use the Path parameter anymore. Depends on D43396. Reviewers: pcc Subscribers: mehdi_amini, inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D43400 llvm-svn: 325619
*	[ARM] Lower BR_CC for f16	Sjoerd Meijer	2018-02-20	1	-2/+1
\| \| \| \| \| \| \| \|	This case wasn't handled yet. Differential Revision: https://reviews.llvm.org/D43508 llvm-svn: 325616
*	[Hexagon] Handle *Low8 register classes in early if-conversion	Krzysztof Parzyszek	2018-02-20	1	-0/+2
\| \| \| \|	llvm-svn: 325606
*	[X86] Correct SHRUNKBLEND creation to work correctly when there are multiple ↵	Craig Topper	2018-02-20	1	-31/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	uses of the condition. SimplifyDemandedBits forces the demanded mask to all 1s if the node has multiple uses, unless the AssumeSingleUse flag is set. So previously we were only really likely to simplify something if the condition had a single use. And on the off chance we did simplify with multiple uses the demanded mask being used was all ones so there was no reason to create a shrunkblend. This patch now checks that the condition is only used by selects first, and then sets the AssumeSingleUse flag for the simplifcation. Then we convert the selects to shrunkblend, and finally replace condition. Differential Revision: https://reviews.llvm.org/D43446 llvm-svn: 325604
*	[SelectionDAG] Add LegalTypes flag to getShiftAmountTy. Use it to unify and ↵	Craig Topper	2018-02-20	3	-19/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	simplify DAGCombiner and simplifySetCC code and fix a bug. DAGCombiner and SimplifySetCC both use getPointerTy for shift amounts pre-legalization. DAGCombiner uses a single helper function to hide this. SimplifySetCC does it in multiple places. This patch adds a defaulted parameter to getShiftAmountTy that can make it return getPointerTy for scalar types. Use this parameter to simplify the SimplifySetCC and DAGCombiner. Additionally, there were two places in SimplifySetCC that were creating shifts using the target's preferred shift amount pre-legalization. If the target uses a narrow type and the type is illegal, this can cause SimplfiySetCC to create a shift with an amount that can't represent all possible shift values for the type. To fix this we should use pointer type there too. Alternatively we could make getScalarShiftAmountTy for each target return a safe value for large types as proposed in D43445. And maybe we should still do that, but fixing the SimplifySetCC code keeps other targets from tripping over this in the future. Fixes PR36250. Differential Revision: https://reviews.llvm.org/D43449 llvm-svn: 325602
*	[X86] Promote 16-bit cmovs to 32-bits	Craig Topper	2018-02-20	1	-3/+54
\| \| \| \| \| \| \| \| \| \|	This allows us to avoid an opsize prefix. And forcing some move immediates to i32 avoids a length changing prefix on those instructions. This mostly replaces the existing combine we had for zext/sext+cmov of constants. I left in a case for sign extending a 32 bit cmov of constants to 64 bits. Differential Revision: https://reviews.llvm.org/D43327 llvm-svn: 325601
*	[InstCombine] remove unneeded dyn_cast to prevent unused variable warning	Sanjay Patel	2018-02-20	1	-2/+1
\| \| \| \|	llvm-svn: 325597
*	[InstCombine] remove compound fdiv pattern folds	Sanjay Patel	2018-02-20	1	-27/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	These are fdiv-with-constant-divisor, so they already become reciprocal multiplies. The last gap for vector ops should be closed with rL325590. It's possible that we're missing folds for some edge cases with denormal intermediate constants after deleting these, but there are no tests for those patterns, and it would be better to handle denormals more consistently (and less conservatively) as noted in TODO comments. llvm-svn: 325595
*	[InstCombine] fold fdiv with non-splat divisor to fmul: X/C --> X * (1/C)	Sanjay Patel	2018-02-20	2	-21/+29
\| \| \| \|	llvm-svn: 325590
*	[mips] Correct the definition of cvt.d.w	Simon Dardis	2018-02-20	1	-3/+2
\| \| \| \| \| \| \| \|	An upcoming patch D41434, changes the ordering of the matcher table for assembly. This patch corrects the definition of the normal MIPS cvt.d.w not to be available in microMIPS. llvm-svn: 325589
*	[DEBUGINFO] Add support for emission of the inlined strings.	Alexey Bataev	2018-02-20	3	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Patch adds an option for emission of inlined strings rather than .debug_str section. Reviewers: echristo, jlebar Subscribers: eraman, llvm-commits, JDevlieghere Differential Revision: https://reviews.llvm.org/D43390 llvm-svn: 325583
*	[PowerPC] Reduce stack frame for fastcc functions by only allocating ↵	Lei Huang	2018-02-20	1	-2/+11
\| \| \| \| \| \| \| \| \| \| \| \|	parameter save area when needed Current implementation always allocates the parameter save area conservatively for fastcc functions. There is no reason to allocate the parameter save area if all the parameters can be passed via registers. Differential Revision: https://reviews.llvm.org/D42602 llvm-svn: 325581
*	[Hexagon] Fix alignment calculation of stack objects in Hexagon bit tracker	Krzysztof Parzyszek	2018-02-20	3	-6/+6
\| \| \| \|	llvm-svn: 325580
*	[VectorLegalizer] Fix uint64_t typo in ExpandUINT_TO_FLOAT (PR36391)	Simon Pilgrim	2018-02-20	1	-1/+1
\| \| \| \| \| \| \| \|	ExpandUINT_TO_FLOAT can accept vXi32 or vXi64 inputs, so we need to use a uint64_t shift to generate the 2^(BW/2) constant. No test case unfortunately as no upstream target uses this, but its affecting a downstream target. llvm-svn: 325578
*	[ARM] Mark -1 as cheap in xor's for thumb1	David Green	2018-02-20	1	-0/+7
\| \| \| \| \| \| \| \| \| \|	We can always convert xor %a, -1 into MVN, even in thumb 1 where the -1 would not otherwise be considered a cheap constant. This prevents the -1's from being pulled out into constants and potentially hoisted. Differential Revision: https://reviews.llvm.org/D43451 llvm-svn: 325573
*	[llvm-mc] - Produce R_X86_64_PLT32 for "call/jmp foo".	George Rimar	2018-02-20	6	-2/+39
\| \| \| \| \| \| \| \| \| \| \|	For instructions like call foo and jmp foo patch changes relocation produced from R_X86_64_PC32 to R_X86_64_PLT32. Relocation can be used as a marker for 32-bit PC-relative branches. Linker will reduce PLT32 relocation to PC32 if function is defined locally. Differential revision: https://reviews.llvm.org/D43383 llvm-svn: 325569
*	[AMDGPU] stop buffer_store being moved illegally	Tim Renouf	2018-02-20	1	-6/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The machine instruction scheduler was illegally moving a buffer store past a buffer load with the same descriptor and offset. Fixed by marking buffer ops as mayAlias and isAliased. This may be overly conservative, and we may need to revisit. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D43332 Change-Id: Iff3173d9e0653e830474546276ab9d30318b8ef7 llvm-svn: 325567
*	[MC] - Don't crash on unclosed frame.	George Rimar	2018-02-20	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	llvm-mc can crash when there is cfi_startproc without cfi_end_proc: .text .globl foo foo: .cfi_startproc Testcase shows the issue, patch fixes it. Differential revision: https://reviews.llvm.org/D43456 llvm-svn: 325564
*	[X86] Add 512-bit unmasked pmulhrsw/pmulhw/pmulhuw intrinsics. Remove and ↵	Craig Topper	2018-02-20	2	-9/+46
\| \| \| \| \| \| \| \|	auto upgrade 128/256/512 bit masked pmulhrsw/pmulhw/pmulhuw intrinsics. The 128 and 256 bit versions were already not used by clang. This adds an equivalent unmasked 512 bit version. Then autoupgrades all sizes to use unmasked intrinsics plus select. llvm-svn: 325559
*	Report fatal error in the case of out of memory	Serge Pavlov	2018-02-20	9	-18/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the second part of recommit of r325224. The previous part was committed in r325426, which deals with C++ memory allocation. Solution for C memory allocation involved functions `llvm::malloc` and similar. This was a fragile solution because it caused ambiguity errors in some cases. In this commit the new functions have names like `llvm::safe_malloc`. The relevant part of original comment is below, updated for new function names. Analysis of fails in the case of out of memory errors can be tricky on Windows. Such error emerges at the point where memory allocation function fails, but manifests itself when null pointer is used. These two points may be distant from each other. Besides, next runs may not exhibit allocation error. In some cases memory is allocated by a call to some of C allocation functions, malloc, calloc and realloc. They are used for interoperability with C code, when allocated object has variable size and when it is necessary to avoid call of constructors. In many calls the result is not checked for null pointer. To simplify checks, new functions are defined in the namespace 'llvm': `safe_malloc`, `safe_calloc` and `safe_realloc`. They behave as corresponding standard functions but produce fatal error if allocation fails. This change replaces the standard functions like 'malloc' in the cases when the result of the allocation function is not checked for null pointer. Finally, there are plain C code, that uses malloc and similar functions. If the result is not checked, assert statement is added. Differential Revision: https://reviews.llvm.org/D43010 llvm-svn: 325551