bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Finish renaming remaining analyzeBranch functions	Matt Arsenault	2016-09-14	46	-138/+136
\| \| \| \|	llvm-svn: 281535
*	[Stackmap] Added callsite counts to emitted function information.	Sanjoy Das	2016-09-14	1	-13/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: It was previously not possible for tools to use solely the stackmap information emitted to reconstruct the return addresses of callsites in the map, which is necessary to use the information to walk a stack. This patch adds per-function callsite counts when emitting the stackmap section in order to resolve the problem. Note that this slightly alters the stackmap format, so external tools parsing these maps will need to be updated. Problem Details: Records only store their offset from the beginning of the function they belong to. While these records and the functions are output in program order, it is not possible to determine where the end of one function's records are without the callsite count when processing the records to compute return addresses. Patch by Kavon Farvardin! Reviewers: atrick, ributzka, sanjoy Subscribers: nemanjai Differential Revision: https://reviews.llvm.org/D23487 llvm-svn: 281532
*	Revert "[ARM] Promote small global constants to constant pools"	Evgeniy Stepanov	2016-09-14	4	-145/+1
\| \| \| \| \| \| \| \|	Breaks Android tests by introducing text relocations to ARM binaries. http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/25362/steps/run%20asan%20lit%20tests%20%5Barm%2Fbullhead-userdebug%2FMTC20F%5D/logs/stdio llvm-svn: 281526
*	[lib/LTO] Fix a typo. NFC.	Davide Italiano	2016-09-14	1	-1/+1
\| \| \| \|	llvm-svn: 281517
*	Revert "AMDGPU: Use SOPK compare instructions"	Matt Arsenault	2016-09-14	6	-135/+51
\| \| \| \| \| \|	Accidentally committed llvm-svn: 281514
*	AMDGPU: Use SOPK compare instructions	Matt Arsenault	2016-09-14	6	-51/+135
\| \| \| \|	llvm-svn: 281513
*	Verifier: Mark orphaned DICompileUnits as a debug info failure.	Adrian Prantl	2016-09-14	1	-10/+10
\| \| \| \| \| \| \| \| \|	This is a follow-up to r268778 that adds a couple of missing cases, most notably orphaned compile units. rdar://problem/28193346 llvm-svn: 281508
*	Make analyzeBranch family of instruction names consistent	Matt Arsenault	2016-09-14	46	-97/+97
\| \| \| \| \| \| \|	analyzeBranch was renamed to use lowercase first, rename the related set to match. llvm-svn: 281506
*	AArch64: Use TTI branch functions in branch relaxation	Matt Arsenault	2016-09-14	35	-240/+268
\| \| \| \| \| \| \| \| \|	The main change is to return the code size from InsertBranch/RemoveBranch. Patch mostly by Tim Northover llvm-svn: 281505
*	[x86] fix formatting; NFC	Sanjay Patel	2016-09-14	1	-28/+20
\| \| \| \|	llvm-svn: 281504
*	[compiler-rt] Avoid instrumenting sanitizer functions	Etienne Bergeron	2016-09-14	1	-6/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: Function __asan_default_options is called by __asan_init before the shadow memory got initialized. Instrumenting that function may lead to flaky execution. As the __asan_default_options is provided by users, we cannot expect them to add the appropriate function atttributes to avoid instrumentation. Reviewers: kcc, rnk Subscribers: dberris, chrisha, llvm-commits Differential Revision: https://reviews.llvm.org/D24566 llvm-svn: 281503
*	[X86][SSE] Improve recognition of i64 sitofp conversions that can be ↵	Simon Pilgrim	2016-09-14	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \|	performed as i32 (PR29078) Until AVX512DQ we only support i64/vXi64 sitofp conversion as scalars. This patch sees if the sign bit extends far enough that we can truncate to a i32 type and then perform sitofp without loss of precision. Differential Revision: https://reviews.llvm.org/D24345 llvm-svn: 281502
*	[LoopInterchange] Typo. NFC.	Chad Rosier	2016-09-14	1	-4/+4
\| \| \| \|	llvm-svn: 281501
*	[LoopInterchange] Add CL option to override cost threshold.	Chad Rosier	2016-09-14	1	-3/+6
\| \| \| \| \| \|	Mostly useful for getting consistent lit testing. llvm-svn: 281500
*	[X86][SSE] Don't use PSHUFD directly - lower with generic shuffle	Simon Pilgrim	2016-09-14	1	-17/+1
\| \| \| \| \| \|	Remove the last user of the old getTargetShuffleNode helpers llvm-svn: 281499
*	getValueType().getScalarSizeInBits() -> getScalarValueSizeInBits(), round 2 ↵	Sanjay Patel	2016-09-14	3	-8/+6
\| \| \| \| \| \|	; NFCI llvm-svn: 281498
*	[LoopInterchange] Cleanup debug whitespace. NFC.	Chad Rosier	2016-09-14	1	-4/+4
\| \| \| \|	llvm-svn: 281497
*	getVectorElementType().getSizeInBits() -> getScalarSizeInBits() ; NFCI	Sanjay Patel	2016-09-14	20	-86/+84
\| \| \| \|	llvm-svn: 281495
*	getValueType().getSizeInBits() -> getValueSizeInBits() ; NFCI	Sanjay Patel	2016-09-14	24	-120/+105
\| \| \| \|	llvm-svn: 281493
*	Fix typo in comment [NFC]	Etienne Bergeron	2016-09-14	1	-1/+1
\| \| \| \|	llvm-svn: 281492
*	AMDGPU: Support folding FrameIndex operands	Matt Arsenault	2016-09-14	1	-9/+26
\| \| \| \| \| \|	This avoids test regressions in a future commit. llvm-svn: 281491
*	getValueType().getScalarSizeInBits() -> getScalarValueSizeInBits() ; NFCI	Sanjay Patel	2016-09-14	7	-57/+42
\| \| \| \|	llvm-svn: 281490
*	getScalarType().getSizeInBits() -> getScalarSizeInBits() ; NFCI	Sanjay Patel	2016-09-14	12	-77/+77
\| \| \| \|	llvm-svn: 281489
*	AMDGPU: Improve splitting 64-bit bit ops by constants	Matt Arsenault	2016-09-14	6	-88/+272
\| \| \| \| \| \| \| \|	This addresses a TODO to handle operations besides and. This also starts eliminating no-op operations with a constant that can emerge later. llvm-svn: 281488
*	[LV] Process pointer IVs with PHINodes in collectLoopUniforms	Matthew Simpson	2016-09-14	1	-4/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch moves the processing of pointer induction variables in collectLoopUniforms from the consecutive pointer phase of the analysis to the phi node phase. Previously, if a pointer induction variable was used by both a scalarized non-memory instruction as well as a vectorized memory instruction, we would incorrectly identify the pointer as uniform. Pointer induction variables should be treated the same as other phi nodes. That is, they are uniform if all users of the induction variable and induction variable update are uniform. Differential Revision: https://reviews.llvm.org/D24511 llvm-svn: 281485
*	[ARM] Promote small global constants to constant pools	James Molloy	2016-09-14	4	-1/+145
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If a constant is unamed_addr and is only used within one function, we can save on the code size and runtime cost of an indirection by changing the global's storage to inside the constant pool. For example, instead of: ldr r0, .CPI0 bl printf bx lr .CPI0: &format_string format_string: .asciz "hello, world!\n" We can emit: adr r0, .CPI0 bl printf bx lr .CPI0: .asciz "hello, world!\n" This can cause significant code size savings when many small strings are used in one function (4 bytes per string). llvm-svn: 281484
*	[X86][SSE] Removed unused getTargetShuffleNode function	Simon Pilgrim	2016-09-14	1	-17/+0
\| \| \| \|	llvm-svn: 281481
*	Fix code-gen crash on Power9 for insert_vector_elt with variable index (PR30189)	Nemanja Ivanovic	2016-09-14	2	-2/+16
\| \| \| \| \| \| \| \| \| \| \|	This patch corresponds to review: https://reviews.llvm.org/D24021 In the initial implementation of this instruction, I forgot to account for variable indices. This patch fixes PR30189 and should probably be merged into 3.9.1 (I'll open a bug according to the new instructions). llvm-svn: 281479
*	[StackProtector] Use INITIALIZE_TM_PASS instead of INITIALIZE_PASS	Silviu Baranga	2016-09-14	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	in order to make sure that its TargetMachine constructor is registered. This allows us to run the PEI machine pass with MIR input (see PR30324). llvm-svn: 281474
*	Adding missing directive for Power9.	Nemanja Ivanovic	2016-09-14	1	-1/+1
\| \| \| \| \| \| \| \|	There is currently no codegen for Power9 that depends on the directive so this is NFC for now but will be important in the future. This was missed in r268950 so I'm adding it now. llvm-svn: 281473
*	[X86][SSE] Don't blend vector shifts with MOVSS/MOVSD directly, lower from ↵	Simon Pilgrim	2016-09-14	1	-10/+10
\| \| \| \| \| \| \| \|	generic shuffle Shuffle lowering will correctly lower to MOVSS/MOVSD/PBLEND, improving commutation opportunities llvm-svn: 281471
*	[asan] Enable -asan-use-private-alias on Darwin/Mach-O, add test for ODR ↵	Kuba Brecka	2016-09-14	1	-1/+2
\| \| \| \| \| \| \| \| \| \|	false positive with LTO (llvm part) The '-asan-use-private-alias’ option (disabled by default) option is currently only enabled for Linux and ELF, but it also works on Darwin and Mach-O. This option also fixes a known problem with LTO on Darwin (https://github.com/google/sanitizers/issues/647). This patch enables the support for Darwin (but still keeps it off by default) and adds the LTO test case. Differential Revision: https://reviews.llvm.org/D24292 llvm-svn: 281470
*	Revert "[Thumb] Teach ISel how to lower compares of AND bitmasks efficiently"	James Molloy	2016-09-14	2	-138/+5
\| \| \| \| \| \|	This reverts commit r281323. It caused chromium test failures and a selfhost failure. llvm-svn: 281451
*	Missing includes.	Vassil Vassilev	2016-09-14	5	-1/+5
\| \| \| \|	llvm-svn: 281450
*	GlobalISel: mark pointer stores as legal on AArch64.	Tim Northover	2016-09-14	1	-1/+1
\| \| \| \|	llvm-svn: 281448
*	This reapplies r281304. The issue was that I had missed	Sjoerd Meijer	2016-09-14	5	-49/+52
\| \| \| \| \| \|	to copy the new isAdd field in the tablegen data structure. llvm-svn: 281447
*	AVX-512: Fixed a bug in kortest.z intrinsic	Elena Demikhovsky	2016-09-14	1	-1/+1
\| \| \| \| \| \|	Lowering was wrong - X86ISD::SETCC node should return i8 type. llvm-svn: 281446
*	[AVX512BW] Change truncStore action (v16i16->v16i18). It can be legal only ↵	Igor Breger	2016-09-14	1	-2/+3
\| \| \| \| \| \| \| \|	with AVX512VL. Differential Revision: http://reviews.llvm.org/D24547 llvm-svn: 281445
*	[X86] Remove the VCVTSI2SD32 with rounding intrinsic. It's not used by clang ↵	Craig Topper	2016-09-14	1	-1/+0
\| \| \| \| \| \|	and not needed since 32-bit integer to double is always exact. llvm-svn: 281442
*	Create a getelementptr instead of sub expr for ValueOffsetPair if the	Wei Mi	2016-09-14	1	-3/+22
\| \| \| \| \| \| \| \| \| \| \| \|	value is a pointer. This patch is to fix PR30213. When expanding an expr based on ValueOffsetPair, if the value is of pointer type, we can only create a getelementptr instead of sub expr. Differential Revision: https://reviews.llvm.org/D24088 llvm-svn: 281439
*	[libFuzzer] start using trace-pc-guard as an alternative source of coverage	Kostya Serebryany	2016-09-14	6	-52/+32
\| \| \| \|	llvm-svn: 281435
*	[sanitizer-coverage] add yet another flavour of coverage instrumentation: ↵	Kostya Serebryany	2016-09-14	1	-2/+52
\| \| \| \| \| \|	trace-pc-guard. The intent is to eventually replace all of {bool coverage, 8bit-counters, trace-pc} with just this one. LLVM part llvm-svn: 281431
*	Address Pete's review comment and define OrigArg on its own line.	Akira Hatanaka	2016-09-13	1	-1/+2
\| \| \| \| \| \|	This is a follow-up to r281419. llvm-svn: 281421
*	[ObjCARC] Traverse chain downwards to replace uses of argument passed to	Akira Hatanaka	2016-09-13	1	-4/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ObjC library call with call return. ARC contraction tries to replace uses of an argument passed to an objective-c library call with the call return value. For example, in the following IR, it replaces uses of argument %9 and uses of the values discovered traversing the chain upwards (%7 and %8) with the call return %10, if they are dominated by the call to @objc_autoreleaseReturnValue. This transformation enables code-gen to tail-call the call to @objc_autoreleaseReturnValue, which is necessary to enable auto release return value optimization. %7 = tail call i8* @objc_loadWeakRetained(i8** %6) %8 = bitcast i8* %7 to %0* %9 = bitcast %0* %8 to i8* %10 = tail call i8* @objc_autoreleaseReturnValue(i8* %9) ret %0* %8 Since r276727, llvm started removing redundant bitcasts and as a result started feeding the following IR to ARC contraction: %7 = tail call i8* @objc_loadWeakRetained(i8** %6) %8 = bitcast i8* %7 to %0* %9 = tail call i8* @objc_autoreleaseReturnValue(i8* %7) ret %0* %8 ARC contraction no longer does the optimization described above since it only traverses the chain upwards and fails to recognize that the function return can be replaced by the call return. This commit changes ARC contraction to traverse the chain downwards too and replace uses of bitcasts with the call return. rdar://problem/28011339 Differential Revision: https://reviews.llvm.org/D24523 llvm-svn: 281419
*	[CodeGen] Fix invalid shift in mul expansion	Pawel Bylica	2016-09-13	1	-6/+11
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: When expanding mul in type legalization make sure the type for shift amount can actually fit the value. This fixes PR30354 https://llvm.org/bugs/show_bug.cgi?id=30354. Reviewers: hfinkel, majnemer, RKSimon Subscribers: RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D24478 llvm-svn: 281403
*	[DAG] Allow build-to-shuffle combine to combine builds from two wide vectors.	Michael Kuperstein	2016-09-13	1	-27/+53
\| \| \| \| \| \| \| \| \| \| \|	This allows us to, in some cases, create a vector_shuffle out of a build_vector, when the inputs to the build are extract_elements from two different vectors, at least one of which is wider than the output. (E.g. a <8 x i16> being constructed out of elements from a <16 x i16> and a <8 x i16>). Differential Revision: https://reviews.llvm.org/D24491 llvm-svn: 281402
*	Next set of additional error checks for invalid Mach-O files for bad load ↵	Kevin Enderby	2016-09-13	1	-7/+82
\| \| \| \| \| \| \| \| \| \| \| \| \|	commands that use the Mach::dyld_info_command type for the load commands that are currently use in the MachOObjectFile constructor. This contains the missing checks for LC_DYLD_INFO and LC_DYLD_INFO_ONLY load commands and the fields for the Mach::dyld_info_command type. llvm-svn: 281400
*	[Hexagon] Better handling of HVX vector lowering	Krzysztof Parzyszek	2016-09-13	2	-4/+17
\| \| \| \| \| \| \|	- Expand SELECT_CC and BR_CC for vector types. - Implement TLI::isShuffleMaskLegal. llvm-svn: 281397
*	Reapply "InstCombine: Reduce trunc (shl x, K) width."	Matt Arsenault	2016-09-13	1	-7/+25
\| \| \| \| \| \| \|	This reapplies r272987 with a fix for infinitely looping when the truncated value is another shift of a constant. llvm-svn: 281379
*	AArch64: Cleanup tailcall CC check, enable swiftcc.	Matthias Braun	2016-09-13	2	-14/+20
\| \| \| \| \| \| \| \| \| \| \| \| \|	Cleanup/change the code that checks for possible tailcall conventions to look the same as the one in the X86 target. This makes the distinction between calling conventions that can guarnatee tailcalls and the ones that may tailcall more obvious. - Add Swift to the mayTailCall list - PreserveMost seemed to be incorrectly part of the guarnteed tail call list, move it to the mayTailCall list. llvm-svn: 281376