bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
*	Reformat a couple of functions for clarity.	Eric Christopher	2014-05-07	1	-22/+19
\| \| \| \|	llvm-svn: 208248
*	[Hexagon] Add New TSFlags to be used in the upcoming patches.	Jyotsna Verma	2014-05-07	4	-67/+102
\| \| \| \|	llvm-svn: 208239
*	avoid segfaulting	Sebastian Pop	2014-05-07	1	-2/+1
\| \| \| \| \| \|	Quotient and Remainder don't have to be initialized. llvm-svn: 208238
*	do not collect undef terms	Sebastian Pop	2014-05-07	1	-1/+36
\| \| \| \|	llvm-svn: 208237
*	Fix using wrong result type for setcc.	Matt Arsenault	2014-05-07	2	-4/+16
\| \| \| \| \| \| \| \| \| \| \|	When reducing the bitwidth of a comparison against a constant, the original setcc's result type was used, which was incorrect. No test since I don't think any other in tree targets change the bitwidth of the setcc type depending on the bitwidth of the compared type. llvm-svn: 208236
*	split delinearization pass in 3 steps	Sebastian Pop	2014-05-07	3	-397/+484
\|	To compute the dimensions of the array in a unique way, we split the delinearization analysis in three steps:
\|	- find parametric terms in all memory access functions
\|	- compute the array dimensions from the set of terms
\|	- compute the delinearized access functions for each dimension
\|
\|	The first step is executed on all the memory access functions such that we gather all the patterns in which an array is accessed. The second step reduces all this information in a unique description of the sizes of the array. The third step is delinearizing each memory access function following the common description of the shape of the array computed in step 2.
\|
\|	This rewrite of the delinearization pass also solves a problem we had with the previous implementation: because the previous algorithm was by induction on the structure of the SCEV, it would not correctly recognize the shape of the array when the memory access was not following the nesting of the loops: for example, see polly/test/ScopInfo/multidim_only_ivs_3d_reverse.ll
\|	; void foo(long n, long m, long o, double A[n][m][o]) {
\|	;
\|	;   for (long i = 0; i < n; i++)
\|	;     for (long j = 0; j < m; j++)
\|	;       for (long k = 0; k < o; k++)
\|	;         A[i][k][j] = 1.0;
\|
\|	Starting with this patch we no longer delinearize access functions that do not contain parameters, for example in test/Analysis/DependenceAnalysis/GCD.ll
\|	;;  for (long int i = 0; i < 100; i++)
\|	;;    for (long int j = 0; j < 100; j++) {
\|	;;      A[2*i - 4*j] = i;
\|	;;      *B++ = A[6*i + 8*j];
\|
\|	these accesses will not be delinearized as the upper bound of the loops are constants, and their access functions do not contain SCEVUnknown parameters. llvm-svn: 208232
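The third step of the delinearization commit above can be illustrated with plain integers. The following is only a sketch, not the pass itself: the real analysis works symbolically on SCEV expressions, while this version assumes concrete, non-negative, in-bounds subscripts and already-known dimension sizes. The function names `linearize` and `delinearize` are hypothetical and do not come from the LLVM sources.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Row-major offset of `subscripts` in an array whose inner dimension sizes
// (every dimension except the outermost) are listed outermost-first,
// e.g. {m, o} for double A[n][m][o].
long linearize(const std::vector<long> &subscripts,
               const std::vector<long> &innerSizes) {
  assert(subscripts.size() == innerSizes.size() + 1);
  long offset = subscripts[0];
  for (std::size_t d = 1; d < subscripts.size(); ++d)
    offset = offset * innerSizes[d - 1] + subscripts[d];
  return offset;
}

// Recover one subscript per dimension from a flat offset by repeated
// division: the remainder is the subscript of the current innermost
// dimension, the quotient is carried to the next outer dimension.
// Assumes offset >= 0 and every size > 0 (an assumption of this sketch).
std::vector<long> delinearize(long offset,
                              const std::vector<long> &innerSizes) {
  std::vector<long> subscripts(innerSizes.size() + 1);
  for (std::size_t d = innerSizes.size(); d > 0; --d) {
    subscripts[d] = offset % innerSizes[d - 1];
    offset /= innerSizes[d - 1];
  }
  subscripts[0] = offset;
  return subscripts;
}
```

For the access A[i][k][j] from the test case, with m = 5 and o = 6, the subscripts {2, 3, 1} flatten to a single offset and `delinearize` recovers them from that offset and the sizes {5, 6} alone. This is why the shape has to be agreed on in step 2 before any access is split.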
*	[x86] Make the 'x86-64' cpu, what I see as and many use as the generic	Chandler Carruth	2014-05-07	1	-2/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	default architecture for reasonable modern x86 processors, actually be modern. This processor model should essentially be "tuned" for modern x86 chips as much as possible without undue penalties on any specific architecture. Previously we weren't even using the nice scheduling models. There are a few other tweaks needed here, but this change at least I have benchmarked across a decent swatch of chips (intel's clovertown, westmere, and sandybridge; amd's istanbul) and seen no significant regressions. If anyone has suggested ways to test this, just let me know. Somewhat alarmingly, no existing tests failed. llvm-svn: 208230
*	Tidy up whitespace with clang-format prior to making significant	Chandler Carruth	2014-05-07	1	-45/+41
\| \| \| \| \| \|	changes. llvm-svn: 208229
*	[yaml2obj] Support ELF x86 relocations.	Simon Atanasyan	2014-05-07	1	-0/+43
\| \| \| \|	llvm-svn: 208228
*	[ARM64][fast-isel] Disable target specific optimizations at -O0. Functionally,	Chad Rosier	2014-05-07	3	-31/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	this patch disables the dead register elimination pass and the load/store pair optimization pass at -O0. The ILP optimizations don't require the optimization level to be checked because the call to addILPOpts is predicated with the necessary check. The AdvSIMDScalar pass is disabled by default at all optimization levels. This patch leaves that pass disabled by default. Also, move command-line options into ARM64TargetMachine.cpp and add a few additional flags to aid in debugging. This fixes an issue with the -debug-pass=Structure flag where passes were printed, but not actually run (i.e., AdvSIMDScalar pass). llvm-svn: 208223
*	[mips] Add highly experimental support for MIPS-I, MIPS-II, MIPS-III, and MIPS-V	Daniel Sanders	2014-05-07	3	-5/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: These processors will only be available for the integrated assembler at first (CodeGen will emit a fatal error saying they are not implemented). The intention is to work through the existing instructions and correctly annotate the ISA they were added in so that we have a sufficiently good base to start MIPS64r6 development. MIPS64r6 removes/re-encodes certain instructions and I believe it is best to define ISA's using set-union's as far as possible rather than using set-subtraction. Reviewers: vmedic Subscribers: emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D3569 llvm-svn: 208221
*	llvm-cov: Explicitly namespace llvm::make_unique to keep MSVC happy	Justin Bogner	2014-05-07	1	-4/+2
\| \| \| \| \| \| \| \|	This is a followup to r208171, where a call to make_unique was disambiguated for MSVC. Disambiguate two more calls, and remove the comment about it since this is what we do everywhere. llvm-svn: 208219
*	Use range loop.	Rafael Espindola	2014-05-07	1	-3/+2
\| \| \| \|	llvm-svn: 208218
*	[InstCombine] Add optimization of redundant insertvalue instructions.	Michael Zolotukhin	2014-05-07	2	-0/+37
\| \| \| \| \| \|	rdar://problem/11861387 llvm-svn: 208214
*	[mips] Add FGR_32/FGR_64/GPR_64 adjectives and use them instead of ↵	Daniel Sanders	2014-05-07	3	-161/+156
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	FGRPredicates/GPRPredicates Summary: No functional change (confirmed by diffing tablegen-erated files). Depends on D3642 Reviewers: vmedic, dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3645 llvm-svn: 208213
*	[mips] Add INSN_<name> adverbs and start using them instead of ↵	Daniel Sanders	2014-05-07	1	-6/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	AdditionalPredicates overrides Summary: No functional change Depends on D3641 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3642 llvm-svn: 208212
*	[msan] Fix -fsanitize=memory -fno-integrated-as.	Evgeniy Stepanov	2014-05-07	1	-1/+1
\| \| \| \|	llvm-svn: 208211
*	AArch64/ARM64: optimise vector selects & enable test	Tim Northover	2014-05-07	1	-0/+41
\| \| \| \| \| \| \| \| \|	When performing a scalar comparison that feeds into a vector select, it's actually better to do the comparison on the vector side: the scalar route would be "CMP -> CSEL -> DUP", the vector is "CM -> DUP" since the vector comparisons are all mask based. llvm-svn: 208210
*	[mips] Add ISA_<name> adverbs and start using them instead of ↵	Daniel Sanders	2014-05-07	4	-42/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	AdditionalPredicates overrides Summary: One small functional change. The recently added PAUSE instruction now has the HasStdEnc predicate which was accidentally removed by a Requires<>. Depends on D3640 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3641 llvm-svn: 208209
*	Remove the UseCFI option from createAsmStreamer.	Rafael Espindola	2014-05-07	8	-29/+20
\| \| \| \| \| \|	We were already always passing true, this just removes the option. llvm-svn: 208205
*	[mips] Continue splitting Instruction.Predicates into smaller lists and ↵	Daniel Sanders	2014-05-07	3	-29/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	re-join them with !listconcat Summary: Move IsGP64bit into GPRPredicates, and IsFP64bit/NotFP64bit into FGRPredicates No functional change (confirmed by diffing tablegen-erated files). Depends on D3639 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3640 llvm-svn: 208201
*	[ARM64-BE] Fix fast-isel, and add appropriate RUN lines to appropriate tests.	James Molloy	2014-05-07	1	-0/+5
\| \| \| \|	llvm-svn: 208200
*	[ARM64-BE] Fix variable-argument saving.	James Molloy	2014-05-07	1	-1/+2
\| \| \| \|	llvm-svn: 208199
*	[ARM64-BE] Implement the lane-twiddling logic at AAPCS boundaries for big ↵	James Molloy	2014-05-07	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	endian. The AAPCS states that values passed in registers must have a value as though they had been loaded with "LDR". LDR is equivalent to "LD1.64 vX.1D" - that is, loading scalars to vector registers and loading 1-element vectors is equivalent. The logic implemented here is to ensure that at all call boundaries and during formal argument lowering all vectors are treated as their bitwidth-based floating point scalar counterpart, which is always one of f64 or f128 (v2i32 -> f64, v4i32 -> f128 etc). A BITCAST is inserted so that the appropriate REV will be generated during code generation. llvm-svn: 208198
*	[mips] Move IsFP64bit/NotFP64bit to the front of the AdditionalPredicates list	Daniel Sanders	2014-05-07	1	-6/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This makes it easier to prove a more complicated change in the next commit is non-functional. Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3639 llvm-svn: 208197
*	[ARM64-BE] Implement the crazy bitcast handling for big endian vectors.	James Molloy	2014-05-07	1	-46/+326
\|	Because we've canonicalised on using LD1/ST1, every time we do a bitcast between vector types we must do an equivalent lane reversal.
\|
\|	Consider a simple memory load followed by a bitconvert then a store.
\|	  v0 = load v2i32
\|	  v1 = BITCAST v2i32 v0 to v4i16
\|	       store v4i16 v2
\|
\|	In big endian mode every memory access has an implicit byte swap. LDR and STR do a 64-bit byte swap, whereas LD1/ST1 do a byte swap per lane - that is, they treat the vector as a sequence of elements to be byte-swapped. The two pairs of instructions are fundamentally incompatible. We've decided to use LD1/ST1 only to simplify compiler implementation.
\|
\|	LD1/ST1 perform the equivalent of a sequence of LDR/STR + REV. This makes the original code sequence:
\|	  v0 = load v2i32
\|	  v1 = REV v2i32 (implicit)
\|	  v2 = BITCAST v2i32 v1 to v4i16
\|	  v3 = REV v4i16 v2 (implicit)
\|	       store v4i16 v3
\|
\|	But this is now broken - the value stored is different to the value loaded due to lane reordering. To fix this, on every BITCAST we must perform two other REVs:
\|	  v0 = load v2i32
\|	  v1 = REV v2i32 (implicit)
\|	  v2 = REV v2i32
\|	  v3 = BITCAST v2i32 v2 to v4i16
\|	  v4 = REV v4i16
\|	  v5 = REV v4i16 v4 (implicit)
\|	       store v4i16 v5
\|
\|	This means an extra two instructions, but actually in most cases the two REV instructions can be combined into one. For example:
\|	  (REV64_2s (REV64_4h X)) === (REV32_4h X)
\|
\|	There is also no 128-bit REV instruction. This must be synthesized with an EXT instruction.
\|
\|	Most bitconverts require some sort of conversion. The only exceptions are:
\|	  a) Identity conversions - vNfX <-> vNiX
\|	  b) Single-lane-to-scalar - v1fX <-> fX or v1iX <-> iX
\|
\|	Even though there are hundreds of changed lines, I have a fairly high confidence that they are somewhat correct. The changes to add two REV instructions per bitcast were pretty mechanical, and once I'd done that I threw the resulting .td at a script I wrote which combined the two REVs together (and added an EXT instruction, for f128) based on an instruction description I gave it. This was much less prone to error than doing it all manually, plus my brain would not just have melted but would have vapourised. llvm-svn: 208194
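The REV-combining identity quoted in the commit above, (REV64_2s (REV64_4h X)) === (REV32_4h X), can be checked with a small lane-order model. This sketch only tracks where each 16-bit lane of a 64-bit vector ends up; it assumes that REV64 reverses elements within a 64-bit doubleword and REV32 reverses them within each 32-bit word. The type `V4H` and the three functions are names invented for this illustration and are not backend code.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// A 64-bit vector viewed as four 16-bit lanes (hypothetical helper type).
using V4H = std::array<std::uint16_t, 4>;

// REV64 on 16-bit elements: reverse all four halfwords in the doubleword.
V4H rev64_4h(const V4H &v) { return V4H{{v[3], v[2], v[1], v[0]}}; }

// REV64 on 32-bit elements: swap the two words; the halfwords inside each
// word keep their relative order.
V4H rev64_2s(const V4H &v) { return V4H{{v[2], v[3], v[0], v[1]}}; }

// REV32 on 16-bit elements: swap the two halfwords inside each 32-bit word.
V4H rev32_4h(const V4H &v) { return V4H{{v[1], v[0], v[3], v[2]}}; }
```

Applying `rev64_4h` and then `rev64_2s` to lanes {h0, h1, h2, h3} gives {h1, h0, h3, h2}, which is what `rev32_4h` produces in one step, so the two explicit REVs around a v2i32 to v4i16 bitcast collapse into a single instruction.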
*	[ARM64-BE] Predicate VLDR/VSTR for vectors as little-endian only. We must ↵	James Molloy	2014-05-07	1	-95/+131
\| \| \| \| \| \|	use LD1/ST1 on big-endian. llvm-svn: 208193
*	[ARM64-BE] Make big endian (scalar) argument passing work correctly.	James Molloy	2014-05-07	1	-6/+38
\| \| \| \| \| \| \| \| \| \|	This completes the port of r204814 (cpirker "AArch64_BE function argument passing for ARM ABI") from AArch64 to ARM64, and fixes a bunch of issues found during later development along the way. The biggest of these was that the alignment fixup logic wasn't replicated into all the places it should have been. llvm-svn: 208192
*	MergeFunctions Pass, introduced total ordering among values.	Stepan Dyatkovskiy	2014-05-07	1	-41/+96
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the third patch of the patch series that improves MergeFunctions performance time from O(N*N) to O(N*log(N)). This patch description: When comparing functions we need to compare the values we meet on the left and right sides. It's easy to sort things out for external values: it should just be the same value on the left and right. But for local values (those introduced inside the function body) we have to ensure they were introduced at exactly the same place and play the same role. In short, the patch introduces serial numbering of values and a comparison routine. The latter compares two values by their serial numbers. llvm-svn: 208189
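The serial-numbering idea from the MergeFunctions commit above can be sketched as follows. This is a simplified model, not the pass: values are stood in for by strings, each side gets its own table that numbers a value the first time it is seen, and two values compare equal when they received the same number. `SerialComparator` and `cmpSequences` are hypothetical names chosen for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Assigns each value a serial number on first encounter, separately for the
// left and the right function, and orders two values by those numbers.
class SerialComparator {
  std::map<std::string, std::size_t> left_, right_;

public:
  int cmpValues(const std::string &l, const std::string &r) {
    // emplace leaves an existing entry untouched, so a value keeps the
    // number it was given when it was first introduced.
    std::size_t ln = left_.emplace(l, left_.size()).first->second;
    std::size_t rn = right_.emplace(r, right_.size()).first->second;
    return ln < rn ? -1 : (ln > rn ? 1 : 0);
  }
};

// Compares two "function bodies", modelled as sequences of value uses.
// Returns -1, 0 or 1, which gives a total order usable for a binary tree.
int cmpSequences(const std::vector<std::string> &L,
                 const std::vector<std::string> &R) {
  if (L.size() != R.size())
    return L.size() < R.size() ? -1 : 1;
  SerialComparator cmp;
  for (std::size_t i = 0; i < L.size(); ++i)
    if (int res = cmp.cmpValues(L[i], R[i]))
      return res;
  return 0;
}
```

Under this model the sequences a, b, a and x, y, x compare equal because the third use refers back to the first-introduced value on both sides, while a, b, a and x, y, y differ at the third position.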
*	[mips] Split Instruction.Predicates into smaller lists and re-join them with ↵	Daniel Sanders	2014-05-07	7	-77/+98
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	!listconcat Summary: The overall idea is to chop the Predicates list into subsets that are usually overridden independently. This allows subclasses to partially override the predicates of their superclasses without having to re-add all the existing predicates. This patch starts the process by moving HasStdEnc into a new EncodingPredicates list and almost everything else into AdditionalPredicates. It has revealed a couple likely bugs where 'let Predicates' has removed the HasStdEnc predicate. No functional change (confirmed by diffing tablegen-erated files). Depends on D3549, D3506 Reviewers: vmedic Differential Revision: http://reviews.llvm.org/D3550 llvm-svn: 208184
*	[tablegen] Add !listconcat operator with the similar semantics as !strconcat	Daniel Sanders	2014-05-07	4	-2/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: It concatenates two or more lists. In addition to the !strconcat semantics the lists must have the same element type. My overall aim is to make it easy to append to Instruction.Predicates rather than override it. This can be done by concatenating lists passed as arguments, or by concatenating lists passed in additional fields. Reviewers: dsanders Reviewed By: dsanders Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D3506 llvm-svn: 208183
*	[mips] Move HasStdEnc to the front of the predicates lists.	Daniel Sanders	2014-05-07	5	-61/+61
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: This will make it easier to prove that a more complicated change in the following commit is non-functional. No functional change. Depends on D3506 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3549 llvm-svn: 208179
*	[BUG][REFACTOR]	Zinovy Nis	2014-05-07	3	-43/+46
\| \| \| \| \| \| \| \| \|	1) Fix for printing debug locations for absolute paths. 2) Location printing is moved into public method DebugLoc::print() to avoid re-inventing the wheel. Differential Revision: http://reviews.llvm.org/D3513 llvm-svn: 208177
*	Second patch of patch series that improves MergeFunctions performance time ↵	Stepan Dyatkovskiy	2014-05-07	1	-4/+278
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	from O(N*N) to O(N*log(N)). The idea is to introduce a total ordering among the set of functions. It allows building a binary tree and performing the function look-up procedure in O(log(N)) time. This patch description: Introduced total ordering among constants, implemented in the cmpConstants method. The method performs a lexicographical comparison between constants represented as hypothetical numbers of the following format: <bitcastability-trait><raw-bit-contents> Please read the cmpConstants declaration comments for more details. llvm-svn: 208173
*	Work-around MSVS build breakage due to r208148	Timur Iskhodzhanov	2014-05-07	1	-2/+4
\| \| \| \|	llvm-svn: 208171
*	[asan] Add a flag to control asm instrumentation.	Evgeniy Stepanov	2014-05-07	1	-1/+8
\| \| \| \| \| \|	With this change, asm instrumentation is disabled by default. llvm-svn: 208167
*	Allow using normal .eh_frame based unwinding on ARM. Use the same	Joerg Sonnenberger	2014-05-07	3	-1/+17
\| \| \| \| \| \|	encodings as x86. Use this exception model for NetBSD. llvm-svn: 208166
*	PR19562: DebugInfo temporary MDNode leak: Don't include a temporary node to ↵	David Blaikie	2014-05-07	1	-2/+1
\| \| \| \| \| \| \| \| \| \|	replace with a variable list for methods, since they're always declarations and thus never include variables This field is used for a list of variables to ensure they are not lost during optimization (they're only included when optimizations are enabled). llvm-svn: 208159
*	[C++11] Add NArySCEV->Operands iterator range	Tobias Grosser	2014-05-07	1	-8/+6
\| \| \| \|	llvm-svn: 208158
*	ARM: mark additional instructions as MachineFrameSetup	Saleem Abdulrasool	2014-05-07	1	-5/+10
\| \| \| \| \| \| \| \|	Mark up additional instructions which are part of the function prologue as MachineFrameSetup. These instructions are part of the function prologue, emitted by the PEI pass to setup the stack for use in the activating frame. llvm-svn: 208153
*	ARM: fix WoA PEI instruction selection	Saleem Abdulrasool	2014-05-07	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \|	The ARM::BLX instruction is an ARM mode instruction. The Windows on ARM target is limited to Thumb instructions. Correctly use the thumb mode tBLXr instruction. This would manifest as an errant write into the object file as the instruction is 4-bytes in length rather than 2. The result would be a corrupted object file that would eventually result in an executable that would crash at runtime. llvm-svn: 208152
*	llvm-cov: Handle missing source files as GCOV does	Justin Bogner	2014-05-07	1	-13/+29
\| \| \| \| \| \| \| \| \| \| \|	If the source files referenced by a gcno file are missing, gcov outputs a coverage file where every line is simply /*EOF*/. This also occurs for lines in the coverage that are past the end of a file that is found. This change mimics gcov. llvm-svn: 208149
*	llvm-cov: Implement --no-output	Justin Bogner	2014-05-07	1	-15/+43
\| \| \| \| \| \| \| \|	In gcov, there's a -n/--no-output option, which disables the writing of any .gcov files, so that it emits only the summary info on stdout. This implements the same behaviour in llvm-cov. llvm-svn: 208148
*	[Support/MemoryBuffer] Remove the assertion that the file size did not shrink.	Argyrios Kyrtzidis	2014-05-06	1	-3/+0
\| \| \| \| \| \|	This can happen in practice with the user changing files and we can recover from it. llvm-svn: 208143
*	Fix ASan init function detection after clang r208128.	Nico Weber	2014-05-06	1	-3/+24
\| \| \| \|	llvm-svn: 208141
*	Special case aliases in GlobalValue::getSection.	Rafael Espindola	2014-05-06	2	-1/+6
\| \| \| \| \| \| \| \|	This is similar to the getAlignment patch, but is done just for completeness. It looks like we never call getSection on an alias. All the tests still pass if the if is replaced with an assert. llvm-svn: 208139
*	Update an embarrassing out-of-date comment.	Andrew Trick	2014-05-06	1	-5/+6
\| \| \| \|	llvm-svn: 208137
*	Use a range based for loop for the SubtargetFeatures print function.	Eric Christopher	2014-05-06	1	-2/+2
\| \| \| \|	llvm-svn: 208132
*	Revert "Try simplifying LexicalScopes ownership again."	David Blaikie	2014-05-06	1	-32/+28
\| \| \| \| \| \| \| \| \|	Speculatively reverting due to a suspicious failure on a Windows buildbot. This reverts commit 10c37a012ea11596d44cd9059fe09c959caf30c8. llvm-svn: 208131
*	Fix odd formatting that snuck into last patch.	Eric Christopher	2014-05-06	1	-3/+3
\| \| \| \|	llvm-svn: 208130