bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message	Author	Age	Files	Lines
...
*	WebAssembly: update expected failures	JF Bastien	2016-01-31	1	-16/+0
\| \| \| \| \| \|	r259305 fixed a few assertions around FrameIndex, and I forgot to update these failures despite having run the torture tests. llvm-svn: 259320
*	[dsymutil] Fix FileCheck command.	Frederic Riss	2016-01-31	1	-1/+1
\| \| \| \| \| \|	Damn case-insensitive filesystem... llvm-svn: 259319
*	[dsymutil] Fix handling of common symbols.	Frederic Riss	2016-01-31	9	-16/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	llvm-dsymutil was misinterpreting the value of common symbols as their address when it actually contains their size. This didn't impact llvm-dsymutil's ability to link the debug information for common symbols because these are always found by name and not by address. Things could however go wrong when the size of a common object matched the object file address of another symbol. Depending on the link order of the symbols the common object might incorrectly evict this other object from the address to symbol mapping, and then link the evicted symbol with a wrong binary address. Use the new ability to have symbols without an object file address to fix this. llvm-svn: 259318
*	[dsymutil] Allow debug map mappings with no object file address. NFC	Frederic Riss	2016-01-31	4	-21/+31
\| \| \| \| \| \| \| \| \| \| \|	This change just changes the data structure that ties symbol names, object file address and linked binary addresses to accept mappings with no object file address. Such symbol mappings are not fed into the debug map yet, so this patch is NFC. A subsequent patch will make use of this functionality for common symbols. llvm-svn: 259317
*	[SelectionDAG] Eliminate exponential behavior in WalkChainUsers	Tim Shen	2016-01-31	1	-5/+20
\| \| \| \|	llvm-svn: 259315
*	No need to use utostr/utohexstr when writing into a raw_ostream. NFC	Craig Topper	2016-01-31	1	-24/+30
\| \| \| \|	llvm-svn: 259314
*	Shrink character buffer size in raw_ostream::write_hex to 16 characters ↵	Craig Topper	2016-01-31	1	-1/+1
\| \| \| \| \| \|	instead of 20 as that's the largest string a 64-bit hex value can be. llvm-svn: 259313
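The 16-character bound in r259313 follows from 64 bits divided by 4 bits per hex digit. A minimal sketch of that arithmetic, assuming a hypothetical `toHex` helper (this is not the actual `raw_ostream::write_hex` code):

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Hypothetical helper for illustration only; not LLVM's implementation.
// Formats a 64-bit value as lowercase hex by filling a buffer from the back.
std::string toHex(std::uint64_t N) {
  if (N == 0)
    return "0";
  // 64 bits / 4 bits per hex digit = at most 16 digits.
  char Buffer[16];
  char *End = Buffer + sizeof(Buffer);
  char *Cur = End;
  while (N != 0) {
    unsigned Digit = static_cast<unsigned>(N & 0xF);
    *--Cur = static_cast<char>(Digit < 10 ? '0' + Digit : 'a' + (Digit - 10));
    N >>= 4;
  }
  return std::string(Cur, End);
}
```

The largest input, with all 64 bits set, fills the buffer exactly, so no extra slack is needed.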
*	Use std::end instead of repeating buffer sizes.	Craig Topper	2016-01-31	2	-7/+7
\| \| \| \|	llvm-svn: 259312
*	Convert int to Twine instead of using utostr since it was already being ↵	Craig Topper	2016-01-31	1	-1/+1
\| \| \| \| \| \|	added to a Twine. NFC llvm-svn: 259308
*	[doc] improve the doc for CUDA	Jingyue Wu	2016-01-30	1	-17/+21
\| \| \| \| \| \| \| \|	1. Mentioned that CUDA support works best with trunk. 2. Simplified the example by removing its dependency on the CUDA samples. 3. Explained the --cuda-gpu-arch flag. llvm-svn: 259307
*	[WebAssembly] Fix uses of FrameIndex as store values	Derek Schuff	2016-01-30	3	-6/+17
\| \| \| \| \| \| \| \|	Previously the code assumed all uses of FI on loads and stores were as addresses. This checks whether the use is the address or a value and handles the latter case as it does for non-memory instructions. llvm-svn: 259306
*	WebAssembly: don't optimize frameindex store	JF Bastien	2016-01-30	3	-6/+24
\| \| \| \| \| \| \| \|	The previous code was incorrect (can't getReg a frameindex). We could instead optimize it to reduce tree height, but I'm not sure that's worthwhile yet because we then try to eliminate the frameindex. This patch also fixes frame index elimination for operations which may load or store: it used to assume the base was operand 2 and immediate offset operand 1. That's not true for stores, where they're 4 and 3. llvm-svn: 259305
*	WebAssembly NFC: fix build warning	JF Bastien	2016-01-30	1	-3/+3
\| \| \| \| \| \|	WebAssemblyFrameLowering.cpp:158:44: warning: enumeral and non-enumeral type in conditional expression [enabled by default] llvm-svn: 259303
*	[BasicAA] NFC - revised comment for function adjustToPointerSize()	Gerolf Hoflehner	2016-01-30	1	-1/+1
\| \| \| \|	llvm-svn: 259300
*	[BasicAA] Fix for missing must alias (D16343)	Gerolf Hoflehner	2016-01-30	2	-0/+27
\| \| \| \|	llvm-svn: 259299
*	[BasicAA] Update on r259290 - added missing cast	Gerolf Hoflehner	2016-01-30	1	-1/+1
\| \| \| \|	llvm-svn: 259298
*	AMDGPU: Fix emitting invalid workitem intrinsics for HSA	Matt Arsenault	2016-01-30	6	-37/+550
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The AMDGPUPromoteAlloca pass was emitting the read.local.size calls, which with HSA was incorrectly selected to reading from the offset mesa uses off of the kernarg pointer. Error on intrinsics which aren't supported by HSA, and start emitting the correct IR to read the workgroup size out of the dispatch pointer. Also initialize the pass so it can be tested with opt, and start moving towards not depending on the subtarget as an argument. Start emitting errors for the intrinsics not handled with HSA. llvm-svn: 259297
*	AMDGPU: Stop checking intrinsics not used by HSA for dispatch-ptr	Matt Arsenault	2016-01-30	3	-16/+175
\| \| \| \| \| \| \| \|	Only the dispatch.ptr intrinsic is supposed to be used now to get the workgroup size, and the read.local.size intrinsics do not work correctly. llvm-svn: 259296
*	InstCombine: fabs(x) * fabs(x) -> x * x	Matt Arsenault	2016-01-30	2	-4/+44
\| \| \| \|	llvm-svn: 259295
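The fold in r259295 is sound because `fabs` only clears the sign, and the product of two values with the same sign is never negative. A small sketch of the equivalence, using hypothetical helper names (this is plain C++, not the InstCombine code):

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helpers for illustration only.
// The form before the fold: both operands go through fabs.
double squareViaFabs(double X) { return std::fabs(X) * std::fabs(X); }

// The form after the fold: the fabs calls are dropped.
double squareDirect(double X) { return X * X; }
```

For any non-NaN input the two functions return the same value; NaN inputs yield NaN in both forms.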
*	[WebAssembly] Refine block placement to insert blocks between trees.	Dan Gohman	2016-01-30	2	-9/+26
\| \| \| \| \| \| \| \| \|	Refine the test for whether an instruction is in an expression tree so that it detects when one tree ends and another begins, so we can place a block at that point, rather than continuing to find the first instruction not in a tree at all. llvm-svn: 259294
*	AMDGPU: Add new amdgcn workitem intrinsics	Matt Arsenault	2016-01-30	6	-87/+189
\| \| \| \| \| \| \|	These use the correct prefix and follow the HSA naming convention rather than the config register option names. llvm-svn: 259293
*	Remove references to *.h.in files and some autoconf hackery	Justin Bogner	2016-01-30	4	-33/+3
\| \| \| \| \| \|	Missed this stuff in r259291. llvm-svn: 259292
*	Remove *.h.in - these were only used by the autoconf build system	Justin Bogner	2016-01-30	3	-765/+0
\| \| \| \|	llvm-svn: 259291
*	[BasicAA] NFC - utility function for two's complement wrap-around	Gerolf Hoflehner	2016-01-30	1	-7/+15
\| \| \| \|	llvm-svn: 259290
*	Further reduce test time	Xinliang David Li	2016-01-30	1	-6/+2
\| \| \| \|	llvm-svn: 259285
*	Avoid overly large SmallPtrSet/SmallSet	Matthias Braun	2016-01-30	22	-25/+25
\| \| \| \| \| \| \|	These sets perform linear searching in small mode so it is never a good idea to use SmallSize/N bigger than 32. llvm-svn: 259283
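To illustrate why a large inline size hurts, here is a rough sketch of a set that scans inline storage linearly while small and switches to a tree once it overflows. `TinySet` is a hypothetical class written for this note, not `llvm::SmallSet` or `llvm::SmallPtrSet`:

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <set>

// Hypothetical TinySet for illustration only; not the LLVM data structure.
template <typename T, std::size_t N> class TinySet {
  std::array<T, N> Inline{};
  std::size_t NumInline = 0;
  std::set<T> Big; // used only after the inline storage overflows

  bool isSmall() const { return Big.empty(); }

public:
  bool contains(const T &V) const {
    if (isSmall()) {
      // Small mode: every lookup is a linear scan, so the cost grows with N.
      for (std::size_t I = 0; I != NumInline; ++I)
        if (Inline[I] == V)
          return true;
      return false;
    }
    return Big.count(V) != 0;
  }

  // Returns true if V was newly inserted.
  bool insert(const T &V) {
    if (!isSmall())
      return Big.insert(V).second;
    if (contains(V))
      return false;
    if (NumInline < N) {
      Inline[NumInline++] = V;
      return true;
    }
    // Overflow: move everything into the tree-based set.
    Big.insert(Inline.begin(), Inline.begin() + NumInline);
    NumInline = 0;
    Big.insert(V);
    return true;
  }
};
```

With a small `N` the scan touches only a handful of elements; with a very large `N` each lookup in small mode can walk the whole inline array before the set ever switches representation.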
*	Use Support/DataTypes.h instead of cstdint	Matthias Braun	2016-01-30	1	-1/+1
\| \| \| \|	llvm-svn: 259282
*	[docs] Remove references to autotools build.	Alexey Samsonov	2016-01-30	5	-352/+5
\| \| \| \|	llvm-svn: 259280
*	[CUDA] Die if we ask the NVPTX backend to emit a global ctor/dtor.	Justin Lebar	2016-01-30	4	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \|	Summary: Previously we'd just silently skip these. Reviewers: tra, jholewinski. Subscribers: llvm-commits, jhen, echristo. Differential Revision: http://reviews.llvm.org/D16739 llvm-svn: 259279
*	[CodeView] Properly handle empty line tables	David Majnemer	2016-01-30	2	-7/+89
\| \| \| \| \| \| \|	Don't crash when there are no appropriate line table entries for a given function. llvm-svn: 259277
*	[Objective-C] Support a new special module flag.	Manman Ren	2016-01-29	1	-0/+1
\| \| \| \| \| \| \| \|	"Objective-C Class Properties" will be put into the objc_imageinfo struct. rdar://23891898 llvm-svn: 259270
*	[llvm-nm] Add a comment to explain why we initialize MC.	Davide Italiano	2016-01-29	1	-0/+1
\| \| \| \|	llvm-svn: 259266
*	[libFuzzer] add -timeout_exitcode option	Kostya Serebryany	2016-01-29	6	-1/+7
\| \| \| \|	llvm-svn: 259265
*	function names start with a lower case letter ; NFC	Sanjay Patel	2016-01-29	1	-25/+25
\| \| \| \|	llvm-svn: 259264
*	[libFuzzer] re-enable test for -abort_on_timeout=1, this time protecting ↵	Kostya Serebryany	2016-01-29	1	-1/+1
\| \| \| \| \| \|	from ASAN_OPTIONS set outside llvm-svn: 259263
*	fix formatting; NFC	Sanjay Patel	2016-01-29	1	-4/+8
\| \| \| \|	llvm-svn: 259262
*	Fix typo in LoopSimplifyCFG	Fiona Glaser	2016-01-29	1	-1/+1
\| \| \| \|	llvm-svn: 259261
*	[Profiling] Add a -sparse mode to llvm-profdata merge	Vedant Kumar	2016-01-29	7	-44/+125
\| \| \| \| \| \| \| \| \| \|	Add an option to llvm-profdata merge for writing out sparse indexed profiles. These profiles omit InstrProfRecords for functions which are never executed. Differential Revision: http://reviews.llvm.org/D16727 llvm-svn: 259258
*	Fix the MSVC build by moving static asserts into constructors	Reid Kleckner	2016-01-29	1	-5/+5
\| \| \| \| \| \| \|	Apparently MSVC won't allow you to ask for the sizeof() a data member at class scope. llvm-svn: 259257
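A sketch of the workaround described in r259257, assuming a hypothetical `PackedHeader` struct (the actual class touched by the commit is not shown in this log):

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical struct for illustration only.
struct PackedHeader {
  std::uint32_t Kind;
  std::uint32_t Size;

  // Per the commit message, MSVC rejected sizeof() of a data member in a
  // static_assert at class scope, so the checks live in the constructor body.
  PackedHeader() : Kind(0), Size(0) {
    static_assert(sizeof(Kind) == 4, "Kind must be 4 bytes");
    static_assert(sizeof(Kind) + sizeof(Size) == 8, "unexpected field sizes");
  }
};
```

The assertions are still evaluated at compile time; only their location changes.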
*	Add LoopSimplifyCFG pass	Fiona Glaser	2016-01-29	7	-0/+160
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Loop transformations can sometimes fail because the loop, while in valid rotated LCSSA form, is not in a canonical CFG form. This is an extremely simple pass that just merges obviously redundant blocks, which can be used to fix some known failure cases. In the future, it may be enhanced with more cases (and have code shared with SimplifyCFG). This allows us to run LoopSimplifyCFG -> LoopRotate -> LoopUnroll, so that SimplifyCFG cleans up the loop before Rotate tries to run. Not currently used in the pass manager, since this pass doesn't do anything unless you can hook it up in an LPM with other loop passes. It'll be added once Chandler cleans up things to allow this. Tested in a custom pipeline out of tree to confirm it works in practice (in addition to the included trivial test). llvm-svn: 259256
*	Need #include <cstdint> for uint64_t	Matthias Braun	2016-01-29	1	-1/+2
\| \| \| \|	llvm-svn: 259255
*	Need #include <climits> for CHAR_BIT	Matthias Braun	2016-01-29	1	-0/+1
\| \| \| \|	llvm-svn: 259254
*	Improve test speed/trial 2	Xinliang David Li	2016-01-29	1	-14/+12
\| \| \| \|	llvm-svn: 259253
*	AttributeSetImpl: Summarize existing function attributes in a bitset.	Matthias Braun	2016-01-29	4	-2/+40
\| \| \| \| \| \| \| \| \| \| \|	The majority of attribute queries check for the existence of an enum attribute in the FunctionIndex slot. We only have 48 of those and can therefore summarize them in a uint64_t bitset, which measurably improves compile time. Differential Revision: http://reviews.llvm.org/D16618 llvm-svn: 259252
*	AttributeSetNode: Summarize existing attributes in a bitset.	Matthias Braun	2016-01-29	2	-12/+20
\| \| \| \| \| \| \| \| \| \| \|	The majority of queries just check for the existence of an enum attribute. We only have 48 of those and can summarize them in a uint64_t bitfield so we can avoid searching the list. This improves "opt" compile time by 1-4% in my measurements. Differential Revision: http://reviews.llvm.org/D16617 llvm-svn: 259251
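The idea behind r259251 and r259252 can be sketched as follows: keep the attribute list, but mirror the presence of each enum attribute in one bit of a 64-bit word so that an existence query is a shift and a mask. `AttrKind` and `AttrList` are hypothetical names chosen for this sketch, not the LLVM classes:

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical attribute kinds for illustration only.
enum class AttrKind : unsigned { NoInline, NoReturn, ReadOnly, NumKinds };
static_assert(static_cast<unsigned>(AttrKind::NumKinds) <= 64,
              "all kinds must fit in one 64-bit summary word");

// Hypothetical container: a list of attributes plus a presence bitset.
class AttrList {
  std::vector<AttrKind> Attrs;
  std::uint64_t Present = 0; // bit K is set when kind K is in Attrs

public:
  void add(AttrKind K) {
    if (has(K))
      return;
    Attrs.push_back(K);
    Present |= std::uint64_t(1) << static_cast<unsigned>(K);
  }

  // Existence check without searching the list.
  bool has(AttrKind K) const {
    return ((Present >> static_cast<unsigned>(K)) & 1) != 0;
  }

  std::size_t size() const { return Attrs.size(); }
};
```

This works only while the number of enum kinds stays at or below 64, which is why the sketch guards it with a `static_assert`; the commit messages cite 48 kinds.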
*	Revert 259242, 259243 -- irrelevant changes pulled in	Xinliang David Li	2016-01-29	1	-51/+13
\| \| \| \|	llvm-svn: 259244
*	Use range for loop	Xinliang David Li	2016-01-29	1	-7/+5
\| \| \| \|	llvm-svn: 259243
*	Improve test speed (interchange loop, reducing padding)	Xinliang David Li	2016-01-29	1	-13/+53
\| \| \| \|	llvm-svn: 259242
*	Annotate dump() methods with LLVM_DUMP_METHOD, addressing Richard Smith ↵	Yaron Keren	2016-01-29	51	-72/+71
\| \| \| \| \| \| \| \|	r259192 post commit comment. clang part in r259232, this is the LLVM part of the patch. llvm-svn: 259240
*	[InstCombine] avoid an insertelement transformation that induces the ↵	Sanjay Patel	2016-01-29	2	-1/+47
\| \| \| \| \| \| \| \| \| \| \|	opposite extractelement fold (PR26354) We would infinite loop because we created a shufflevector that was wider than needed and then failed to combine that with the insertelement. When subsequently visiting the extractelement from that shuffle, we see that it's unnecessary, delete it, and trigger another visit to the insertelement. llvm-svn: 259236