bcm5719-llvm - Project Ortega BCM5719 LLVM

	Commit message (Collapse)	Author	Age	Files	Lines
*	Fixing memory leak	Andrew Kaylor	2015-05-12	1	-0/+2
\| \| \| \|	llvm-svn: 237072
*	Refactoring gc_relocate related code in CodeGenPrepare.cpp	Sanjoy Das	2015-05-11	1	-7/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: The original code inserted new instructions by following a Create->Remove->ReInsert flow. This patch removes the unnecessary Remove->ReInsert part by setting up the InsertPoint correctly at the very beginning. This change does not introduce any functionality change. Patch by Chen Li! Reviewers: reames, AndyAyers, sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9687 llvm-svn: 237070
*	Rename variables in gc_relocate related functions to follow LLVM's naming ↵	Sanjoy Das	2015-05-11	2	-47/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	conventions. Summary: This patch is to rename some variables to CamelCase in gc_relocate related functions. There is no functionality change. Patch by Chen Li! Reviewers: reames, AndyAyers, sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9681 llvm-svn: 237069
*	[lib/Fuzzer] don't record traces when trace collection is off	Kostya Serebryany	2015-05-11	1	-1/+2
\| \| \| \|	llvm-svn: 237067
*	[MemCpyOpt] Look at any dependency -not just source- for memset+memcpy.	Ahmed Bougacha	2015-05-11	1	-6/+7
\| \| \| \| \| \| \| \| \| \|	This fixes another miscompile introduced by r235232: when there was a dependency on the memcpy destination other than the memset, we would ignore it, because we only looked at the source dependency. It was a mistake to use SrcDepInfo. Instead, just use DepInfo. llvm-svn: 237066
*	Simplify a return expression and an access to an alloca's allocated type	David Blaikie	2015-05-11	1	-3/+3
\| \| \| \|	llvm-svn: 237065
*	[WinEH] Handle nested landing pads that return directly to the parent function.	Andrew Kaylor	2015-05-11	1	-10/+104
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D9684 llvm-svn: 237063
*	Readdress r236990, use of static members on a non-static variable.	David Blaikie	2015-05-11	3	-46/+35
\| \| \| \| \| \| \| \| \| \| \| \|	The TargetRegistry is just a namespace-like class, instantiated in one place to use a range-based for loop. Instead, expose access to the registry via a range-based 'targets()' function instead. This makes most uses a bit awkward/more verbose - but eventually we should just add a range-based find_if function which will streamline these functions. I'm happy to mkae them a bit awkward in the interim as encouragement to improve the algorithms in time. llvm-svn: 237059
*	Fix tablegen's PrintFatalError function to run registered file	James Y Knight	2015-05-11	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	cleanups. Also, change code in tablegen which printed a message and then called "exit(1)" to use PrintFatalError, instead. This fixes instances where an empty output file was left behind after a failed tablegen invocation, which would confuse subsequent ninja runs into not attempting to rebuild. Differential Revision: http://reviews.llvm.org/D9608 llvm-svn: 237058
*	[lib/Fuzzer] when running multiple fuzzing processes, print something every ↵	Kostya Serebryany	2015-05-11	1	-2/+14
\| \| \| \| \| \|	10 minutes to avoid buildbot timeouts llvm-svn: 237054
*	[lib/Fuzzer] rename FuzzerDFSan.cpp to FuzzerTraceState.cpp; update ↵	Kostya Serebryany	2015-05-11	4	-44/+54
\| \| \| \| \| \|	comments. NFC expected llvm-svn: 237050
*	propagate IR-level fast-math-flags to DAG nodes; 2nd try; NFC	Sanjay Patel	2015-05-11	4	-52/+56
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a less ambitious version of: http://reviews.llvm.org/rL236546 because that was reverted in: http://reviews.llvm.org/rL236600 because it caused memory corruption that wasn't related to FMF but was actually due to making nodes with 2 operands derive from a plain SDNode rather than a BinarySDNode. This patch adds the minimum plumbing necessary to use IR-level fast-math-flags (FMF) in the backend without actually using them for anything yet. This is a follow-on to: http://reviews.llvm.org/rL235997 ...which split the existing nsw / nuw / exact flags and FMF into their own struct. llvm-svn: 237046
*	[LoopIdiomRecognize] Transform backedge-taken count check into an assertion.	Davide Italiano	2015-05-11	1	-1/+3
\| \| \| \| \| \| \| \| \|	runOnCountable() allowed the caller to call on a loop without a predictable backedge-taken count. Change the code so that only loops with computable backdge-count can call this function, in order to catch abuses. llvm-svn: 237044
*	[lib/Fuzzer] add a trace-based mutatation logic. Same idea as with ↵	Kostya Serebryany	2015-05-11	5	-12/+68
\| \| \| \| \| \|	DFSan-based mutator, but instead of relying on taint tracking, try to find the data directly in the input. More (logic and comments) to go. llvm-svn: 237043
*	Fixing build warnings	Andrew Kaylor	2015-05-11	1	-2/+2
\| \| \| \|	llvm-svn: 237042
*	[WinEH] Update exception numbering to give handlers their own base state.	Andrew Kaylor	2015-05-11	2	-16/+87
\| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D9512 llvm-svn: 237014
*	[RewriteStatepointsForGC] Fix a bug on creating gc_relocate for pointer to ↵	Sanjoy Das	2015-05-11	4	-14/+66
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	vector of pointers Summary: In RewriteStatepointsForGC pass, we create a gc_relocate intrinsic for each relocated pointer, and the gc_relocate has the same type with the pointer. During the creation of gc_relocate intrinsic, llvm requires to mangle its type. However, llvm does not support mangling of all possible types. RewriteStatepointsForGC will hit an assertion failure when it tries to create a gc_relocate for pointer to vector of pointers because mangling for vector of pointers is not supported. This patch changes the way RewriteStatepointsForGC pass creates gc_relocate. For each relocated pointer, we erase the type of pointers and create an unified gc_relocate of type i8 addrspace(1)*. Then a bitcast is inserted to convert the gc_relocate to the correct type. In this way, gc_relocate does not need to deal with different types of pointers and the unsupported type mangling is no longer a problem. This change would also ease further merge when LLVM erases types of pointers and introduces an unified pointer type. Some minor changes are also introduced to gc_relocate related part in InstCombineCalls, CodeGenPrepare, and Verifier accordingly. Patch by Chen Li! Reviewers: reames, AndyAyers, sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9592 llvm-svn: 237009
*	LiveRangeCalc: Improve error messages on malformed IR	Matthias Braun	2015-05-11	1	-1/+7
\| \| \| \|	llvm-svn: 237008
*	[X86] Updates to X86 backend for f16 promotion	Pirama Arumuga Nainar	2015-05-11	1	-0/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: r235215 adds support for f16 to be considered as a load/store type and promote f16 operations to f32. This patch has miscellaneous fixes for the X86 backend so all f16 operations are handled: 1. Set loadextaction for f16 vectors to expand. 2. Handle FP_EXTEND in a switch statement when handling v2f32 3. Do not fold (FP_TO_SINT (load f16)) into FP_TO_INT*_IN_MEM or (store (SINT_TO_FP )) to a FILD. Tests included. Reviewers: ab, srhines, delena Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9092 llvm-svn: 237004
*	Rip min/max pattern matching out of InstCombine and into	James Molloy	2015-05-11	3	-99/+85
\| \| \| \| \| \| \| \| \| \| \|	ValueTracking. This matching functionality is useful in more than just InstCombine, so make it available in ValueTracking. NFC. llvm-svn: 236998
*	Silencing an MSVC warning: '<<' : result of 32-bit shift implicitly ↵	Aaron Ballman	2015-05-11	1	-1/+1
\| \| \| \| \| \|	converted to 64 bits (was 64-bit shift intended?); NFC llvm-svn: 236987
*	AVX-512: Changed CC parameter in "cmp" intrinsic	Elena Demikhovsky	2015-05-11	1	-89/+0
\| \| \| \| \| \| \| \|	from i8 to i32 according to the Intel Spec by Igor Breger (igor.breger@intel.com) llvm-svn: 236979
*	[InstCombine/PowerPC] Fix single-precision QPX load/store replacement	Hal Finkel	2015-05-11	1	-5/+10
\| \| \| \| \| \| \| \| \| \|	The QPX single-precision load/store intrinsics have implied truncation/extension from/to the declared value type of <4 x double> to the memory type of <4 x float>. When we can prove the alignment of the pointer argument, and thus replace the intrinsic with a regular load or store, we need to load or store the correct data type (<4 x float>) instead of (<4 x double>). llvm-svn: 236973
*	Fixed compilation warning, NFC.	Elena Demikhovsky	2015-05-11	1	-0/+2
\| \| \| \|	llvm-svn: 236972
*	AVX-512: Added SKX instructions and intrinsics:	Elena Demikhovsky	2015-05-11	4	-70/+101
\| \| \| \| \| \| \| \|	{add/sub/mul/div/} x {ps/pd} x {128/256} 2. max/min with sae By Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 236971
*	[InstCombine] Canonicalize single element array store	David Majnemer	2015-05-11	1	-0/+9
\| \| \| \| \| \| \| \|	Use the element type instead of the aggregate type. Differential Revision: http://reviews.llvm.org/D9591 llvm-svn: 236969
*	[InstCombine] Canonicalize single element array load	David Majnemer	2015-05-11	1	-0/+10
\| \| \| \| \| \| \| \|	Use the element type instead of the aggregate type. Differential Revision: http://reviews.llvm.org/D9596 llvm-svn: 236968
*	AVX-512: fixed UINT_TO_FP operation for 512-bit types.	Elena Demikhovsky	2015-05-10	1	-0/+7
\| \| \| \|	llvm-svn: 236955
*	[SelectionDAG] Fixed constant folding issue when legalised types are smaller ↵	Simon Pilgrim	2015-05-10	1	-2/+3
\| \| \| \| \| \| \| \|	then the folded type. Found when testing with llvm-stress on i686 targets. llvm-svn: 236954
*	SanitizerCoverage: Use `createSanitizerCtor` to create ctor and call init	Ismail Pazarbasi	2015-05-10	1	-20/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Second attempt; instead of using a named local variable, passing arguments directly to `createSanitizerCtorAndInitFunctions` worked on Windows. Reviewers: kcc, samsonov Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8780 llvm-svn: 236951
*	AVX-512: fixed a bug in i1 vectors lowering	Elena Demikhovsky	2015-05-10	2	-2/+12
\| \| \| \|	llvm-svn: 236947
*	SystemZ: silence a GCC warning	Saleem Abdulrasool	2015-05-10	1	-2/+2
\| \| \| \| \| \| \| \|	warning: enumeral and non-enumeral type in conditional expression Cast the 0 to the appropriate type. NFC. Identified by GCC 4.9.2 llvm-svn: 236942
*	Fix MergeConsecutiveStore for non-byte-sized memory accesses.	James Y Knight	2015-05-09	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The bug showed up as a compile-time assertion failure: Assertion `NumBits >= MIN_INT_BITS && "bitwidth too small"' failed when building msan tests on x86-64. Prior to r236850, this bug was masked due to a bogus alignment check, which also accidentally rejected non-byte-sized accesses. Afterwards, an invalid ElementSizeBytes == 0 got further into the function, and triggered the assertion failure. It would probably be a good idea to allow it to handle merging stores of unusual widths as well, but for now, to un-break it, I'm just making the minimal fix. Differential Revision: http://reviews.llvm.org/D9626 llvm-svn: 236927
*	MachineCSE: Add a target query for the LookAheadLimit heurisitic	Tom Stellard	2015-05-09	2	-2/+5
\| \| \| \| \| \| \| \| \|	This is used to determine whether or not to CSE physical register defs. Differential Revision: http://reviews.llvm.org/D9472 llvm-svn: 236923
*	[Fast-ISel] Don't mark the first use of a remat constant as killed.	Pete Cooper	2015-05-09	1	-4/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When emitting something like 'add x, 1000' if we remat the 1000 then we should be able to mark the vreg containing 1000 as killed. Given that we go bottom up in fast-isel, a later use of 1000 will be higher up in the BB and won't kill it, or be impacted by the lower kill. However, rematerialised constant expressions aren't generated bottom up. The local value save area grows downwards. This means that if you remat 2 constant expressions which both use 1000 then the first will kill it, then the second, which is lower in the BB will read a killed register. This is the case in the attached test where the 2 GEPs both need to generate 'add x, 6680' for the constant offset. Note that this commit only makes kill flag generation conservative. There's nothing else obviously wrong with the local value save area growing downwards, and in fact it needs to for handling arbitrarily complex constant expressions. However, it would be nice if there was a solution which would let us generate more accurate kill flags, or just kill flags completely. llvm-svn: 236922
*	Fix compile error	Arnold Schwaighofer	2015-05-09	1	-1/+1
\| \| \| \|	llvm-svn: 236921
*	Revert r236912.	Quentin Colombet	2015-05-09	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Author: dblaikie Date: Fri May 8 17:47:50 2015 New Revision: 236912 URL: http://llvm.org/viewvc/llvm-project?rev=236912&view=rev Log: [opaque pointer type] Cleanup a few references to pointee types using nearby non-pointee types of the same value & cleanup a convoluted return expression while I'm here llvm-svn: 236919
*	[Target/ARM] Remove unused 'private' from class.	Davide Italiano	2015-05-08	1	-2/+0
\| \| \| \| \| \| \|	Differential Revision: http://reviews.llvm.org/D9611 Reviewed by: rengolin llvm-svn: 236918
*	ScheduleDAGInstrs: In functions with tail calls PseudoSourceValues are not ↵	Arnold Schwaighofer	2015-05-08	6	-3/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	non-aliasing distinct objects The code that builds the dependence graph assumes that two PseudoSourceValues don't alias. In a tail calling function two FixedStackObjects might refer to the same location. Worse 'immutable' fixed stack objects like function arguments are not immutable and will be clobbered. Change this so that a load from a FixedStackObject is not invariant in a tail calling function and don't return a PseudoSourceValue for an instruction in tail calling functions when building the dependence graph so that we handle function arguments conservatively. Fix for PR23459. rdar://20740035 llvm-svn: 236916
*	[opaque pointer type] Cleanup a few references to pointee types using nearby ↵	David Blaikie	2015-05-08	1	-4/+4
\| \| \| \| \| \| \| \|	non-pointee types of the same value & cleanup a convoluted return expression while I'm here llvm-svn: 236912
*	[lib/Fuzzer] build tests that work well with dfsan also w/o dfsan	Kostya Serebryany	2015-05-08	5	-10/+12
\| \| \| \|	llvm-svn: 236909
*	[lib/Fuzzer] use -fsanitize-coverage=trace-cmp when building LLVM with ↵	Kostya Serebryany	2015-05-08	6	-10/+68
\| \| \| \| \| \|	LLVM_USE_SANITIZE_COVERAGE; in lib/Fuzzer try to reload the corpus to pick up new units from other processes llvm-svn: 236906
*	Switch lowering: cluster adjacent fall-through cases even at -O0	Hans Wennborg	2015-05-08	1	-3/+5
\| \| \| \| \| \| \|	It's cheap to do, and codegen is much faster if cases can be merged into clusters. llvm-svn: 236905
*	TargetParser: FPU/ARCH/EXT parsing refactory - NFC	Renato Golin	2015-05-08	12	-294/+220
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This new class in a global context contain arch-specific knowledge in order to provide LLVM libraries, tools and projects with the ability to understand the architectures. For now, only FPU, ARCH and ARCH extensions on ARM are supported. Current behaviour it to parse from free-text to enum values and back, so that all users can share the same parser and codes. This simplifies a lot both the ASM/Obj streamers in the back-end (where this came from), and the front-end parsers for command line arguments (where this is going to be used next). The previous implementation, using .def/.h includes is deprecated due to its inflexibility to be built without the backend support and for being too cumbersome. As more architectures join this scheme, and as more features of such architectures are added (such as hardware features, type sizes, etc) into a full blown TargetDescription class, having a set of classes is the most sane implementation. The ultimate goal of this refactor both LLVM's and Clang's target description classes into one unique interface, so that we can de-duplicate and standardise the descriptions, as well as make it available for other front-ends, tools, etc. The FPU parsing for command line options in Clang has been converted to use this new library and a number of aliases were added for compatibility: * A bogus neon-vfpv3 alias (neon defaults to vfp3) * armv5/v6 * {fp4/fp5}-{sp/dp}-d16 Next steps: * Port Clang's ARCH/EXT parsing to use this library. * Create a TableGen back-end to generate this information. * Run this TableGen process regardless of which back-ends are built. * Expose more information and rename it to TargetDescription. * Continue re-factoring Clang to use as much of it as possible. llvm-svn: 236900
*	[Fast-ISel] Clear kill flags on registers replaced by updateValueMap.	Pete Cooper	2015-05-08	1	-0/+7
\| \| \| \| \| \| \| \| \| \|	When selecting an extract instruction, we don't actually generate code but instead work out which register we are reading, and rewrite uses of the extract def to the source register. This is done via updateValueMap,. However, its possible that the source register we are rewriting to to also have uses. If those uses are after a kill of the value we are rewriting from then we have uses after a kill and the verifier fails. This code checks for the case where the to register is also used, and if so it clears all kill on the from register. This is conservative, but better that always clearing kills on the from register. llvm-svn: 236897
*	[Hexagon] Generate more hardware loops	Brendon Cahoon	2015-05-08	1	-133/+206
\| \| \| \| \| \| \| \| \|	Refactored parts of the hardware loop pass to generate more. Also, added more tests. Differential Revision: http://reviews.llvm.org/D9568 llvm-svn: 236896
*	[BasicAA] Fix zext & sext handling	Sanjoy Das	2015-05-08	1	-60/+199
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Summary: There are several unhandled edge cases in BasicAA's GetLinearExpression method. This changes fixes outstanding issues, including zext / sext of a constant with the sign bit set, and the refusal to decompose zexts or sexts of wrapping arithmetic. Test Plan: Unit tests added in //q.ext.ll//. Patch by Nick White. Reviewers: hfinkel, sanjoy Reviewed By: hfinkel, sanjoy Subscribers: sanjoy, llvm-commits, hfinkel Differential Revision: http://reviews.llvm.org/D6682 llvm-svn: 236894
*	Replace branch-to-unreachable with assertion.	David Blaikie	2015-05-08	1	-4/+2
\| \| \| \|	llvm-svn: 236893
*	[X86] Fast-ISel was incorrectly always killing the source of a truncate.	Pete Cooper	2015-05-08	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \|	A trunc from i32 to i1 on x86_64 generates an instruction such as %vreg19<def> = COPY %vreg9:sub_8bit<kill>; GR8:%vreg19 GR32:%vreg9 However, the copy here should only have the kill flag on the 32-bit path, not the 64-bit one. Otherwise, we are killing the source of the truncate which could be used later in the program. llvm-svn: 236890
*	Extend the statepoint intrinsic to allow statepoints to be marked as ↵	Pat Gavlin	2015-05-08	6	-19/+177
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	transitions from GC-aware code to code that is not GC-aware. This changes the shape of the statepoint intrinsic from: @llvm.experimental.gc.statepoint(anyptr target, i32 # call args, i32 unused, ...call args, i32 # deopt args, ...deopt args, ...gc args) to: @llvm.experimental.gc.statepoint(anyptr target, i32 # call args, i32 flags, ...call args, i32 # transition args, ...transition args, i32 # deopt args, ...deopt args, ...gc args) This extension offers the backend the opportunity to insert (somewhat) arbitrary code to manage the transition from GC-aware code to code that is not GC-aware and back. In order to support the injection of transition code, this extension wraps the STATEPOINT ISD node generated by the usual lowering lowering with two additional nodes: GC_TRANSITION_START and GC_TRANSITION_END. The transition arguments that were passed passed to the intrinsic (if any) are lowered and provided as operands to these nodes and may be used by the backend during code generation. Eventually, the lowering of the GC_TRANSITION_{START,END} nodes should be informed by the GC strategy in use for the function containing the intrinsic call; for now, these nodes are instead replaced with no-ops. Differential Revision: http://reviews.llvm.org/D9501 llvm-svn: 236888