diff options
| author | Alexey Bataev <a.bataev@hotmail.com> | 2019-01-07 21:30:43 +0000 |
|---|---|---|
| committer | Alexey Bataev <a.bataev@hotmail.com> | 2019-01-07 21:30:43 +0000 |
| commit | 7bb3353f6ab94326a79e2e5bdb8f77675666ba84 (patch) | |
| tree | 586477c94cf6e77e108bf548dbd58f8669e724d5 /clang/test/OpenMP/teams_distribute_parallel_for_codegen.cpp | |
| parent | 347b0804bce25a3b5deb28d4e43eb1a68796325b (diff) | |
| download | bcm5719-llvm-7bb3353f6ab94326a79e2e5bdb8f77675666ba84.tar.gz bcm5719-llvm-7bb3353f6ab94326a79e2e5bdb8f77675666ba84.zip | |
[OPENMP]Add call to __kmpc_push_target_tripcount() function.
Each we create the target regions with the teams distribute inner
region, we can better estimate number of the teams required to execute
the target region. Function __kmpc_push_target_tripcount() is used for
purpose, which accepts device_id and the number of the iterations,
performed by the associated loop.
llvm-svn: 350571
Diffstat (limited to 'clang/test/OpenMP/teams_distribute_parallel_for_codegen.cpp')
| -rw-r--r-- | clang/test/OpenMP/teams_distribute_parallel_for_codegen.cpp | 4 |
1 files changed, 4 insertions, 0 deletions
diff --git a/clang/test/OpenMP/teams_distribute_parallel_for_codegen.cpp b/clang/test/OpenMP/teams_distribute_parallel_for_codegen.cpp index 865ea129f97..fa425b3ecab 100644 --- a/clang/test/OpenMP/teams_distribute_parallel_for_codegen.cpp +++ b/clang/test/OpenMP/teams_distribute_parallel_for_codegen.cpp @@ -28,8 +28,12 @@ int teams_argument_global(int n){ // CK1: alloca i32, // CK1: [[TE:%.+]] = alloca i32, // CK1: [[TH:%.+]] = alloca i32, + // CK1: alloca i32, + // CK1: alloca i32, + // CK1: alloca i32, // CK1: [[TE_CAST:%.+]] = alloca i{{32|64}}, // CK1: [[TH_CAST:%.+]] = alloca i{{32|64}}, + // CK1: call void @__kmpc_push_target_tripcount(i64 -1, i64 %{{.+}}) // CK1: [[TE_PAR:%.+]] = load{{.+}}, {{.+}} [[TE_CAST]], // CK1: [[TH_PAR:%.+]] = load{{.+}}, {{.+}} [[TH_CAST]], // CK1: call i32 @__tgt_target_teams(i64 -1, i8* @{{[^,]+}}, i32 4, i8** %{{[^,]+}}, i8** %{{[^,]+}}, i{{64|32}}* {{.+}}@{{[^,]+}}, i32 0, i32 0), i64* {{.+}}@{{[^,]+}}, i32 0, i32 0), i32 {{.+}}, i32 {{.+}}) |

