[go: nahoru, domu]

Skip to content

Commit

Permalink
Fix usage of cl_intel_required_subgroup_size for non-intel driver
Browse files Browse the repository at this point in the history
PiperOrigin-RevId: 640434035
  • Loading branch information
tensorflower-gardener committed Jun 5, 2024
1 parent e6971ed commit 025efe7
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tensorflow/lite/delegates/gpu/common/tasks/conv_generic.cc
Original file line number Diff line number Diff line change
Expand Up @@ -488,7 +488,7 @@ std::string ConvGeneric::GenerateConv(const GpuInfo& gpu_info,
std::to_string(work_group_size_.y) + ", " +
std::to_string(work_group_size_.z) + ")))\n";
}
if (use_simd_broadcast && gpu_info.IsIntel() && gpu_info.IsApiOpenCl() &&
if (use_simd_broadcast && gpu_info.IsApiOpenCl() &&
gpu_info.SupportsExtension("cl_intel_required_subgroup_size")) {
c += "__attribute__((intel_reqd_sub_group_size(" +
std::to_string(simd_size) + ")))\n";
Expand Down

0 comments on commit 025efe7

Please sign in to comment.