OpenMP/OpenACC support for tensorflow #12434

abidmalikwaterloo · 2017-08-20T20:04:07Z

Is there any effort in porting the Tensorflow using OpenMP or OpenACC?

yaroslavvb · 2017-08-22T22:30:04Z

@andydavis1 and @benoitsteiner looked at it a while back. There were two issues

TensorFlow already has its own set of primitives for parallelizing work-loads
It was hard to make it build: compiling with -fopenmp made some seemingly unrelated parts of TensorFlow work incorrectly

ali01 · 2017-08-28T04:18:57Z

Thank you, Yaroslav. Closing for now.

masoodmortazavi · 2017-09-15T20:56:02Z

In the exploration of this issue, was attention paid to using Eigen in multi-threaded applications?

Please see the note on calling Eigen::initParallel() before creating application threads, in the following:
https://eigen.tuxfamily.org/dox/TopicMultiThreading.html

alexkreidler · 2017-10-27T01:03:23Z

@yaroslavvb Could OpenACC be used to optimize parallel tasks like matmuls and convolutions (IDK if you use a library) on a wider range of accelerators (like AMD GPUs), that the CUDA kernels and CPU-oriented code are not targeting? I think that this could complement the existing Tensorflow primitives for parallelization.

…d test. Imported from GitHub PR openxla/xla#12434 Copybara import of the project: -- 723c9bb29adfcc33c015b74f90ce8024c2f79255 by Ilia Sergachev <isergachev@nvidia.com>: [GPU] Fix OSS compilation problems in a previously disabled test. Merging this change closes #12434 FUTURE_COPYBARA_INTEGRATE_REVIEW=openxla/xla#12434 from openxla:fix_triton_test 723c9bb29adfcc33c015b74f90ce8024c2f79255 PiperOrigin-RevId: 633509548

…d test. Imported from GitHub PR openxla/xla#12434 Copybara import of the project: -- 723c9bb29adfcc33c015b74f90ce8024c2f79255 by Ilia Sergachev <isergachev@nvidia.com>: [GPU] Fix OSS compilation problems in a previously disabled test. Merging this change closes #12434 PiperOrigin-RevId: 633533757

ali01 closed this as completed Aug 28, 2017

ali01 added type:feature Feature requests and removed type:feature Feature requests labels Aug 28, 2017

copybara-service bot mentioned this issue May 14, 2024

PR #12434: [GPU] Fix OSS compilation problems in a previously disabled test. #67543

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpenMP/OpenACC support for tensorflow #12434

OpenMP/OpenACC support for tensorflow #12434

OpenMP/OpenACC support for tensorflow #12434

OpenMP/OpenACC support for tensorflow #12434

Comments