
Tensorflow and onednn logs #61322

Open
akote123 opened this issue Jul 19, 2023 · 11 comments
Assignees
Labels
comp:mkl MKL related issues stat:awaiting tensorflower Status - Awaiting response from tensorflower type:support Support issues

Comments

@akote123

Hi,
I am trying to analyse the call flow between TensorFlow and oneDNN. I set the environment variables as follows:
export ONEDNN_VERBOSE=1
export TF_CPP_MAX_VLOG_LEVEL=1
export OMP_NUM_THREADS=1

I am collecting the logs and trying to map the _Mkl ops to the oneDNN primitives, but the oneDNN and MKL entries are interleaved out of order in the log: after 10 MKL calls I see 20 oneDNN calls.

Are there any flags that need to be set to get logs with the correct mapping, or do we need to map them manually?
filtered_intel_log.txt
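If the mapping does have to be done manually, a first step is to pull the oneDNN entries out of the mixed log. Each `onednn_verbose` line is a comma-separated record whose fields include the call type (`exec`/`create`), engine, primitive kind, and the execution time in ms as the last field; the exact field positions vary across oneDNN versions, so the indices below are an assumption (roughly the v2.x layout), and the sample lines are illustrative rather than taken from the attached log. A minimal sketch:

```python
from collections import Counter

def parse_onednn_verbose(lines):
    """Collect (primitive_kind, time_ms) pairs from oneDNN verbose output.

    Assumes the approximate oneDNN v2.x layout:
      onednn_verbose,exec,<engine>,<primitive>,<impl>,...,<time_ms>
    Field positions differ across oneDNN versions, so check your log first.
    """
    records = []
    for line in lines:
        if not line.startswith("onednn_verbose,"):
            continue  # skip interleaved TensorFlow log lines
        fields = line.strip().split(",")
        if len(fields) < 5 or fields[1] != "exec":
            continue  # keep only primitive executions, not creations
        kind = fields[3]            # e.g. convolution, matmul, reorder
        time_ms = float(fields[-1])  # execution time is the last field
        records.append((kind, time_ms))
    return records

# Illustrative lines in the assumed format (not from the real log):
sample = [
    "tensorflow/core/common_runtime/executor.cc:123 _MklConv2D",
    "onednn_verbose,exec,cpu,convolution,jit:avx2,forward_inference,"
    "src_f32,,alg:convolution_direct,mb1_ic3oc64,1.52",
    "onednn_verbose,exec,cpu,reorder,jit:uni,undef,src_f32 dst_f32,,,2x3,0.01",
]
recs = parse_onednn_verbose(sample)
print(Counter(kind for kind, _ in recs))  # executions per primitive kind
```

Counting executions per primitive kind this way gives a rough cross-check against the number of _Mkl op log lines, even when the two streams are interleaved out of order.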

@SuryanarayanaY SuryanarayanaY added the comp:mkl MKL related issues label Jul 19, 2023
@SuryanarayanaY SuryanarayanaY added the type:support Support issues label Jul 19, 2023
@sachinprasadhs sachinprasadhs added the stat:awaiting tensorflower Status - Awaiting response from tensorflower label Jul 19, 2023
@huiyan2021

Hi @akote123, could you share the original verbose log? Thanks!

@akote123

Hi @huiyan2021, I have uploaded here vit_intel_log.zip

@huiyan2021

I guess the reason is that the _Mkl ops are logged via `<<`, whose output is fully buffered when redirected to a file, while oneDNN verbose flushes stdout immediately after every line: https://github.com/search?q=repo%3Aoneapi-src%2FoneDNN+fflush&type=code
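This buffering effect can be reproduced with plain file I/O, independent of TensorFlow (the writer names and log strings below are only illustrative): two writers append to the same file in strictly alternating logical order, yet every line from the flushing writer lands in the file before any line from the buffered one.

```python
import os
import tempfile

# Two handles to the same log file, mimicking stdout redirected to a file:
fd, path = tempfile.mkstemp()
os.close(fd)
buffered = open(path, "a")  # like '<<' logging: block-buffered (default 8 KiB)
flushing = open(path, "a")  # like oneDNN verbose: flushed after every line

# The true execution order alternates between the two writers:
for i in range(3):
    buffered.write(f"_MklConv2D {i}\n")           # sits in the buffer
    flushing.write(f"onednn_verbose,exec {i}\n")  # reaches the file at once
    flushing.flush()

buffered.close()  # the buffered lines reach the file only now, at the end
flushing.close()

with open(path) as f:
    log = f.read().splitlines()
os.remove(path)

# All onednn_verbose lines appear before any _Mkl line, even though the
# writes alternated -- the same reordering seen in the mixed TF/oneDNN log.
print(log)
```

So the interleaving in the captured log reflects buffer flush times, not execution order, which is why the positions of the two kinds of entries cannot be trusted for mapping.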

@akote123

@huiyan2021, one more observation: some ops go through common_runtime/eager/execute.cc and some through common_runtime/executor.cc. I am not able to understand why there are two execution paths.

@huiyan2021

Hi @akote123,

From the log I can see you are using XLA, so ops that cannot be JIT-compiled go through common_runtime/eager/execute.cc, while the rest go through common_runtime/executor.cc.

Also, I suggest you use the trace viewer to trace executions.

@akote123

@huiyan2021, is there a location in the TensorFlow source code where the check happens for whether an op goes to common_runtime/eager/execute.cc or the other path?

@huiyan2021

@akote123, you can refer to this article: https://whatdhack.medium.com/tensorflow-graph-graphdef-grappler-xla-mlir-llvm-etc-615191e96ebc — see the XLA Flow part and the call stack.

@akote123
akote123 commented Oct 31, 2023

@huiyan2021, in TensorFlow can a single model go through both XLA and oneDNN, or will it use either XLA or oneDNN only?

@huiyan2021

Both. There may be different scenarios:

  1. Some parts of the model go in XLA path, some parts go in oneDNN path.
  2. Intel recently submitted a pilot PR to accelerate XLA’s Dot op with oneDNN.

You can refer to this RFC: https://docs.google.com/document/d/1ZzMcrjxITJeN2IjjgbzUjHh-4W1YgDUus3j25Dvn9ng/edit

@akote123
akote123 commented Oct 31, 2023

@huiyan2021, thank you for the pointers, I will go through them.
Actually, for pretrained models, how can we enable XLA, both for inference and for transfer learning?

@huiyan2021

Same as for training; you can refer to https://www.tensorflow.org/xla#enable_xla_for_tensorflow_models
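Concretely, the page above documents two ways to turn XLA on, and both apply unchanged to a pretrained model loaded for inference or fine-tuning; a sketch (the script and function names below are hypothetical):

```shell
# Enable XLA auto-clustering for a whole script via the documented TF flag:
TF_XLA_FLAGS=--tf_xla_auto_jit=2 python my_inference_script.py

# Or, inside Python, compile an individual function explicitly:
#   @tf.function(jit_compile=True)
#   def serve(images):
#       return model(images)
```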
