
Allow py_function to support functions that return RaggedTensor #26453

Closed
woodshop opened this issue Mar 7, 2019 · 4 comments
Assignees
Labels
comp:ops OPs related issues type:feature Feature requests

Comments

@woodshop
woodshop commented Mar 7, 2019

System information

  • TensorFlow version (you are using): 1.13.1
  • Are you willing to contribute it (Yes/No): No

Describe the feature and the current behavior/state.
py_function only supports functions that return Tensors. However, the wrapped function is executed in eager mode and should therefore ideally support other return types consistent with eager mode. As of tf-1.13.1, py_function attempts to convert the returned objects to Tensors, but a RaggedTensor cannot be converted directly to a Tensor, so any attempt to return one raises an exception during the attempted conversion. E.g.,

import tensorflow as tf  # tf-1.13.1

with tf.Graph().as_default():
    elements = [[1., 2., 3.], [4., 5.]]
    ragged1 = tf.ragged.constant(elements)
    def py_func():
        return tf.ragged.constant(elements)
    
    ragged2 = tf.py_function(
        py_func, [], tf.dtypes.float32
    )
    with tf.Session() as sess:
        print(sess.run(ragged1))
        print(sess.run(ragged2))

>>> <tf.RaggedTensorValue [[1.0, 2.0, 3.0], [4.0, 5.0]]>
>>> ...
>>> ValueError: TypeError: object of type 'RaggedTensor' has no len()

I propose that py_function detect which output arguments, if any, are RaggedTensors and return them without attempting to convert them to Tensors. If the proposal is rejected, I suggest that the documentation be updated to make it clearer (either in the API docs or the guides) that a RaggedTensor is not a suitable return type for functions wrapped by py_function.

Will this change the current API? How? No.

Who will benefit from this feature? Anyone who uses RaggedTensor in conjunction with py_function.

Any other info.
As a workaround, one can construct a RaggedTensor from the output of py_function. E.g.,

with tf.Graph().as_default():
    elements = [[1., 2., 3.], [4., 5.]]
    ragged1 = tf.ragged.constant(elements)
    def py_func():
        # Flatten all rows into one list and record each row's length,
        # so the RaggedTensor can be rebuilt from these two dense outputs.
        lengths = [len(element) for element in elements]
        return sum(elements, []), lengths
    
    concatenated, lengths = tf.py_function(
        py_func, [], [tf.dtypes.float32, tf.dtypes.int64]
    )
    ragged2 = tf.RaggedTensor.from_row_lengths(concatenated, lengths)
    with tf.Session() as sess:
        print(sess.run(ragged1))
        print(sess.run(ragged2))

>>> <tf.RaggedTensorValue [[1.0, 2.0, 3.0], [4.0, 5.0]]>
>>> <tf.RaggedTensorValue [[1.0, 2.0, 3.0], [4.0, 5.0]]>
@ymodak ymodak added comp:ops OPs related issues type:feature Feature requests labels Mar 7, 2019
@ymodak ymodak added the stat:awaiting tensorflower Status - Awaiting response from tensorflower label Mar 7, 2019
@edloper
Contributor
edloper commented Feb 18, 2021

In #27679 (comment), I showed how py_function could be extended to handle composite tensor inputs and outputs (and other nested structures, like dicts, tuples, lists, etc). If you have bandwidth to work on a PR that adds that to TensorFlow (with tests etc.), then it would be very welcome; otherwise, you could just use the new_py_function that I defined there, which wraps tf.py_function.

@tensorflowbutler tensorflowbutler removed the stat:awaiting tensorflower Status - Awaiting response from tensorflower label Feb 21, 2021
@JXRiver
Contributor
JXRiver commented Nov 29, 2021

The issue should have been fixed by @edloper. The following example should work.

import tensorflow

# TF1
tf = tensorflow.compat.v1
with tf.Graph().as_default():
    elements = [[1., 2., 3.], [4., 5.]]
    ragged1 = tf.ragged.constant(elements)
    def py_func():
        return tf.ragged.constant(elements)
    
    ragged2 = tf.py_function(
        py_func, [], Tout=tf.RaggedTensorSpec([2, None], tf.float32)
    )
    with tf.Session() as sess:
        print(sess.run(ragged1))  # <tf.RaggedTensorValue [[1.0, 2.0, 3.0], [4.0, 5.0]]>
        print(sess.run(ragged2))  # <tf.RaggedTensorValue [[1.0, 2.0, 3.0], [4.0, 5.0]]>

# TF2
tf = tensorflow.compat.v2
def py_func():
  return tf.ragged.constant([[1., 2., 3.], [4., 5.]])

print(tf.py_function(py_func, [], Tout=tf.RaggedTensorSpec([2, None], tf.float32)))  # <tf.RaggedTensor [[1.0, 2.0, 3.0], [4.0, 5.0]]>

Note that the main difference between this example and the one in the original issue is the Tout argument of tf.py_function. When tf.py_function returns a CompositeTensor, the Tout argument should be an instance of a tf.TypeSpec subclass, i.e., tf.RaggedTensorSpec for a RaggedTensor.

See https://www.tensorflow.org/api_docs/python/tf/py_function for more information on tf.py_function and https://www.tensorflow.org/api_docs/python/tf/TypeSpec for more information on tf.TypeSpec.
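This pattern is perhaps most useful inside input pipelines, where each element produces a variable number of values. A minimal sketch (assumes TF >= 2.7; the `repeat` helper and shapes are illustrative, not from the original thread):

```python
import tensorflow as tf

def repeat(x):
    # Runs eagerly inside py_function: return x copies of the value x,
    # so different elements yield rows of different lengths.
    n = int(x)
    return tf.ragged.constant([[float(x)] * n])  # shape [1, None]

ds = tf.data.Dataset.from_tensor_slices([1, 2, 3])
# Tout is a RaggedTensorSpec, so map() yields RaggedTensors.
ds = ds.map(lambda x: tf.py_function(
    repeat, [x], Tout=tf.RaggedTensorSpec([1, None], tf.float32)))
for rt in ds:
    print(rt)
```

Here each mapped element is a RaggedTensor whose single row has length 1, 2, and 3 respectively.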

@JXRiver JXRiver closed this as completed Nov 29, 2021
@HuangChiEn

> The issue should have been fixed by @edloper. The following example should work. […]

Thanks for your brief tutorial, but the given code does not work if I convert it into a lambda function.

[screenshots of the lambda variant and its traceback]

Looking forward to further improvements in TF.

@edloper
Contributor
edloper commented Dec 6, 2021

@HuangChiEn Are you using TensorFlow 2.7? Support for using composite tensors (such as RaggedTensor) with py_function was added with 2.7, so if you're using an earlier version of TensorFlow, then it won't work. I tried executing your code as written:

tmp = lambda _: tf.ragged.constant([[1., 2., 3.], [4., 5.]])
tf.py_function(tmp, [], Tout=tf.RaggedTensorSpec([2, None], tf.float32))

And it failed with "<lambda>() missing 1 required positional argument: '_'" (which is expected, since your lambda takes one argument, but you didn't supply any arguments when you called tf.py_function). If I change the lambda to not expect any arguments:

tmp = lambda: tf.ragged.constant([[1., 2., 3.], [4., 5.]])
tf.py_function(tmp, [], Tout=tf.RaggedTensorSpec([2, None], tf.float32))

Then it succeeds for me (in TF 2.7).
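If the lambda genuinely needs an argument, it should be supplied through the inp list instead. A small sketch along those lines (the scaling factor is illustrative; assumes TF >= 2.7):

```python
import tensorflow as tf

# The lambda's argument comes from inp, not from the enclosing scope.
tmp = lambda n: tf.ragged.constant([[1., 2., 3.], [4., 5.]]) * n
rt = tf.py_function(tmp, inp=[2.0],
                    Tout=tf.RaggedTensorSpec([2, None], tf.float32))
print(rt)  # each value doubled: [[2.0, 4.0, 6.0], [8.0, 10.0]]
```

Elementwise ops such as multiplication by a scalar dispatch to RaggedTensor, so the scaled result keeps its ragged structure.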
