
Add integer data types to IsTrainable for use with custom gradients #25386

Open
jvmncs opened this issue Jan 31, 2019 · 9 comments
Labels: comp:ops (OPs related issues), stat:awaiting response (Status - Awaiting response from author), type:feature (Feature requests)

jvmncs commented Jan 31, 2019

System information

  • TensorFlow version (you are using): 1.12.0
  • Are you willing to contribute it (Yes/No): Yes

Describe the feature and the current behavior/state.
Currently, there are a limited number of DTypes usable with autograd (see IsTrainable and _IsBackpropagatable below).

def IsTrainable(tensor_or_dtype):
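
Roughly, that check is a dtype whitelist; a simplified sketch (not the exact source, which may whitelist slightly different dtypes) of the behavior:

from tensorflow.python.framework import dtypes

def is_trainable_sketch(tensor_or_dtype):
    # Simplified sketch: gradients only flow for floating-point and
    # complex dtypes; integer dtypes fall through, so their gradients
    # come back as None.
    dtype = getattr(tensor_or_dtype, "dtype", tensor_or_dtype)
    dtype = dtypes.as_dtype(dtype).base_dtype
    return dtype in (dtypes.bfloat16, dtypes.float16, dtypes.float32,
                     dtypes.float64, dtypes.complex64, dtypes.complex128)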

In particular, nodes with integer tensors automatically generate gradients of None, which breaks any call to tf.gradients. This is a reasonable assumption for standard machine learning practice; however, it prevents users from building their own autograd systems with tf.custom_gradient. I'd contend that, as a general-purpose automatic differentiation library, TF should enable experimenting with such uses of its autograd system. This would be particularly useful when performing automatic differentiation over rings or finite fields, which are often represented in computers as sets of consecutive integers (e.g. Z_p).
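
To illustrate (a minimal sketch in the TF 1.x API; identity_with_grad is a hypothetical example function), even an explicitly defined custom gradient is discarded for integer inputs:

import tensorflow as tf

@tf.custom_gradient
def identity_with_grad(x):
    def grad(dy):
        return dy  # an explicit, well-defined gradient...
    return x, grad

x = tf.constant([[1, 2], [3, 4]], dtype=tf.int32)
y = identity_with_grad(x)
# ...is discarded anyway, because int32 fails the IsTrainable check:
print(tf.gradients([y], [x]))  # [None]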

Will this change the current API? How?
This change shouldn't affect any standard usage of TensorFlow -- all autograd on floats will be unchanged. The only change will occur when calling autograd on graphs that compute integer arithmetic and the like.

Currently, performing tf.gradients on graphs with integer tensors will raise an unhandled error:

import numpy as np
import tensorflow as tf

# Two int32 variables; their sum z is an integer tensor.
x_back = np.ones([2, 2])
y_back = np.ones([2, 2])
x = tf.Variable(x_back, dtype=tf.int32)
y = tf.Variable(y_back, dtype=tf.int32)
z = x + y

# int32 is not trainable, so this returns [None, None]...
vg = tf.gradients([z], [x, y])
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    # ...and fetching None is what raises the TypeError below.
    out = sess.run(vg)
    print(out)

Returns:

Traceback (most recent call last):
  File "/Users/jasonmancuso/dropout/research/customgrad/issue_min.py", line 16, in <module>
    out = sess.run(vg)
  File "/Users/jasonmancuso/anaconda/envs/tf-encrypted/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 929, in run
    run_metadata_ptr)
  File "/Users/jasonmancuso/anaconda/envs/tf-encrypted/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 1137, in _run
    self._graph, fetches, feed_dict_tensor, feed_handles=feed_handles)
  File "/Users/jasonmancuso/anaconda/envs/tf-encrypted/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 471, in __init__
    self._fetch_mapper = _FetchMapper.for_fetch(fetches)
  File "/Users/jasonmancuso/anaconda/envs/tf-encrypted/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 261, in for_fetch
    return _ListFetchMapper(fetch)
  File "/Users/jasonmancuso/anaconda/envs/tf-encrypted/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 370, in __init__
    self._mappers = [_FetchMapper.for_fetch(fetch) for fetch in fetches]
  File "/Users/jasonmancuso/anaconda/envs/tf-encrypted/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 370, in <listcomp>
    self._mappers = [_FetchMapper.for_fetch(fetch) for fetch in fetches]
  File "/Users/jasonmancuso/anaconda/envs/tf-encrypted/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 258, in for_fetch
    type(fetch)))
TypeError: Fetch argument None has invalid type <class 'NoneType'>

See #783 (comment) for other cases in which Ops can generate a None gradient.

After implementing this feature, we'd return the following:

[array([[1, 1], [1, 1]], dtype=int32),
 array([[1, 1], [1, 1]], dtype=int32)]

Who will benefit from this feature?
Anyone who wants to perform automatic differentiation on integer data types, as would be the case when operating in integer rings or finite fields.
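
For example, arithmetic in Z_p is naturally expressed with integer tensors plus modular reduction; a hypothetical sketch (P is an illustrative modulus):

import tensorflow as tf

P = 2 ** 31 - 1  # example modulus (a Mersenne prime), purely illustrative

x = tf.constant([[1, 2], [3, 4]], dtype=tf.int64)
y = tf.constant([[5, 6], [7, 8]], dtype=tf.int64)

# Field operations in Z_p: ordinary integer arithmetic followed by
# reduction mod p. Differentiating through graphs like this is the goal.
add_mod = (x + y) % P
mul_mod = (x * y) % P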

Any Other info.
This request is motivated by the tf-encrypted project.

@ymodak ymodak added comp:ops OPs related issues type:feature Feature requests labels Jan 31, 2019
skye (Member) commented Feb 1, 2019

This is a cool use case, but unfortunately I think it is too large a project to take on at this time. I think this would be a difficult feature to fully implement, because we'd have to make sure that every integer operation has a proper gradient function defined. Gradient functions are currently written with the assumption that we don't need to handle integers (e.g. we always return None for the indices param of a gather op). The result is that, without careful auditing and/or testing, many gradients() calls over integers would silently return None or even the wrong answer. This is a big enough feature that we'd probably want close collaboration with someone on the TF team.

If there's enough demand in the future it might be worth the effort, but for now I don't think we can properly deliver on this.

cc @alextp @ebrevdo @martinwicke -- maybe I'm missing something that makes this more tractable?

alextp (Contributor) commented Feb 1, 2019

I agree with @skye. It might be easier for you to bypass TF's gradient code entirely if you want to go this route, since our ops have gradients that don't behave well at all with integers.

mortendahl commented

@skye, to understand the problem better (and out of curiosity), would you mind pointing to a place where this assumption about returning None is made?

skye (Member) commented Feb 4, 2019

Here's one I happened to run into recently: https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/ops/array_grad.py#L410
Note that we always return None for the indices grad of gather. This is a niche case, but if you were attempting to take the gradient w.r.t. a variable that determines the indices, this grad function would silently return the wrong answer (None corresponds to a zero gradient, more or less). There are likely more; part of the difficulty is combing through all the gradient functions and finding them :)
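
A quick way to see this behavior (TF 1.x graph mode, as in the original report):

import tensorflow as tf

params = tf.constant([10.0, 20.0, 30.0])
indices = tf.constant([0, 2])
gathered = tf.gather(params, indices)

# The gradient w.r.t. params is defined (an IndexedSlices), but the
# registered gradient function hard-codes None for the indices input.
print(tf.gradients([gathered], [params, indices]))
# -> [<IndexedSlices ...>, None]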

shaunster0 commented

I have this need for integer tensors and their gradients too... It's disappointing that nothing seems to be happening with this; I think it will restrict future innovation in some interesting areas.

@rmothukuru rmothukuru added the stat:contribution welcome Status - Contributions welcome label May 26, 2021
@mohantym mohantym self-assigned this Jul 27, 2022
mohantym (Contributor) commented

Hi @jvmncs!

This is no longer throwing any error in version 2.8. Gist attached for reference.

Thank you!

@mohantym mohantym added stat:awaiting response Status - Awaiting response from author and removed stat:contribution welcome Status - Contributions welcome labels Jul 27, 2022
jvmncs (Author) commented Jul 27, 2022

Hi @mohantym, thanks for the response. It's nice to see that the error is no longer being thrown, but unfortunately I don't think that suffices to consider this issue solved. In particular, the check that forces a jump out of the backprop logic seems to be the same, just moved to a new location here. Thus it's still not possible for integer-typed tensors to have non-None gradients, even when defined explicitly with a tf.custom_gradient.
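
For instance (a minimal TF 2.x sketch; identity_with_grad is a hypothetical helper), the tape still returns None for an integer input even with an explicit custom gradient:

import tensorflow as tf

@tf.custom_gradient
def identity_with_grad(x):
    def grad(dy):
        return dy
    return x, grad

x = tf.constant([[1, 2], [3, 4]], dtype=tf.int32)
with tf.GradientTape() as tape:
    # watching an integer tensor may also log a warning, depending on version
    tape.watch(x)
    y = identity_with_grad(x)
print(tape.gradient(y, x))  # None, despite the explicit custom gradient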

FWIW, I think my understanding of TF's goals and aims has evolved. I would not be surprised if this feature were considered a non-goal for the project; it's a relatively niche request. Anyway, I think JAX might be a somewhat better fit for something so experimental.

@google-ml-butler google-ml-butler bot removed the stat:awaiting response Status - Awaiting response from author label Jul 27, 2022
mohantym (Contributor) commented Jul 28, 2022

@jvmncs! I was just checking for an update from your side. Thanks for the reply.

@mohantym mohantym assigned gadagashwini and unassigned mohantym Jul 28, 2022
@gadagashwini gadagashwini removed their assignment Aug 16, 2022
tilakrayal (Contributor) commented

Hi,

Thank you for opening this issue. Since this issue has been open for a long time, the code/debug information for this issue may no longer be relevant to the current state of the code base.

The TensorFlow team is constantly improving the framework by fixing bugs and adding new features. We suggest you try the latest TensorFlow version with the latest compatible hardware configuration, which could potentially resolve the issue. If you are still facing the issue, please create a new GitHub issue with your latest findings, along with all the debugging information that could help us investigate.

Please follow the release notes to stay up to date with the latest developments happening in the TensorFlow space.

@tilakrayal tilakrayal added the stat:awaiting response Status - Awaiting response from author label Jun 26, 2024