
GPFlow-2.0 - issue with default_float and likelihood variance #1244

Closed
daragallagher opened this issue Feb 3, 2020 · 5 comments

@daragallagher

I am new to GP and GPFlow so excuse me if this is a silly question.

I am trying to use gpflow (2.0rc) with float64 and had been struggling to get even simple examples to work. I configure gpflow using:

gpflow.config.set_default_float(np.float64)

I am using GPR:

# Model construction:
k = gpflow.kernels.Matern52(variance=1.0, lengthscale=0.3)
m = gpflow.models.GPR((X, Y), kernel=k)
m.likelihood.variance = 0.01

And indeed if I print a summary, both parameters have dtype float64. However if I try to predict with this model, I get an error.

tensorflow.python.framework.errors_impl.InvalidArgumentError: cannot compute AddV2 as input #1(zero-based) was expected to be a double tensor but is a float tensor [Op:AddV2] name: add/

A debugging session led me to the following line in gpr.py (line 88):

s = tf.linalg.diag(tf.fill([num_data], self.likelihood.variance))

This creates a matrix with dtype float32 which causes the blow-up as described above.

Perhaps it should be something along the lines of:

s = tf.linalg.diag(tf.fill([num_data], tf.cast(self.likelihood.variance, gpflow.default_float())))

I'm not sure - I'm not a tensorflow expert either :)

Even if this is because I'm not using the API correctly, it's quite confusing (I think) that some parameters given as plain Python floats are handled correctly, while setting the likelihood variance this way causes an exception.

My environment:

  • GPFlow 2.0.0rc from commit f49e110
  • Tensorflow 2.1.0
  • GPU enabled
  • Python version 3.7.6
  • Windows10

Here's a full Python script:

import numpy as np
import gpflow

gpflow.config.set_default_float(np.float64)

# data:
X = np.random.rand(10, 1)
Y = np.sin(X)

assert X.dtype == np.float64
assert Y.dtype == np.float64

# Model construction:
k = gpflow.kernels.Matern52(variance=1.0, lengthscale=0.3)
m = gpflow.models.GPR((X, Y), kernel=k)
m.likelihood.variance = 0.01

gpflow.utilities.print_summary(m)

# Predict
xx = np.array([[1.0]])
assert xx.dtype == np.float64

mean, var = m.predict_y(xx)
print(f'mean: {mean}')
print(f'var: {var}')
@daragallagher daragallagher changed the title GPFlow-2.0 - issue with default_float and mean variance GPFlow-2.0 - issue with default_float and likelihood variance Feb 3, 2020
@jameshensman
Member

hi @daragallagher

Thanks for your well written question! I think I might have a very simple answer: you may have been bitten by a GPflow 2.0 gotcha.

The solution is to replace m.likelihood.variance = 0.01 with m.likelihood.variance.assign(0.01), then your example works great :)

The behaviour of gpflow's parameters has changed with version 2.0 to match that of tensorflow Variables. @awav , could you point @daragallagher to the gpflow2.0 gotchas notebook, please?
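For anyone wondering why plain assignment breaks things, here is a minimal pure-Python sketch of the gotcha (using a hypothetical `Parameter` class for illustration, not GPflow's actual implementation): `=` rebinds the attribute to a raw Python float, discarding the dtype-aware wrapper, while `.assign()` updates the value in place and preserves the dtype.

```python
import numpy as np

class Parameter:
    """Minimal stand-in for a dtype-aware parameter (illustrative only)."""
    def __init__(self, value, dtype=np.float64):
        self.dtype = dtype
        self.value = dtype(value)

    def assign(self, value):
        # In-place update: the wrapper and its dtype survive.
        self.value = self.dtype(value)
        return self

class Likelihood:
    def __init__(self):
        self.variance = Parameter(1.0)

lik = Likelihood()
lik.variance = 0.01                 # rebinds the attribute to a plain float
print(type(lik.variance))           # <class 'float'> -- dtype information lost

lik = Likelihood()
lik.variance.assign(0.01)           # updates in place
print(type(lik.variance).__name__)  # Parameter
print(lik.variance.value.dtype)     # float64
```

With the real GPflow `Parameter`, the lost wrapper means downstream code like `tf.fill` sees a bare Python float and falls back to float32, producing the dtype mismatch above.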

@awav
Member
awav commented Feb 3, 2020

@jameshensman, yes, that's the case.
@daragallagher, https://github.com/GPflow/GPflow/blob/develop/doc/source/notebooks/intro_to_gpflow2.ipynb.

Could you repost this question on StackOverflow with the gpflow tag? Thanks.

@daragallagher
Author

Thanks for the quick response. I thought it might be something like that.

@awav I had read (quickly) through that intro notebook but missed the assign recommendation. Nearly all of the sample code that appears in google search results is understandably for gpflow 1.x so I think this gotcha may catch quite a few noobs like myself. It's probably not practical to support the older style?

Anyway, in case it helps others, I've posted this issue as a SO question here: https://stackoverflow.com/questions/60055919/gpflow-2-0-issue-with-default-float-and-likelihood-variance

@daragallagher
Author

@awav I also see now that my suggestion (that gpflow cast to the default float for the sake of API consistency) would not really help: the issue originates in tensorflow itself and has spread to modules built on top of it, like tensorflow-probability. So there is little that can be done in gpflow to address the consistency issues this causes. I see you've already raised this as a tensorflow issue here: tensorflow/tensorflow#26033

Here is an example of the sort of inconsistency that has bothered me, taken from one of the example notebooks:

adaptive_hmc = tfp.mcmc.SimpleStepSizeAdaptation(
    hmc, 
    num_adaptation_steps=10, 
    target_accept_prob=tf.cast(0.75, dtype=tf.float64),
    adaptation_rate=0.1
)

If using float64, one argument, adaptation_rate, does not require a cast, while the other, target_accept_prob, requires one.

This makes the API quite user-hostile (IMHO). Either the user must defensively wrap every float argument in a cast, or use a debugger to locate the source of a float argument that eventually raises an "expected to be a double tensor but is a float tensor" exception.

If tensorflow were to provide a default float setting, then this inconsistency would not arise. Even my initial example would have worked fine as tf.fill would have defaulted to float64.
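As a rough illustration of what such a setting could look like (a hypothetical numpy-based helper, not a real tensorflow API), a module-level default plus a single cast-on-the-way-in function would remove the need for per-argument `tf.cast` calls:

```python
import numpy as np

# Hypothetical global default float, mimicking what gpflow.default_float()
# layers on top of TensorFlow.
_DEFAULT_FLOAT = np.float64

def set_default_float(dtype):
    """Configure the float dtype used for all incoming scalar arguments."""
    global _DEFAULT_FLOAT
    _DEFAULT_FLOAT = dtype

def as_default_float(x):
    """Cast a Python scalar or array to the configured default float."""
    return np.asarray(x, dtype=_DEFAULT_FLOAT)

set_default_float(np.float64)

# Every float argument now arrives with a consistent dtype, so mixed
# float32/float64 arithmetic (and the AddV2 error above) cannot occur:
target_accept_prob = as_default_float(0.75)
adaptation_rate = as_default_float(0.1)
print((target_accept_prob + adaptation_rate).dtype)  # float64
```

The point of the sketch is the design choice: if the library casts once at the API boundary, users never need to know which parameters happen to default to float32 internally.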

I think many users of tensorflow are focused on deep learning and so never look beyond float32 which I suspect is the reason the issue hasn't been prioritized. This is unfortunate for libraries that build on tensorflow where the numeric properties of float64s make more sense.

So I'll add a "me too" comment to tensorflow issue 26033 and close this one.

@st--
Member
st-- commented Feb 4, 2020

@daragallagher thank you for such a well-written issue report!
