tensorflow为什么解码jpeg图像与scipy imread不同？

tf.image.decode_jpeg()对于jpg图像，Tensorflow的功能与scipy.misc.imread()不同。虽然图像看起来相似，但像素值不同。tensorflow为什么解码jpeg图像与scipy imread不同？

import numpy as np 
import scipy 
import tensorflow as tf 
import matplotlib.pyplot as plt 
def minimal_example(): 
    def _bytes_feature(value): 
     return tf.train.Feature(bytes_list=tf.train.BytesList(value=[value])) 

    tffilename = 'astronaut.tfrecords' 
    writer = tf.python_io.TFRecordWriter(tffilename) 
    #image_source = 'https://upload.wikimedia.org/wikipedia/commons/8/88/Astronaut-EVA.jpg' 
    image_path = 'astronaut.jpg' 
    image_file = open(image_path,'rb') 
    image = image_file.read() 
    image_scipy = scipy.misc.imread(image_path) 
    example = tf.train.Example(features=tf.train.Features(feature={'image':_bytes_feature(image)})) 
    writer.write(example.SerializeToString()) 
    writer.close() 

    record_iterator = tf.python_io.tf_record_iterator(path=tffilename) 
    example = tf.train.Example() 
    example.ParseFromString(next(record_iterator)) 
    image = example.features.feature['image'].bytes_list.value[0] 
    image_tf = tf.image.decode_jpeg(image).eval(session=tf.Session()) 
    fig = plt.figure() 
    ax1 = fig.add_subplot(121) 
    ax2 = fig.add_subplot(122) 
    ax1.imshow(image_scipy) 
    ax2.imshow(image_tf) 
    print('Reconstruction Error', np.sum(np.abs(image_tf - image_scipy))) 
    plt.show()

结果：

Reconstruction Error 3420883624

这是一个错误还是我做错了什么？

来源

2017-07-19 Mancento

的差异产生的，因为不准确的，但使用Tensorflow

快，默认离散余弦变换分析按照Source code

//为JPEG解码的TensorFlow-选择缺省是IFAST，牺牲

//图像质量的速度。

flags_.dct_method = JDCT_IFAST;

为了得到准确的解码可以例如在设置属性dct_method = 'INTEGER_ACCURATE'如示于下

def minimal_example(): 
    #image_source = 'https://upload.wikimedia.org/wikipedia/commons/8/88/Astronaut-EVA.jpg' 
    image_path = 'astronaut.jpg' 
    image_file = open(image_path,'rb') 
    image_raw = image_file.read() 
    image_scipy = scipy.misc.imread(image_path) 
    image_tf = tf.image.decode_jpeg(image_raw).eval(session=tf.Session()) 
    image_tf_accurate = tf.image.decode_jpeg(image_raw,dct_method="INTEGER_ACCURATE").eval(session=tf.Session()) 
    print('Error For Default: ', np.sum(np.abs(image_tf - image_scipy))) 
    print('Error For Accurate: ', np.sum(np.abs(image_tf_accurate - image_scipy))) 
    #Error For Default: 3420883624 
    #Error For Accurate: 0

来源

2017-07-20 18:08:42 Mancento

The JPEG standard does not require bit-to-bit identical decoding。所以，不同实现之间会有一些变化。

但是，它仍然需要

最大1个针对每个像素分量差值的比特。

所以两个输出不应该分开一个。对？

print('max diff: ', np.max(np.abs(image_tf.astype(float) - image_scipy.astype(float)))) 
# max diff: 17.0

哎哟，至少一个实现不遵循标准...

来源

2017-07-19 17:38:33 user1735003

第一次听到。很有意思！ scipy似乎使用libjpeg-dev（通过PIL/Pillow），而tensorflow可能使用[libjpeg-turbo]（https://github.com/tensorflow/tensorflow/issues/4807）。 – sascha

tensorflow为什么解码jpeg图像与scipy imread不同？

回答

相关问题