
Python interface for loading super-resolution dataset #18

Open · wants to merge 1 commit into master

Conversation

AnimatedRNG

This custom PyTorch Dataset loads images from a folder, performs several image operations upon them, and then returns patches to the caller. To avoid expensive disk reads/writes, all of the images are cached in memory in compressed form, and then decompressed and cached as the iterator iterates over them.
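
A minimal sketch of the caching scheme described above might look like this (illustrative only; the class name, parameters, and details are assumptions, not the PR's actual code):

```python
import glob
import cv2
import numpy as np
import torch
from torch.utils.data import Dataset

class CompressedImageDataset(Dataset):
    """Caches PNG-encoded bytes in RAM; decodes (and memoizes) on access."""

    def __init__(self, folder: str, patch_size: int = 128):
        self.patch_size = patch_size
        # Cache every image as compressed PNG bytes to keep memory usage low.
        self.encoded = [cv2.imencode('.png', cv2.imread(p))[1]
                        for p in sorted(glob.glob(folder + '/*.png'))]
        self.decoded = {}  # index -> decoded ndarray, filled lazily

    def __len__(self):
        return len(self.encoded)

    def __getitem__(self, idx):
        # Decompress on first access, then reuse the decoded image.
        if idx not in self.decoded:
            self.decoded[idx] = cv2.imdecode(self.encoded[idx],
                                             cv2.IMREAD_COLOR)
        img = self.decoded[idx]
        # Take a random patch and return it as a CHW float tensor in [0, 1].
        y = np.random.randint(0, img.shape[0] - self.patch_size + 1)
        x = np.random.randint(0, img.shape[1] - self.patch_size + 1)
        patch = img[y:y + self.patch_size, x:x + self.patch_size]
        return torch.from_numpy(patch).permute(2, 0, 1).float() / 255.0
```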

@AnimatedRNG AnimatedRNG changed the title Python interface for loading super-resolution Dataset Python interface for loading super-resolution dataset Jan 1, 2019
@twtygqyy (Owner) commented Jan 2, 2019

Thanks for the PR @AnimatedRNG
I will take a look.

@AnimatedRNG (Author)

@twtygqyy Awesome. I just pushed a few minor tweaks and fixes (changed the cache size, disabled shuffle). I mostly wrote this because my desktop doesn't have enough memory to run the Matlab script to create the dataset. I haven't actually trained using this method, but I'm running it right now and hopefully I'll get something similar to your pretrained model.

To perform bicubic interpolation directly on tensors, you need the nightly build of PyTorch.
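
For reference, a minimal sketch of bicubic resampling directly on tensors via torch.nn.functional.interpolate (the mode='bicubic' option is what required a nightly build at the time; shapes here are illustrative):

```python
import torch
import torch.nn.functional as F

# hr: a batch of high-resolution patches, shape (N, C, H, W), float in [0, 1]
hr = torch.rand(4, 3, 128, 128)

# Bicubic downsample by 4x directly on the tensor; align_corners=False
# matches the usual OpenCV/PIL pixel-center convention.
lr = F.interpolate(hr, scale_factor=0.25, mode='bicubic', align_corners=False)

# Bicubic values can overshoot slightly, so clamp back into range.
lr = lr.clamp(0.0, 1.0)
```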

I'm also using OpenCV for image augmentation and color conversions. I think you can probably replace some of these operations with numpy/PyTorch equivalents if needed.

Also, I just realized that this model doesn't seem to operate in a linear colorspace -- maybe once I get things working I'll make the dataset also convert the images from/to sRGB.
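
For illustration, the standard sRGB transfer functions that such a conversion would use (textbook IEC 61966-2-1 formulas, not code from this PR):

```python
import torch

def srgb_to_linear(x: torch.Tensor) -> torch.Tensor:
    """Convert sRGB values in [0, 1] to linear light."""
    return torch.where(x <= 0.04045, x / 12.92, ((x + 0.055) / 1.055) ** 2.4)

def linear_to_srgb(x: torch.Tensor) -> torch.Tensor:
    """Convert linear-light values in [0, 1] back to sRGB."""
    return torch.where(x <= 0.0031308, x * 12.92,
                       1.055 * x ** (1.0 / 2.4) - 0.055)
```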

@AnimatedRNG (Author)

The image augmentation now uses PyTorch's API rather than OpenCV. torch.flip is unreasonably slow, but otherwise I'm noticing a substantial performance improvement over the old version.
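
A minimal sketch of tensor-side augmentation in that style (random flips and 90-degree rotations; illustrative, not the PR's exact code):

```python
import random
import torch

def augment(patch: torch.Tensor) -> torch.Tensor:
    """Randomly flip/rotate a (C, H, W) patch for data augmentation."""
    if random.random() < 0.5:
        patch = torch.flip(patch, dims=[2])           # horizontal flip
    if random.random() < 0.5:
        patch = torch.flip(patch, dims=[1])           # vertical flip
    if random.random() < 0.5:
        patch = torch.rot90(patch, k=1, dims=[1, 2])  # 90-degree rotation
    return patch
```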

@AnimatedRNG (Author)

On a different note, I finished training on DIV2K using this method and I was unable to replicate your results. I haven't looked much at eval.py, so maybe I'm wrong, but I'm somewhat confused by the PSNR scores: the model trained for 1 epoch did better (22.923043055308984) than the one trained for 100 epochs (22.68411311135478), despite a clear improvement in image quality in the latter case. Perhaps you should also print out the SSIM?
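
For reference, a minimal sketch of computing both metrics (SSIM via scikit-image; the channel_axis argument assumes a recent version of the library):

```python
import numpy as np
from skimage.metrics import structural_similarity

def psnr(ref: np.ndarray, out: np.ndarray, max_val: float = 255.0) -> float:
    """Peak signal-to-noise ratio between two images of the same shape."""
    mse = np.mean((ref.astype(np.float64) - out.astype(np.float64)) ** 2)
    return 10.0 * np.log10(max_val ** 2 / mse)

def ssim(ref: np.ndarray, out: np.ndarray) -> float:
    """Structural similarity; channel_axis=2 for HWC color images."""
    return structural_similarity(ref, out, channel_axis=2, data_range=255)
```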

For reference, here is the butterfly demo after 1 epoch of training:

[image: 1_epoch]

and after 100 epochs:

[image: 100_epoch]

For comparison, here is how your pre-trained model looks:

[image: pre-trained]

@twtygqyy (Owner) commented Jan 7, 2019

Hi @AnimatedRNG, if you used the PyTorch API to generate the data for training but used the Matlab-generated data for evaluation, the script cannot output a correct PSNR score. The reason is the different bicubic interpolation implementations in Matlab and Python. Check out https://www.reddit.com/r/MachineLearning/comments/6vdo51/p_matlab_bicubic_imresize_implemented_in_python/ for more information.

@AnimatedRNG (Author)

Ah, good point. I'll try generating some of my own bicubic-downsampled images and see whether the results improve.

I'd rather use PyTorch's bicubic downsample than Matlab's (or the imresize Python implementation that you mentioned), mostly for performance reasons. PyTorch's bicubic downsample also happens to be compatible with OpenCV's bicubic downsample, which is another plus.
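
A quick sanity check of that compatibility claim might look like this (illustrative; small numerical differences near borders are expected):

```python
import cv2
import numpy as np
import torch
import torch.nn.functional as F

img = np.random.rand(256, 256, 3).astype(np.float32)

# OpenCV bicubic downsample.
cv_out = cv2.resize(img, (64, 64), interpolation=cv2.INTER_CUBIC)

# PyTorch bicubic downsample on the same data (NCHW layout).
t = torch.from_numpy(img).permute(2, 0, 1).unsqueeze(0)
pt_out = F.interpolate(t, size=(64, 64), mode='bicubic', align_corners=False)
pt_out = pt_out.squeeze(0).permute(1, 2, 0).numpy()

print(np.abs(cv_out - pt_out).max())  # should be small if the kernels agree
```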

So I'll either generate new .mat's for the demo/evaluation scripts or just load them as PNGs.

@twtygqyy (Owner)

@AnimatedRNG Thanks for the update. The purpose of this repo is to reproduce the paper's results, so Matlab preprocessing was applied for consistency and a fair comparison with the paper. That said, I agree it would be better to use bicubic without anti-aliasing for practical use.

@leonardozcm

Hey, I'm glad to report that your suggestions really work! I wrote a Python script to generate a trainable dataset; I'll commit it soon for further use.
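
A minimal sketch of such a generation script (the folder layout, patch size, and HDF5 key names here are all assumptions, not this repo's actual format):

```python
import glob
import cv2
import h5py
import numpy as np

scale, patch = 4, 128
hr_patches, lr_patches = [], []

for path in glob.glob('DIV2K_train_HR/*.png'):  # assumed input folder
    img = cv2.imread(path).astype(np.float32) / 255.0
    h, w = img.shape[:2]
    # Tile the image into non-overlapping HR patches and downsample each.
    for y in range(0, h - patch + 1, patch):
        for x in range(0, w - patch + 1, patch):
            hr = img[y:y + patch, x:x + patch]
            lr = cv2.resize(hr, (patch // scale, patch // scale),
                            interpolation=cv2.INTER_CUBIC)
            hr_patches.append(hr)
            lr_patches.append(lr)

with h5py.File('train.h5', 'w') as f:  # assumed output name and keys
    f.create_dataset('hr', data=np.stack(hr_patches))
    f.create_dataset('lr', data=np.stack(lr_patches))
```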
