Learning Methodologies to Generate Kernel-Learning-Based Image Downscaler for Arbitrary Scaling Factors | IEEE Journals & Magazine | IEEE Xplore

Learning Methodologies to Generate Kernel-Learning-Based Image Downscaler for Arbitrary Scaling Factors


Abstract:

Displays and content have various resolutions and aspect ratios, requiring an image downscaler to adaptively reduce the image resolution. However, research on downscaling...Show More

Abstract:

Displays and content have various resolutions and aspect ratios, requiring an image downscaler to adaptively reduce the image resolution. However, research on downscaling has garnered less attention than upscaling, including super-resolution. In practical display systems, simple interpolation, such as a bicubic filter that cannot preserve image details well, is still widely used for image downscaling rather than frame optimization-based or learning-based methods because of following reasons: frame optimization-based methods can effectively preserve image details after downscaling but are difficult to implement due to hardware costs. Learning-based methods have not been developed because defining a target downscaled image for training is difficult and training all downscaling factors is impossible. We propose a novel kernel-learning-based image downscaler to improve detail-preservation quality while supporting arbitrary downscaling factors using simple linear mapping. For this, a method to produce the ideal target downscaling result considering aliasing artifacts and detail preservation after downscaling is proposed. Then, we propose a training technique using the positional relationship between input and output pixels and a hierarchical region analysis to reproduce target images through simple kernel-based linear mapping. Lastly, a kernel-sharing technique is proposed to generate downscaling results for downscaling factors using a minimum number of trained kernels. In the simulation results, the proposed method demonstrated excellent edge preservation by improving the recall, precision, and F1 score, measuring the edge consistency between input and downscaled images, by up to 0.141, 0.079, 0.053, respectively, compared to benchmark methods. In a paired-comparison-based user study, the proposed method obtained the highest preference among benchmark methods using simple operations.
Published in: IEEE Transactions on Image Processing ( Volume: 30)
Page(s): 4526 - 4539
Date of Publication: 20 April 2021

ISSN Information:

PubMed ID: 33877975

Funding Agency:

References is not available for this document.

I. Introduction

With the development of display technology, display devices with various aspect ratios and resolutions have been widely used in recent years. Accordingly, a technique to adjust the resolution or aspect ratio of an image to that of a target display device is in increasing demand. In accordance with the demand, image-retargeting techniques [1]–[5] for changing the aspect ratio of an image and image-scaling techniques [6]–[18] for changing the resolution of an image have been developed. Among these techniques indispensable to the display system, the image scaling that we aim to develop is not particularly limited to the display system and can be used in various fields related to image processing, such as web browsers, image viewers, and image editors, etc.

Select All
1.
S. Avidan and A. Shamir, "Seam carving for content-aware image resizing", ACM Trans. Graph., vol. 26, no. 3, pp. 10-18, 2007.
2.
B. Li, L.-Y. Duan, J. Wang, R. Ji, C.-W. Lin and W. Gao, "Spatiotemporal grid flow for video retargeting", IEEE Trans. Image Process., vol. 23, no. 4, pp. 1615-1628, Apr. 2014.
3.
S. J. Lee, S. Lee, S. I. Cho and S.-J. Kang, "Object detection-based video retargeting with spatial–temporal consistency", IEEE Trans. Circuits Syst. Video Technol., vol. 30, no. 12, pp. 4434-4439, Dec. 2020.
4.
S. I. Cho and S.-J. Kang, "Temporal incoherence-free video retargeting using foreground aware extrapolation", IEEE Trans. Image Process., vol. 29, pp. 4848-4861, Dec. 2020.
5.
S. I. Cho and S.-J. Kang, "Extrapolation-based video retargeting with backward warping using an image-to-warping vector generation network", IEEE Signal Process. Lett., vol. 27, pp. 446-450, Dec. 2020.
6.
J. Yang, J. Wright, T. Huang and Y. Ma, "Image super-resolution as sparse representation of raw image patches", Proc. IEEE Conf. Comput. Vis. Pattern Recognit., pp. 1-8, Jun. 2008.
7.
J. Yang, J. Wright, T. S. Huang and Y. Ma, "Image super-resolution via sparse representation", IEEE Trans. Image Process., vol. 19, no. 11, pp. 2861-2873, Nov. 2010.
8.
C. Dong, C. C. Loy, K. He and X. Tang, "Learning a deep convolutional network for image super-resolution", Proc. Eur. Conf. Comput. Vis. (ECCV), pp. 184-199, 2014.
9.
J.-S. Choi and M. Kim, "Super-interpolation with edge-orientation-based mapping kernels for low complex upscaling", IEEE Trans. Image Process., vol. 25, no. 1, pp. 469-483, Jan. 2016.
10.
C. Dong, C. C. Loy, K. He and X. Tang, "Image super-resolution using deep convolutional networks", IEEE Trans. Pattern Anal. Mach. Intell., vol. 38, no. 2, pp. 295-307, Feb. 2016.
11.
D. P. Mitchell and A. N. Netravali, "Reconstruction filters in computer graphics", Proc. ACM SIGGRAPH, pp. 221-228, 1988.
12.
C. E. Duchon, "Lanczos filtering in one and two dimensions", J. Appl. Meteorol., vol. 18, no. 8, pp. 1016-1022, Aug. 1979.
13.
J. Kopf, A. Shamir and P. Peers, "Content-adaptive image downscaling", ACM Trans. Graph., vol. 32, no. 6, pp. 1-8, Nov. 2013.
14.
N. Weber, M. Waechter, S. C. Amend, S. Guthe and M. Goesele, "Rapid detail-preserving image downscaling", ACM Trans. Graph., vol. 35, no. 6, pp. 1-6, Nov. 2016.
15.
A. C. Öztireli and M. Gross, "Perceptually based downscaling of images", ACM Trans. Graph., vol. 34, no. 4, 2015.
16.
J. Liu, S. He and R. W. H. Lau, " L 0 -regularized image downscaling ", IEEE Trans. Image Process., vol. 27, no. 3, pp. 1076-1085, Mar. 2018.
17.
H. Kim, M. Choi, B. Lim and K. M. Lee, "Task-aware image downscaling", Proc. Eur. Conf. Comput. Vis., pp. 399-414, 2018.
18.
Y. Li, D. Liu, H. Li, L. Li, Z. Li and F. Wu, "Learning a convolutional neural network for image compact-resolution", IEEE Trans. Image Process., vol. 28, no. 3, pp. 1092-1107, Mar. 2019.
19.
H. Hyötyniemi, "Multivariate regression-techniques and tools", 2001.
20.
T. Köhler, M. Bätz, F. Naderi, A. Kaup, A. Maier and C. Riess, "Toward bridging the simulated-to-real gap: Benchmarking super-resolution on real data", IEEE Trans. Pattern Anal. Mach. Intell., vol. 42, no. 11, pp. 2944-2959, Nov. 2019, [online] Available: https://ieeexplore.ieee.org/abstract/document/8716546.
21.
R. Zeyde, M. Elad and M. Protter, "On single image scale-up using sparse-representations" in Curves and Surfaces, New York, NY, USA:Springer, pp. 711-730, 2012.
22.
D. Martin, C. Fowlkes, D. Tal and J. Malik, "A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics", Proc. 8th IEEE Int. Conf. Comput. Vis. (ICCV), vol. 2, pp. 416-423, Jul. 2001.
23.
Kodak Lossless True Color Image Suite, Apr. 2021, [online] Available: http://r0k.us/graphics/kodak/.
24.
J. Canny, "A computational approach to edge detection", IEEE Trans. Pattern Anal. Mach. Intell., vol. PAMI-8, no. 6, pp. 679-698, Nov. 1986.
25.
D. Wang, L. Zhang and A. Vincent, "Motion-compensated frame rate up-conversion—Part I: Fast multi-frame motion estimation", IEEE Trans. Broadcast., vol. 56, no. 2, pp. 133-141, Jun. 2010.
26.
K. M. Nam, J.-S. Kim, R.-H. Park and Y. Se. Shim, "A fast hierarchical motion vector estimation algorithm using mean pyramid", IEEE Trans. Circuits Syst. Video Technol., vol. 5, no. 4, pp. 344-351, Aug. 1995.

Contact IEEE to Subscribe

References

References is not available for this document.