ZHUANG Xuanquan,LI Caixia,LI Peixing.Style transfer based on cross-layer correlation perceptual loss[J].Acta Scientiarum Naturalium Universitatis Sunyatseni,2020,59(06):126-135.
ZHUANG Xuanquan,LI Caixia,LI Peixing.Style transfer based on cross-layer correlation perceptual loss[J].Acta Scientiarum Naturalium Universitatis Sunyatseni,2020,59(06):126-135. DOI: 10.13471/j.cnki.acta.snus.2019.10.11.2019A079.
Style transfer based on cross-layer correlation perceptual loss
Great success in deep-learning-based style transfer is accelerating the development of photo artistic stylization applications. And the change of loss function from per-pixel loss to perceptual loss based on the Gram matrix is the most critical part of this progress. Gram matrix shows good performance in style feature extraction, but it only focuses on correlations among same level features. Therefore, Gram matrix cannot be considered as a complete representation of styles. However, most of the research focus on how to improve transfer speed by designing new model structure instead of analyzing and modifying the Gram matrix. The cross-layer correlation matrix is used to calculate style loss function as a replacement or supplement to the Gram matrix. By experiments, it is shown that this method can reduce 20% of the calculation time in comparison with the Gram matrix method while yielding similar outputs.
关键词
风格迁移Gram矩阵卷积神经网络风格损失函数感知损失深度学习
Keywords
style transferGram matrixconvolutional neural networkstyle loss functionperceptual lossdeep learning
references
GATYS L A, ECKER A S, BETHGE M. Image style transfer using convolutional neural networks [C]//Computer Vision and Pattern Recognition, 2016: 2414-2423.
GATYS L A, ECKER A S, BETHGE M. Texture synthesis using convolutional neural networks [C]//International Conference on Neural Information Processing Systems, 2015: 262-270.
JOHNSON J, ALAHI A, FEIFEI L. Perceptual losses for real-time style transfer and super-resolution [C]//European Conference on Computer Vision, 2016: 694-711.
SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition [J]. International Conference on Learning Representations, 2015.
LI Y, WANG N, LIU J, et al. Demystifying neural style transfer [C]//IJCAI, 2017: 2230-2236.
ULYANOV D, VEDALDI A, LEMPITSKY V. Improved texture networks: Maximizing quality and diversity in feed-forward stylization and texture synthesis [C]//Computer Vision and Pattern Recognition, 2017: 4105-4113.
DUMOULIN V, SHLENS J, KUDLUR M. A learned representation for artistic style [J]. International Conference on Learning Representations, 2017.
HUANG X, BELONGIE S. Arbitrary style transfer in real-time with adaptive instance normalization [C]//International Conference on Computer Vision, 2017:1510-1519.
WANG H, LIANG X, ZHANG H, et al. Zm-net: real-time zero-shot image manipulation network [C]//Computer Vision and Pattern Recognition,2017.
GOODFELLOW I J, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial networks [J]. Advances in Neural Information Processing Systems, 2014, 3: 2672-2680.
RADFORD A, METZ L, CHINTALA S. Unsupervised representation learning with deep convolutional generative adversarial networks [C]//International Conference on Learning Representations, 2016.
ZHU J, PARK T, ISOLA P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks [C]//International Conference on Computer Vision, 2017: 2242-2251.
YI Z, ZHANG H,TAN P, et al. DualGAN: unsupervised dual learning for image-to-image translation [C]//International Conference on Computer Vision, 2017: 2868-2876.
KIM T, CHA M, KIM H, et al. Learning to discover cross-domain relations with generative adversarial networks [C]//Computer Vision and Pattern Recognition, 2017.
KARRAS T, LAINE S, AILA T, et al. A style-based generator architecture for generative adversarial networks [C]//Computer Vision and Pattern Recognition, 2019: 4401-4410.
SHAHAM T R, DEKEL T, MICHAELI T, et al. SinGAN: Learning a generative model from a single natural image [C]//International Conference on Computer Vision, 2019: 4570-4580.
KINGMA D P, BA J. Adam: a method for stochastic optimization [C]//International Conference on Learning Representations, 2015.
RUDER S. An overview of gradient descent optimization algorithms [J]. arXiv: Learning, 2016.
DOGO E M, AFOLABI O J, NWULU N I, et al. A comparative analysis of gradient descent-based optimization algorithms on convolutional neural networks [J]. International Conference on Computational Techniques, Electronics and Mechanical Systems (CTEMS), Belgaum, India, 2018: 92-99.
HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition [C]//Computer Vision and Pattern Recognition, 2016: 770-778.
GAO S, CHENG M, ZHAO K, et al. Res2Net: A new multi-scale backbone architecture [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019: 1-1.