Publication Type : Journal Article
Publisher : Springer Science and Business Media LLC
Source : Multimedia Systems
Url : https://doi.org/10.1007/s00530-023-01232-5
Campus : Amaravati
School : School of Computing
Year : 2024
Abstract : Residual networks (ResNets) have been utilized for various computer vision and image processing applications. The residual connection improves the training of the network with better gradient flow. A residual block consists of a few convolutional layers having trainable parameters, which leads to overfitting. Moreover, the present residual networks are not able to utilize the high- and low-frequency information suitably, which also challenges the generalization capability of the network. In this paper, a frequency-disentangled residual network (FDResNet) is proposed to tackle these issues. Specifically, FDResNet includes separate connections in the residual block for low- and high-frequency components, respectively. Basically, the proposed model disentangles the low- and high-frequency components to increase the generalization ability. Moreover, the computation of low- and high-frequency components using fixed filters further avoids the overfitting. The proposed model is tested on benchmark CIFAR-10/100, Caltech, and TinyImageNet datasets for image classification. The performance of the proposed model is also tested in the image retrieval framework. It is noticed that the proposed model outperforms its counterpart residual model. The effect of kernel size and standard deviation is also evaluated. The impact of the frequency disentangling is also analyzed using a saliency map.
Cite this Research Publication : Satya Rajendra Singh, Roshan Reddy Yedla, Shiv Ram Dubey, Rakesh Kumar Sanodiya, Wei-Ta Chu, Frequency disentangled residual network, Multimedia Systems, Springer Science and Business Media LLC, 2024, https://doi.org/10.1007/s00530-023-01232-5 (Q1, Impact Factor: 2.60)