I have a question: how can I justify that my CNN (VGG16 with an attention mechanism) gives better classification results than DenseNet, ResNet, and Inception when the same attention mechanism is added to each of them?
First, I trained three different CNN models (VGG16, DenseNet, and ResNet). Among them, DenseNet gave the best accuracy. I then modified these models by adding an attention mechanism to each, and when I compared the results again, VGG16 gave the best results. I don't know how to justify this, since VGG16 is an older architecture than the other networks, yet it still outperforms them.