Bilinear-Pooling-with-Hierarchical-Structure-for-Precise-Visual-Recognition

Accurate visual representation remains an active area of computer vision research, and deep architectures such as ResNet50, together with pooling algorithms, have been central to its progress. This study introduces Bilinear Pooling with Hierarchical Structure, an approach intended to improve visual recognition accuracy. Built on ResNet50, the method applies hierarchical pooling with the ReLU activation function. The combination improves feature representation and the accuracy with which complex visual details are captured. Organizing the pooling layers hierarchically allows multi-scale contextual information to be extracted efficiently. Training results show that the proposed framework yields notable performance improvements, highlighting the potential benefits of the method.

Screenshots

Introduction

This project aims to improve fine-grained visual recognition accuracy. It integrates Hierarchical Bilinear Pooling (HBP) with the ResNet architecture, leverages ResNet's capabilities and pre-trained parameters, and makes the codebase available for broader research and adaptation.
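To make the pooling step concrete, here is a minimal NumPy sketch of hierarchical bilinear pooling as it is commonly described: features from three layers are projected to a shared dimension, multiplied pairwise, sum-pooled over spatial positions, concatenated, and normalized. The function name `hbp_features`, the random projection matrices, and all sizes are assumptions for illustration; the repository's own model code may differ in its details.

```python
import numpy as np

def hbp_features(x, y, z, proj_x, proj_y, proj_z, eps=1e-12):
    """Hypothetical helper illustrating hierarchical bilinear pooling.

    x, y, z: (C, N) feature maps from three layers, flattened over
             N = H * W spatial positions.
    proj_*:  (D, C) projection matrices (1x1 convolutions in a CNN).
    """
    # Project each layer's features into a shared D-dimensional space.
    px, py, pz = proj_x @ x, proj_y @ y, proj_z @ z  # each (D, N)

    # Cross-layer interactions: elementwise products of every layer pair.
    pairs = [px * py, px * pz, py * pz]

    # Sum-pool each interaction over spatial positions, then concatenate.
    pooled = np.concatenate([p.sum(axis=1) for p in pairs])  # (3 * D,)

    # Signed square root followed by L2 normalization.
    pooled = np.sign(pooled) * np.sqrt(np.abs(pooled))
    return pooled / (np.linalg.norm(pooled) + eps)

# Illustrative sizes only (assumed, not taken from the repository).
rng = np.random.default_rng(0)
C, H, W, D = 8, 4, 4, 16

# ReLU-activated stand-ins for three late-layer feature maps.
x, y, z = (np.maximum(rng.standard_normal((C, H * W)), 0) for _ in range(3))
proj_x, proj_y, proj_z = (rng.standard_normal((D, C)) for _ in range(3))

feat = hbp_features(x, y, z, proj_x, proj_y, proj_z)
print(feat.shape)  # (48,)
```

The resulting vector would be passed to the final fc classifier. Pooling pairs of layers, rather than one layer with itself, is what lets the representation combine information from several depths.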

Objectives

• Improve model efficiency with a two-step training strategy

• Increase accuracy through spatial relationship learning

• Improve fine-grained recognition using hierarchical bilinear pooling

• Capture complex relationships through hierarchical pooling

• Enhance the model's feature representation

• Extract multi-scale contextual information efficiently

• Improve model performance through training

Architecture Diagram

Requirements

Python 2.7
PyTorch 0.4.1

Train

Step 1.

Download the pre-trained ResNet parameters.
Download the CUB-200-2011 dataset (CUB-download).

Step 2.

Set the paths to the dataset and the ResNet parameters in the code.
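Once the dataset path is set, the train/test split has to be read from the dataset's index files. The sketch below assumes the standard CUB-200-2011 layout, in which `images.txt`, `train_test_split.txt`, and `image_class_labels.txt` each hold one `<image_id> <value>` pair per line. The function name `parse_cub_split` and the sample file contents are hypothetical, and the repository's own data loader may read different list files.

```python
def parse_cub_split(images_txt, split_txt, labels_txt):
    """Hypothetical helper: build (path, label) lists for train and test.

    Each argument is the text content of the corresponding index file.
    """
    def table(text):
        # Map image id -> value for lines of the form "<id> <value>".
        return dict(line.split(maxsplit=1) for line in text.strip().splitlines())

    paths = table(images_txt)
    is_train = table(split_txt)
    labels = table(labels_txt)

    train, test = [], []
    for image_id, path in paths.items():
        # Class ids are 1-based in the file; shift to 0-based for training.
        sample = (path, int(labels[image_id]) - 1)
        (train if is_train[image_id] == "1" else test).append(sample)
    return train, test

# Tiny made-up excerpts in the dataset's format.
images = "1 001.Black_footed_Albatross/a.jpg\n2 001.Black_footed_Albatross/b.jpg\n3 002.Laysan_Albatross/c.jpg\n"
split = "1 0\n2 1\n3 1\n"
labels = "1 1\n2 1\n3 2\n"

train, test = parse_cub_split(images, split, labels)
print(len(train), len(test))  # 2 1
```

In practice the three strings would be read from the files under the configured dataset directory.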

Step 3. Train the fc layer only.

python train_firststep.py
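As a rough illustration of what training only the fc layer involves, the sketch below freezes a backbone and hands only the classifier's parameters to the optimizer. `TinyNet`, the learning rate, and the random batch are assumptions for illustration, not the contents of `train_firststep.py`, and the code targets current Python 3 and PyTorch rather than the pinned versions listed under Requirements.

```python
import torch
import torch.nn as nn

class TinyNet(nn.Module):
    """Hypothetical stand-in: a small backbone plus an fc classifier."""
    def __init__(self, num_classes=200):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.fc = nn.Linear(8, num_classes)

    def forward(self, x):
        return self.fc(self.backbone(x))

torch.manual_seed(0)
model = TinyNet()

# Freeze the backbone so that only the fc layer receives gradient updates.
for p in model.backbone.parameters():
    p.requires_grad = False

# The optimizer sees the classifier's parameters only (lr is an assumed value).
optimizer = torch.optim.SGD(model.fc.parameters(), lr=0.1, momentum=0.9)

backbone_before = [p.clone() for p in model.backbone.parameters()]
fc_before = model.fc.weight.clone()

# One training step on a random batch standing in for real images.
images = torch.randn(4, 3, 32, 32)
targets = torch.randint(0, 200, (4,))
loss = nn.functional.cross_entropy(model(images), targets)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

After the step, the backbone weights are unchanged while the fc weights have moved, which is the behavior the first training stage relies on.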

Step 4. Fine-tune all layers. This reaches an accuracy of around 86% on CUB-200-2011 with ResNet-50.

python train_finetune.py
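Fine-tuning all layers typically means re-enabling gradients for every parameter and continuing with a smaller learning rate. The sketch below shows that pattern on a small stand-in network; the model, the learning rate, and the weight decay are assumed values for illustration and are not taken from `train_finetune.py`. It is written for current Python 3 and PyTorch rather than the pinned versions listed under Requirements.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical stand-in for a backbone followed by an fc classifier.
backbone = nn.Sequential(
    nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
)
fc = nn.Linear(8, 200)
net = nn.Sequential(backbone, fc)

# State left over from the first stage: backbone frozen, fc trainable.
for p in backbone.parameters():
    p.requires_grad = False

# Second stage: unfreeze everything and optimize all parameters.
for p in net.parameters():
    p.requires_grad = True

# A smaller learning rate is a common choice here (assumed values).
finetune_optimizer = torch.optim.SGD(
    net.parameters(), lr=1e-3, momentum=0.9, weight_decay=1e-5
)

conv_before = backbone[0].weight.clone()

# One fine-tuning step on a random batch standing in for real images.
batch = torch.randn(4, 3, 32, 32)
batch_targets = torch.randint(0, 200, (4,))
finetune_loss = nn.functional.cross_entropy(net(batch), batch_targets)
finetune_optimizer.zero_grad()
finetune_loss.backward()
finetune_optimizer.step()
```

In a real run the weights saved at the end of the first stage would be loaded before this step, so the classifier starts from a trained state.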



Sarvesh30112002/Bilinear-Pooling-with-Hierarchical-Structure-for-Precise-Visual-Recognition
