Single Object Detection Using CNN

Overview

This repository showcases my project for detecting a single person in images using a deep Convolutional Neural Network (CNN). The model is trained on images gathered from the COCO dataset, specifically selected to contain only one person in each image. The primary goal is to establish a foundational approach toward multi-object detection.

Project Journey

Motivation

The motivation behind this project stemmed from my interest in object detection and the desire to build a lightweight and efficient model. I aimed to create a model that could detect individuals in images while minimizing computational complexity.

Challenges Faced

Data Handling: Gathering and preprocessing data from the COCO dataset was challenging and time consuming. Ensuring that the images contained only one person require Consistency because of the size of annotation json file.
Complexity Management: Training deep learning models is inherently resource-intensive. I faced memory constraints and had to optimize my model architecture for efficiency.
Grayscale Conversion: Converting images to grayscale was a significant decision to reduce complexity. This approach helped simplify feature extraction but required thorough testing to ensure the model still performed well.

Achievements

Model Development: Successfully achieved accuracy on deep CNN architecture capable of predicting bounding boxes for single-person detection. The model consists of several convolutional layers, pooling layers, and fully connected layers.
Efficiency Improvements: The decision to work with grayscale images significantly reduced computational load while maintaining acceptable performance levels.
Foundational Step: This project serves as a solid stepping stone toward more complex multi-object detection tasks, laying the groundwork for future enhancements.

Sample Model Architecture

The architecture of the CNN model is designed to effectively capture features while remaining lightweight. Below is a brief overview of the architecture:

The model has demonstrated promising results in predicting bounding boxes for single-person images. Visualizations of the predicted bounding boxes can be found in the repository.

Future Directions:

Expand the model to support multi-object detection.
Explore more advanced techniques, such as Vision Transformer and more deep CNN's for effective feature extraction, to enhance accuracy and efficiency.
Experiment with data augmentation methods to improve model robustness and generalization.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
PersonsCOCO.h5		PersonsCOCO.h5
PersonsCOCO.keras		PersonsCOCO.keras
README.md		README.md
peronscoco.ipynb		peronscoco.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Single Object Detection Using CNN

Overview

Project Journey

Motivation

Challenges Faced

Achievements

Sample Model Architecture

Future Directions:

About

Releases

Packages

Languages

NishantkSingh0/Person-Bounding-box-detection

Folders and files

Latest commit

History

Repository files navigation

Single Object Detection Using CNN

Overview

Project Journey

Motivation

Challenges Faced

Achievements

Sample Model Architecture

Future Directions:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages