Abstract:
This paper introduces an innovative approach that utilizes Vision Transformers in conjunction with a loss function derived from Intersection over Union (IOU) and Mean Squ...Show MoreMetadata
Abstract:
This paper introduces an innovative approach that utilizes Vision Transformers in conjunction with a loss function derived from Intersection over Union (IOU) and Mean Squared Error (MSE) for the training and prediction of class labels and their associated bounding boxes. Our experiments, conducted on a dataset containing two distinct class labels, demonstrate an impressive 96% accuracy in label prediction and a 95% IoU accuracy for bounding boxes.
Published in: 2024 IEEE/ACIS 22nd International Conference on Software Engineering Research, Management and Applications (SERA)
Date of Conference: 30 May 2024 - 01 June 2024
Date Added to IEEE Xplore: 26 September 2024
ISBN Information: