skip to main content
10.1145/3571600acmotherconferencesBook PagePublication PagesicvgipConference Proceedingsconference-collections
ICVGIP '22: Proceedings of the Thirteenth Indian Conference on Computer Vision, Graphics and Image Processing
ACM2022 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
ICVGIP'22: Thirteenth Indian Conference on Computer Vision, Graphics and Image Processing Gandhinagar India December 8 - 10, 2022
ISBN:
978-1-4503-9822-0
Published:
12 May 2023
Recommend ACM DL
ALREADY A SUBSCRIBER?SIGN IN

Reflects downloads up to 17 Feb 2025Bibliometrics
research-article
A Novel Multi-Scale Residual Dense Dehazing Network (MSRDNet) for Single Image Dehazing✱
Article No.: 1, Pages 1–9https://doi.org/10.1145/3571600.3571601

Dehazing is a difficult process because of the damage caused by the non-uniform fog and haze distribution in images. To address these issues, a Multi-Scale Residual dense Dehazing Network (MSRDNet) is proposed in this paper. A Contextual feature ...

research-article
Interpreting Intrinsic Image Decomposition using Concept Activations
Article No.: 2, Pages 1–9https://doi.org/10.1145/3571600.3571603

Evaluation of ill-posed problems like Intrinsic Image Decomposition (IID) is challenging. IID involves decomposing an image into its constituent illumination-invariant Reflectance (R) and albedo-invariant Shading (S) components. Contemporary IID ...

research-article
Quaternion Factorized Simulated Exposure Fusion
Article No.: 3, Pages 1–9https://doi.org/10.1145/3571600.3571604

Image Fusion maximizes the visual information at each pixel location by merging content from multiple images in order to produce an enhanced image. Exposure Fusion, specifically, fuses a bracketed exposure stack of poorly lit images to generate a ...

research-article
Learning from Multiple Datasets for Recognizing Human Actions
Article No.: 4, Pages 1–9https://doi.org/10.1145/3571600.3571605

Action recognition has evolved as an important research problem in the computer vision community. Majority of the human action recognition methods focus mainly on training from a single dataset. Scarcity of labelled data in a single dataset often leads ...

research-article
Topological Shape Matching using Multi-Dimensional Reeb Graphs
Article No.: 5, Pages 1–10https://doi.org/10.1145/3571600.3571606

Shape matching or retrieval is an important problem in computer graphics and data analysis. Topological techniques based on Reeb graphs and persistence diagrams have been employed to obtain an effective solution in this problem. In the current paper, ...

research-article
Convolutional Ensembling based Few-Shot Defect Detection Technique
Article No.: 6, Pages 1–7https://doi.org/10.1145/3571600.3571607

Over the past few years, there has been a significant improvement in the domain of few-shot learning. This learning paradigm has shown promising results for the challenging problem of anomaly detection, where the general task is to deal with heavy ...

research-article
Masked Student Dataset of Expressions
Article No.: 7, Pages 1–9https://doi.org/10.1145/3571600.3571608

Facial expression recognition (FER) algorithms work well in constrained environments with little or no occlusion of the face. However, real-world face occlusion is prevalent, most notably with the need to use a face mask in the current Covid-19 ...

research-article
Performance, Trust, or both? COVID-19 Diagnosis and Prognosis using Deep Ensemble Transfer Learning on X-ray Images✱
Article No.: 8, Pages 1–9https://doi.org/10.1145/3571600.3571609

The COVID-19 pandemic still affects most parts of the world today. Despite a lot of research on diagnosis, prognosis, and treatment, a big challenge today is the limited number of expert radiologists who provide diagnosis and prognosis on X-Ray images. ...

research-article
Alzheimer’s severity classification using Transfer Learning and Residual Separable Convolution Network
Article No.: 9, Pages 1–6https://doi.org/10.1145/3571600.3571610

Severity classification is the most pivotal task in Alzheimer’s disease diagnosis. Detection of brain structural changes from brain MR images is crucial for Alzheimer’s classification. In this paper, we have proposed a transfer learning and residual ...

research-article
Detecting Coronavirus (COVID -19) Disease Cues from Chest Radiography Images
Article No.: 10, Pages 1–7https://doi.org/10.1145/3571600.3571611

This paper proposes a deep learning-based approach to detect COVID-19 infections in lung tissues from chest Computed Tomography (CT) images. A two-stage classification model is designed to identify the infection from CT scans of COVID-19 and Community ...

research-article
Posture Guided Human Action Recognition for Fitness Applications
Article No.: 11, Pages 1–9https://doi.org/10.1145/3571600.3571612

Human action recognition has attracted a lot of attention in the recent past due to newer applications in computer vision such as fitness tracking, augmented reality and virtual reality. Most of the existing deep learning based methods first deploy a ...

research-article
Towards Robust Handwritten Text Recognition with On-the-fly User Participation

Long-term OCR services aim to provide high-quality output to their users at competitive costs. It is essential to upgrade the models because of the complex data loaded by the users. The service providers encourage the users who provide data where the ...

research-article
Low Resource Degraded Quality Document Image Binarization – Domain Adaptation is the Way
Article No.: 13, Pages 1–10https://doi.org/10.1145/3571600.3571614

Usually, image binarization plays a crucial role in automatic analysis of degraded documents from their captured images. However, this binarization task is often difficult due to a number of reasons including the high similarity between noisy ...

research-article
A Globally-Connected and Trainable Hierarchical Fine-Attention Generative Adversarial Network based Adversarial Defense
Article No.: 14, Pages 1–9https://doi.org/10.1145/3571600.3571615

Deep Neural Network (DNN) inferences have been proven highly susceptible to carefully engineered adversarial perturbations, presenting a pivotal hindrance to real-world Computer Vision tasks. Most of the existing defenses have poor generalization ...

research-article
FERA-net: An emotion classifier from facial expressions using FER-net with attention mechanism✱
Article No.: 15, Pages 1–7https://doi.org/10.1145/3571600.3571616

Emotions play a significant and important role in daily life. It can be recognized by facial expressions, speech, and physiological signals such as electroencephalogram (EEG), electrocardiogram (ECG), body temperature, etc. Facial expression is one of ...

research-article
One shot learning in StyleALAE: Preserving facial identity during semantic modification
Article No.: 16, Pages 1–7https://doi.org/10.1145/3571600.3571617

Semantic face editing of real-world facial images is an important application of generative models. Recently, several works have explored possible techniques to generate such modifications by utilizing the latent structure of pre-trained GAN models. ...

research-article
Split and Knit: 3D Fingerprint Capture with a Single Camera
Article No.: 17, Pages 1–9https://doi.org/10.1145/3571600.3571618

3D fingerprint capture is less sensitive to skin moisture levels and avoids skin deformation, which is common in contact-based sensors, in addition to capturing depth information. Unfortunately, its adoption is limited due to high cost and system ...

research-article
I’m GROOT: a multi head multi GRaph netwOrk recognizing surgical actiOn Triplets✱
Article No.: 18, Pages 1–9https://doi.org/10.1145/3571600.3571619

Laparoscopic cholecystectomy is a widely performed minimally invasive surgical procedure that imposes many challenges to the operating surgeon. While we strive to understand and automate such surgeries, the key is to identify the actions involved in ...

research-article
Supervised Contrastive Multi-tasking Learning Based Hierarchical Yoga Pose Classification Using CNNs
Article No.: 19, Pages 1–9https://doi.org/10.1145/3571600.3571620

In this paper, we propose a technique for hierarchical yoga pose classification in a multi-tasking framework. Novelty lies in the proposed supervised contrastive combined loss function. We propose the usage of linear combination of three loss functions:...

research-article
Overcoming Label Noise for Source-free Unsupervised Video Domain Adaptation
Article No.: 20, Pages 1–9https://doi.org/10.1145/3571600.3571621

Despite the progress seen in classification methods, current approaches for handling videos with distribution shifts in source and target domains remain source-dependent as they require access to the source data during the adaptation stage. In this ...

research-article
REF-SHARP: REFined face and geometry reconstruction of people in loose clothing✱
Article No.: 21, Pages 1–10https://doi.org/10.1145/3571600.3571622

In this paper, we address the problem of monocular 3D human reconstruction with an acute focus on the challenge of recovering person-specific facial geometry as well as suppressing surface noise, specifically addressing the issue of false geometrical ...

research-article
End-to-End GPU-Accelerated Low-Poly Remeshing using Curvature Map and Voronoi Tessellation✱
Article No.: 22, Pages 1–9https://doi.org/10.1145/3571600.3571623

We propose a novel algorithm for low-poly remeshing of 3D surfaces that runs fully in GPU. Since the input mesh is generally not well-organized, performing mesh simplification directly on the input mesh is liable to produce a low-poly mesh with a ...

research-article
Depth estimation using Stereo Light Field Camera✱
Article No.: 23, Pages 1–9https://doi.org/10.1145/3571600.3571624

Light field imaging has emerged as a new modality, enabling to capture the angular and spatial information of a scene. This additional angular information is used to estimate the depth of a 3-D scene. The continuum of virtual view-points in light field ...

research-article
Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred Thousand-Scale One-Shot Logo Identification✱
Article No.: 24, Pages 1–9https://doi.org/10.1145/3571600.3571625

In this paper, we study the problem of identifying logos of business brands in natural scenes in an open-set one-shot setting. This problem setup is significantly more challenging than traditionally-studied ‘closed-set’ and ‘large-scale training ...

research-article
A Fine-Grained Vehicle Detection (FGVD) Dataset for Unconstrained Roads✱
Article No.: 25, Pages 1–9https://doi.org/10.1145/3571600.3571626

The previous fine-grained datasets mainly focus on classification and are often captured in a controlled setup, with the camera focusing on the objects. We introduce the first Fine-Grained Vehicle Detection (FGVD) dataset in the wild, captured from a ...

research-article
Design of a System and Method for Optimal selection of Tumor Slice using Linear Ultrasound Imaging for Histopathology
Article No.: 26, Pages 1–8https://doi.org/10.1145/3571600.3571627

In excision biopsy, a tumor mass is surgically removed from the body. Subsequently, it is sliced at an appropriate location and investigated microscopically through a process called histopathology. Any bias in tumor slicing severely influences ...

research-article
Multi-view Learning with Two-stage Training of 2D CNNs for Tumor Sub-regions Segmentation from 3D Brain MRI Volumes
Article No.: 27, Pages 1–8https://doi.org/10.1145/3571600.3571628

In this study, we have performed brain tumor segmentation on a publicly available BraTS 2019 dataset. The training data contains multi-modal 3D volumetric brain MRI data for 259 High Grade Glioma (HGG) cases and 76 Low Grade Glioma (LGG) cases. The ...

research-article
A Dataset and Model for Crossing Indian Roads
Article No.: 28, Pages 1–8https://doi.org/10.1145/3571600.3571629

Roads in medium-sized Indian towns often have lots of traffic but no (or disregarded) traffic stops. This makes it hard for the blind to cross roads safely, because vision is crucial to determine when crossing is safe. Automatic and reliable image-...

research-article
Towards Realistic Underwater Dataset Generation and Color Restoration✱
Article No.: 29, Pages 1–9https://doi.org/10.1145/3571600.3571630

Recovery of true color from underwater images is an ill-posed problem. This is because the wide-band attenuation coefficients for the RGB color channels depend on object range, reflectance, etc. which are difficult to model. Also, there is ...

research-article
A Novel Statistical High Density Salt-and-Pepper Noise Removal Algorithm for Brain Magnetic Resonance Images
Article No.: 30, Pages 1–9https://doi.org/10.1145/3571600.3571631

Brain Magnetic Resonance Imaging (MRI) is a non-invasive technique that produces high quality images of the brain and is most suitable for analysis and diagnosis. However, these images can be soiled with noise during image acquisition or transmission. ...

Contributors
Index terms have been assigned to the content through auto-classification.

Recommendations

Acceptance Rates

Overall Acceptance Rate 95 of 286 submissions, 33%
YearSubmittedAcceptedRate
ICVGIP '162869533%
Overall2869533%