  • Automatic Reconstruction of Roof Overhangs for 3D City Models
  • Real-Time Physics-Based Mesh Deformation with Haptic Feedback and Material Anisotropy
  • Improved Directional Guidance with Transparent AR Displays
  • Optimal Activation Function for Anisotropic BRDF Modeling
  • Automatic Prediction of 3D Checkpoints for Technical Gesture Learning in Virtual Environments
  • Local Reflectional Symmetry Detection in Point Clouds Using a Simple PCA-Based Shape Descriptor
  • Unifying Human Motion Synthesis and Style Transfer with Denoising Diffusion Probabilistic Models
  • Deep Interactive Volume Exploration Through Pre-Trained 3D CNN and Active Learning
  • An Immersive Feedback Framework for Scanning Probe Microscopy
  • Accurate Cutting of MSDM-Based Hybrid Surface Meshes
  • Experimental Setup and Protocol for Creating an EEG-signal Database for Emotion Analysis Using Virtual Reality Scenarios
  • Multiclass Texture Synthesis Using Generative Adversarial Networks
  • Dense Point-to-Point Correspondences Between Genus-Zero Shapes Using Cubic Mapping and Horn-Schunck Optical Flow
  • Shape Morphing as a Minimal Path in the Graph of Cubified Shapes
  • Topological Data Structure: The Fast Marching Example
  • Computerised Muscle Modelling and Simulation for Interactive Applications
  • Development of a Realistic Crowd Simulation Environment for Fine-Grained Validation of People Tracking Methods
  • Analysis of Wettability Model Using Adhesional and Spreading Works
  • Colour-Field Based Particle Categorization for Residual Stress Detection and Reduction in Solid SPH Simulations
  • Real-Time Volume Editing on Low-Power Virtual Reality Devices
  • Cartesian Robot Controlling with Sense Gloves and Virtual Control Buttons: Development of a 3D Mixed Reality Application
  • Sampling-Distribution-Based Evaluation for Monte Carlo Rendering
  • Mobile Augmented Reality for Analysis of Solar Radiation on Facades
  • Biometric Evaluation to Measure Brain Activity and Users Experience Using Electroencephalogram (EEG) Device
  • GroupGazer: A Tool to Compute the Gaze per Participant in Groups with Integrated Calibration to Map the Gaze Online to a Screen or Beamer Projection
  • Pistol: PUpil INvisible SUpportive TOOl to Extract Pupil, Iris, Eye Opening, Eye Movements, Pupil and Iris Gaze Vector, and 2D as Well as 3D Gaze
  • The Gaze and Mouse Signal as Additional Source for User Fingerprints in Browser Applications
  • Virtual Reality Simulation for Multimodal and Ubiquitous System Deployment
  • The VVAD-LRS3 Dataset for Visual Voice Activity Detection
  • Virtual Avatar Creation Support System for Novices with Gesture-Based Direct Manipulation and Perspective Switching
  • Language Agnostic Gesture Generation Model: A Case Study of Japanese Speakers' Gesture Generation Using English Text-to-Gesture Model
  • Towards Enhanced Guiding Mechanisms in VR Training Through Process Mining
  • Measuring Emotion Intensity: Evaluating Resemblance in Neural Network Facial Animation Controllers and Facial Emotion Corpora
  • Interaction-based Implicit Calibration of Eye-Tracking in an Aircraft Cockpit
  • Analysis of the User Experience (UX) of Design Interactions for a Job-Related VR Application
  • VR Virtual Prototyping Application for Airplane Cockpit: A Human-centred Design Validation
  • Usability Assessment in Scientific Data Analysis: A Literature Review
  • Co-creation of Ethical Guidelines for Designing Digital Solutions to Support Industrial Work
  • It’s not Just What You Do but also When You Do It: Novel Perspectives for Informing Interactive Public Speaking Training
  • eHMI Design: Theoretical Foundations and Methodological Process
  • Can Pupillary Responses while Listening to Short Sentences Containing Emotion Induction Words Explain the Effects on Sentence Memory?
  • Improving Throughput of Mobile Robots in Narrow Aisles
  • Spatial Positions of Operator's Finger and Operation Device Influencing Sense of Direct Manipulation and Operation Performance
  • Towards Identifying Concepts in Persuasive Social Networks: Case Study TikTok
  • On the Importance of User Role-Tailored Explanations in Industry 5.0
  • Supporting Online Game Players by the Visualization of Personalities and Skills Based on in-Game Statistics
  • Fighting Disinformation: Overview of Recent AI-Based Collaborative Human-Computer Interaction for Intelligent Decision Support Systems
  • An Immersive Virtual Reality Application to Preserve the Historical Memory of Tangible and Intangible Heritage
  • Measuring User Trust in an in-Vehicle Information System: A Comparison of Two Subjective Questionnaires
  • Comparing Conventional and Conversational Search Interaction Using Implicit Evaluation Methods
  • Examining the Potential for Conversational Exploratory Search Using a Smart Speaker Digital Assistant
  • Can Visual Information Reduce Anxiety During Autonomous Driving? Analysis and Reduction of Anxiety Based on Eye Movements in Passengers of Autonomous Personal Mobility Vehicles
  • Safety Education Method for Older Drivers to Correct Overestimation of Their Own Driving
  • Happy or Sad, Smiling or Drawing: Multimodal Search and Visualisation of Movies Based on Emotions Along Time
  • A Service-Based Preset Recommendation System for Image Stylization Applications
  • Stereoscopy in User: VR Interaction
  • Visualizing Grassmannians via Poincare Embeddings
  • Damast: A Visual Analysis Approach for Religious History Research
  • Contrast Driven Color-Group Assignment in Categorical Data Visualization
  • Visual Document Exploration with Adaptive Level of Detail: Design, Implementation and Evaluation in the Health Information Domain
  • Trajectory-Based Dynamic Boundary Map Labeling
  • The Compilation of 2D and 3D Dynamic Visualizations
  • Viewpoint-Based Quality for Analyzing and Exploring 3D Multidimensional Projections
  • Model Order in Sugiyama Layouts
  • Evaluating Differences in Insights from Interactive Dimensionality Reduction Visualizations Through Complexity and Vocabulary
  • Using Well-Known Techniques to Visualize Characteristics of Data Quality
  • Heart Rate Visualizations on a Virtual Smartwatch to Monitor Physical Activity Intensity
  • XAIVIER the Savior: A Web Application for Interactive Explainable AI in Time Series Data
  • Towards a Visual Analytics Workflow for Cybersecurity Simulations
  • Evaluating Architectures and Hyperparameters of Self-supervised Network Projections
  • BigGraphVis: Visualizing Communities in Big Graphs Leveraging GPU-Accelerated Streaming Algorithms
  • The HORM Diagramming Tool: A Domain-Specific Modelling Tool for SME Cybersecurity Awareness
  • Interactive Exploration of Complex Heterogeneous Data: A Use Case on Understanding City Economics
  • A Comparative Study on Vision Transformers in Remote Sensing Building Extraction
  • On Metavisualization and Properties of Visualization
  • An Interactive Graph Layout Constraint Framework
  • Supporting University Research and Administration via Interactive Visual Exploration of Bibliographic Data
  • Visual Analysis of Multi-Labelled Temporal Noise Data from Multiple Sensors
  • MR to CT Synthesis Using GANs: A Practical Guide Applied to Thoracic Imaging
  • A Survey of Geospatial-Temporal Visualizations for Military Operations
  • Salient Mask-Guided Vision Transformer for Fine-Grained Classification
  • Railway Switch Classification Using Deep Neural Networks
  • Deep Learning Semantic Segmentation Models for Detecting the Tree Crown Foliage
  • Generative Adversarial Network Synthesis for Improved Deep Learning Model Training of Alpine Plants with Fuzzy Structures
  • A Model-agnostic Approach for Generating Saliency Maps to Explain Inferred Decisions of Deep Learning Models
  • Robust Path Planning in the Wild for Automatic Look-Ahead Camera Control
  • Flexible Extrinsic Structured Light Calibration Using Circles
  • Detection of Microscopic Fungi and Yeast in Clinical Samples Using Fluorescence Microscopy and Deep Learning
  • CoDA-Few: Few Shot Domain Adaptation for Medical Image Semantic Segmentation
  • Let’s Get the FACS Straight: Reconstructing Obstructed Facial Features
  • Classification and Embedding of Semantic Scene Graphs for Active Cross-Domain Self-Localization
  • Semantic Segmentation on Neuromorphic Vision Sensor Event-Streams Using PointNet++ and UNet Based Processing Approaches
  • Multi-Phase Relaxation Labeling for Square Jigsaw Puzzle Solving
  • Deformable and Structural Representative Network for Remote Sensing Image Captioning
  • Deep Distance Metric Learning for Similarity Preserving Embedding of Point Clouds
  • Point Cloud Neighborhood Estimation Method Using Deep Neuro-Evolution
  • Interactive Indoor Localization Based on Image Retrieval and Question Response
  • Fully Convolutional Neural Network for Event Camera Pose Estimation
  • Fine-Tuning Restricted Boltzmann Machines Using No-Boundary Jellyfish
  • An Extension of the Radial Line Model to Predict Spatial Relations
  • Persistent Homology Based Generative Adversarial Network
  • A Basic Tool for Improving Bad Illuminated Archaeological Pictures
  • AI-Powered Management of Identity Photos for Institutional Staff Directories
  • High-Level Workflow Interpreter for Real-Time Image Processing
  • Robust RGB-D-IMU Calibration Method Applied to GPS-Aided Pose Estimation
  • PG-3DVTON: Pose-Guided 3D Virtual Try-on Network
  • Adaptive Fourier Single-Pixel Imaging Based on Probability Estimation
  • Masking and Mixing Adversarial Training
  • Hand Segmentation with Mask-RCNN Using Mainly Synthetic Images as Training Sets and Repetitive Training Strategy
  • FakeRecogna Anomaly: Fake News Detection in a New Brazilian Corpus
  • Data-Driven Fingerprint Reconstruction from Minutiae Based on Real and Synthetic Training Data
  • 3D Ego-Pose Lift-Up Robustness Study for Fisheye Camera Perturbations
  • Extractive Text Summarization Using Generalized Additive Models with Interactions for Sentence Selection
  • Concept Explainability for Plant Diseases Classification
  • UMVpose++: Unsupervised Multi-View Multi-Person 3D Pose Estimation Using Ground Point Matching
  • Maritime Surveillance by Multiple Data Fusion: An Application Based on Deep Learning Object Detection, AIS Data and Geofencing
  • Model Fitting on Noisy Images from an Acoustofluidic Micro-Cavity for Particle Density Measurement
  • ALiSNet: Accurate and Lightweight Human Segmentation Network for Fashion E-Commerce
  • Exploiting GAN Capacity to Generate Synthetic Automotive Radar Data
  • Automatic Fracture Detection and Characterization in Borehole Images Using Deep Learning-Based Semantic Segmentation
  • Data-Efficient Transformer-Based 3D Object Detection
  • EHDI: Enhancement of Historical Document Images via Generative Adversarial Network
  • Two-Model-Based Online Hand Gesture Recognition from Skeleton Data
  • Synthesis for Dataset Augmentation of H&E Stained Images with Semantic Segmentation Masks
  • Fruit Defect Detection Using CNN Models with Real and Virtual Data
  • Search for Rotational Symmetry of Binary Images via Radon Transform and Fourier Analysis
  • Uncertainty-Aware DPP Sampling for Active Learning
  • N-MuPeTS: Event Camera Dataset for Multi-Person Tracking and Instance Segmentation
  • Football360: Introducing a New Dataset for Camera Calibration in Sports Domain
  • Trajectory Prediction in First-Person Video: Utilizing a Pre-Trained Bird's-Eye View Model
  • Estimation of Robot Motion Parameters Based on Functional Consistency for Randomly Stacked Parts
  • Finger-UNet: A U-Net Based Multi-Task Architecture for Deep Fingerprint Enhancement
  • Deep Neural Network Based Attention Model for Structural Component Recognition
  • Investigating the Performance of Optimization Techniques on Deep Learning Models to Identify Dota2 Game Events
  • Near-infrared Lipreading System for Driver-Car Interaction
  • Contactless Optical Detection of Nocturnal Respiratory Events
  • Image Quality Assessment for Object Detection Performance in Surveillance Videos
  • Memory-Efficient Implementation of GMM-MRCoHOG for Human Recognition Hardware
  • Complement Objective Mining Branch for Optimizing Attention Map
  • Counting People in Crowds Using Multiple Column Neural Networks
  • Study of Coding Units Depth for Depth Maps Quality Scalable Compression Using SHVC
  • Novel View Synthesis for Unseen Surgery Recordings
  • Shape-based Features Investigation for Preneoplastic Lesions on Cervical Cancer Diagnosis
  • A Low-Cost Process for Plant Motion Magnification for Smart Indoor Farming
  • Put Your PPE on: A Tool for Synthetic Data Generation and Related Benchmark in Construction Site Scenarios
  • SynMotor: A Benchmark Suite for Object Attribute Regression and Multi-Task Learning
  • Industrial Visual Defect Inspection of Electronic Components with Siamese Neural Network
  • Finding Similar non-Collapsed Faces to Collapsed Faces Using Deep Learning Face Recognition
  • Re-Learning ShiftIR for Super-Resolution of Carbon Nanotube Images
  • A Wearable Device Application for Human-Object Interactions Detection
  • Printed Packaging Authentication: Similarity Metric Learning for Rotogravure Manufacture Process Identification
  • Generating Pedestrian Views from In-Vehicle Camera Images
  • Adaptive Resolution Selection for Improving Segmentation Accuracy of Small Objects
  • Seeing Risk of Accident from In-Vehicle Cameras
  • Towards a Robust Solution for the Supermarket Shelf Audit Problem
  • Predicting Eye Gaze Location on Websites
  • EFL-Net: An Efficient Lightweight Neural Network Architecture for Retinal Vessel Segmentation
  • Sentiment-Based Engagement Strategies for Intuitive Human-Robot Interaction
  • Concept Study for Dynamic Vision Sensor Based Insect Monitoring
  • Multichannel Analysis in Weed Detection
  • ResNet Classifier Using Shearlet-Based Features for Detecting Change in Satellite Images
  • Fast Skeletons of Handwritten Texts in Digital Images
  • Image Quality Assessment in the Context of the Brazilian Electoral System
  • IncludeVote: Development of an Assistive Technology Based on Computer Vision and Robotics for Application in the Brazilian Electoral Context
  • Evaluation of U-Net Backbones for Cloud Segmentation in Satellite Images
  • Automatic Robotic Arm Calibration for the Integrity Test of Voting Machines in the Brazilian 2022's Election Context
  • ENIGMA: Egocentric Navigator for Industrial Guidance, Monitoring and Anticipation
  • Inverse Rendering Based on Compressed Spatiotemporal Information by Neural Networks
  • Colonoscopic Polyp Detection with Deep Learning Assist
  • Combined Unsupervised and Supervised Learning for Improving Chest X-Ray Classification
  • Toward a Thermal Image-Like Representation
  • Face-Based Gaze Estimation Using Residual Attention Pooling Network
  • Multimodal Light-Field Camera with External Optical Filters Based on Unsupervised Learning
  • You Can Dance! Generating Music-Conditioned Dances on Real 3D Scans
  • When Continual Learning Meets Robotic Grasp Detection: A Novel Benchmark on the Jacquard Dataset
  • Handwriting Recognition in Down Syndrome Learners Using Deep Learning Methods
  • An Unsupervised IR Approach Based Density Clustering Algorithm
  • FlexPooling with Simple Auxiliary Classifiers in Deep Networks
  • IACT: Intensive Attention in Convolution-Transformer Network for Facial Landmark Localization
  • TrichANet: An Attentive Network for Trichogramma Classification
  • Multimodal Unsupervised Spatio-Temporal Interpolation of Satellite Ocean Altimetry Maps
  • Turkish Sign Language Recognition Using CNN with New Alphabet Dataset
  • Neural Style Transfer for Image-Based Garment Interchange Through Multi-Person Human Views
  • Advanced Deep Transfer Learning Using Ensemble Models for COVID-19 Detection from X-ray Images
  • Towards Human-Interpretable Prototypes for Visual Assessment of Image Classification Models
  • Curriculum Learning for Compositional Visual Reasoning
  • Learning Less Generalizable Patterns for Better Test-Time Adaptation
  • FInC Flow: Fast and Invertible k × k Convolutions for Normalizing Flows
  • Emotion Transformer: Attention Model for Pose-Based Emotion Recognition
  • Efficient Deep Learning Ensemble for Skin Lesion Classification
  • Linking Data Separation, Visual Separation, and Classifier Performance Using Pseudo-labeling by Contrastive Learning
  • HaloAE: A Local Transformer Auto-Encoder for Anomaly Detection and Localization Based on HaloNet
  • Few-Shot Gaze Estimation via Gaze Transfer
  • Real-Time Monitoring of Crowd Panic Based on Biometric and Spatiotemporal Data
  • Dynamically Modular and Sparse General Continual Learning
  • Application of Deep Learning to the Detection of Foreign Object Debris at Aerodromes’ Movement Area
  • YCbCr Color Space as an Effective Solution to the Problem of Low Emotion Recognition Rate of Facial Expressions In-The-Wild
  • Applying Positional Encoding to Enhance Vision-Language Transformers
  • Brazilian Banknote Recognition Based on CNN for Blind People
  • Towards an Automatic System for Generating Synthetic and Representative Facial Data for Anonymization
  • Evaluation of Computer Vision-Based Person Detection on Low-Cost Embedded Systems
  • FPCD: An Open Aerial VHR Dataset for Farm Pond Change Detection
  • Triple-stream Deep Metric Learning of Great Ape Behavioural Actions
  • DEff-GAN: Diverse Attribute Transfer for Few-Shot Image Synthesis
  • Crane Spreader Pose Estimation from a Single View
  • 3D Mapping of Indoor Parking Space Using Edge Consistency Census Transform Stereo Odometry
  • On Computing Three-Dimensional Camera Motion from Optical Flow Detected in Two Consecutive Frames
  • Low-Cost 3D Reconstruction of Caves
  • How to Train an Accurate and Efficient Object Detection Model on any Dataset
  • Real-Time Obstacle Detection using a Pillar-based Representation and a Parallel Architecture on the GPU from LiDAR Measurements
  • Tackling Data Bias in Painting Classification with Style Transfer
  • VK-SITS: Variable Kernel Speed Invariant Time Surface for Event-Based Recognition
  • System for 3D Acquisition and 3D Reconstruction Using Structured Light for Sewer Line Inspection
  • Synthetic Driver Image Generation for Human Pose-Related Tasks
  • Body Part Information Additional in Multi-decoder Transformer-Based Network for Human Object Interaction Detection
  • BGD: Generalization Using Large Step Sizes to Attract Flat Minima
  • A Novel 3D Face Reconstruction Model from a Multi-Image 2D Set
  • Algorithmic Fairness Applied to the Multi-Label Classification Problem
  • Improvement of Vision Transformer Using Word Patches
  • The Effect of Covariate Shift and Network Training on Out-of-Distribution Detection
  • DeepCaps+: A Light Variant of DeepCaps
  • IFMix: Utilizing Intermediate Filtered Images for Domain Adaptation in Classification
  • A Lightweight Gaussian-Based Model for Fast Detection and Classification of Moving Objects
  • A Data Augmentation Strategy for Improving Age Estimation to Support CSEM Detection
  • Shuffle Mixing: An Efficient Alternative to Self Attention
  • Semantic Segmentation by Semi-Supervised Learning Using Time Series Constraint
  • 3D Human Body Reconstruction from Head-Mounted Omnidirectional Camera and Light Sources
  • 3D Reconstruction of Occluded Luminous Objects
  • Joint Training of Product Detection and Recognition Using Task-Specific Datasets
  • Human Motion Prediction on the IKEA-ASM Dataset
  • End-to-End Gaze Grounding of a Person Pictured from Behind
  • Automatic Defect Detection in Leather
  • Image Generation from a Hyper Scene Graph with Trinomial Hyperedges
  • Neural Architecture Search in the Context of Deep Multi-Task Learning
  • CrowdSim2: An Open Synthetic Benchmark for Object Detectors
  • Domain Adaptive Pedestrian Detection Based on Semantic Concepts
  • A Robust Deep Learning-Based Video Watermarking Using Mosaic Generation
  • Robust Semi-Supervised Anomaly Detection via Adversarially Learned Continuous Noise Corruption
  • An End-to-End Multi-Task Learning Model for Image-based Table Recognition
  • PanDepth: Joint Panoptic Segmentation and Depth Completion
  • Multi-Scale Feature Based Fashion Attribute Extraction Using Multi-Task Learning for e-Commerce Applications
  • On Attribute Aware Open-Set Face Verification
  • Self-Modularized Transformer: Learn to Modularize Networks for Systematic Generalization
  • Human Fall Detection from Sequences of Skeleton Features using Vision Transformer
  • Leveraging Unsupervised and Self-Supervised Learning for Video Anomaly Detection
  • Multi-Camera 3D Pedestrian Tracking Using Graph Neural Networks
  • Real-World Case Study of a Deep Learning Enhanced Elderly Person Fall Video-Detection System
  • Pyramid Swin Transformer: Different-Size Windows Swin Transformer for Image Classification and Object Detection
  • YOLO: You Only Look 10647 Times
  • Subjective Baggage-Weight Estimation from Gait: Can You Estimate How Heavy the Person Feels?
  • Visual Anomaly Detection and Localization with a Patch-Wise Transformer and Convolutional Model
  • An Experimental Consideration on Gait Spoofing
  • Flow-Based Visual-Inertial Odometry for Neuromorphic Vision Sensors Using non-Linear Optimization with Online Calibration
  • Banana Ripeness Level Classification Using a Simple CNN Model Trained with Real and Synthetic Datasets
  • Using Continual Learning on Edge Devices for Cost-Effective, Efficient License Plate Detection
  • Rotation Equivariance for Diamond Identification
  • Toward Few Pixel Annotations for 3D Segmentation of Material from Electron Tomography
  • FedBID and FedDocs: A Dataset and System for Federated Document Analysis
  • Estimating Distances Between People Using a Single Overhead Fisheye Camera with Application to Social-Distancing Oversight
  • Overcome Ethnic Discrimination with Unbiased Machine Learning for Facial Data Sets
  • How far Generated Data Can Impact Neural Networks Performance?
  • Object Detection in Floor Plans for Automated VR Environment Generation
  • Absolute-ROMP: Absolute Multi-Person 3D Mesh Prediction from a Single Image
  • Semi-Supervised Domain Adaptation with CycleGAN Guided by Downstream Task Awareness
  • MixedTeacher: Knowledge Distillation for Fast Inference Textural Anomaly Detection
  • Benchmarking Person Re-Identification Datasets and Approaches for Practical Real-World Implementations
  • Surface-Graph-Based 6DoF Object-Pose Estimation for Shrink-Wrapped Items Applicable to Mixed Depalletizing Robots
  • DeNos22: A Pipeline to Learn Object Tracking Using Simulated Depth
  • A General Context Learning and Reasoning Framework for Object Detection in Urban Scenes
  • Impact of Vehicle Speed on Traffic Signs Missed by Drivers
  • Transfer Learning for Word Spotting in Historical Arabic Documents Based Triplet-CNN
  • Combining Two Adversarial Attacks Against Person Re-Identification Systems
  • 1D-SalsaSAN: Semantic Segmentation of LiDAR Point Cloud with Self-Attention
  • Understanding of Feature Representation in Convolutional Neural Networks and Vision Transformer
  • Fast Eye Detector Using Siamese Network for NIR Partial Face Images
  • Smoothed Normal Distribution Transform for Efficient Point Cloud Registration During Space Rendezvous
  • Exploring Deep Learning Capabilities for Coastal Image Segmentation on Edge Devices
  • False Negative Reduction in Semantic Segmentation Under Domain Shift Using Depth Estimation
  • Upper Bound Tracker: A Multi-Animal Tracking Solution for Closed Laboratory Settings
  • A Patch-Based Architecture for Multi-Label Classification from Single Positive Annotations
  • Unfolding Local Growth Rate Estimates for (Almost) Perfect Adversarial Detection
  • A Multi-Class Probabilistic Optimum-Path Forest
  • Mixing Augmentation and Knowledge-Based Techniques in Unsupervised Domain Adaptation for Segmentation of Edible Insect States
  • Combining Metric Learning and Attention Heads for Accurate and Efficient Multilabel Image Classification
  • Multi-View Video Synthesis Through Progressive Synthesis and Refinement
  • WSAM: Visual Explanations from Style Augmentation as Adversarial Attacker and Their Influence in Image Classification
  • Monocular Depth Estimation for Tilted Images via Gravity Rectifier
  • Prediction of Shuttle Trajectory in Badminton Using Player's Position
  • Beyond the Third Dimension: How Multidimensional Projections and Machine Learning Can Help Each Other
  • Quantitative Analysis to Find the Optimum Scale Range for Object Representations in Remote Sensing Images
  • DaDe: Delay-Adaptive Detector for Streaming Perception
  • Environmental Information Extraction Based on YOLOv5-Object Detection in Videos Collected by Camera-Collars Installed on Migratory Caribou and Black Bears in Northern Quebec
  • Human Object Interaction Detection Primed with Context
  • Rethinking the Backbone Architecture for Tiny Object Detection
  • Fast and Reliable Template Matching Based on Effective Pixel Selection Using Color and Intensity Information
  • Vegetation Coverage and Urban Amenity Mapping Using Computer Vision and Machine Learning
  • 3D Semantic Scene Reconstruction from a Single Viewport
  • A Deep Learning Approach for Estimating the Rind Thickness of Trentingrana Cheese from Images
  • TabProIS: A Transfer Learning-Based Model for Detecting Tables in Product Information Sheets
  • An Anisotropic and Asymmetric Causal Filtering Based Corner Detection Method
  • Layer-wise External Attention for Efficient Deep Anomaly Detection
  • Normalised Color Distances
  • Fuzzy Inference System in a Local Eigenvector Based Color Image Smoothing Framework
  • Multi-Scale Surface Normal Estimation from Depth Maps
  • 3D Reference-Based Skeletal Movement Evaluation
  • FUB-Clustering: Fully Unsupervised Batch Clustering
  • Intrinsic Image Decomposition: Challenges and New Perspectives
  • From Depth Sensing to Deep Depth Estimation for 3D Reconstruction: Open Challenges
  • Deep Learning and Medical Image Analysis: Epistemology and Ethical Issues
  • An Integrated Mobile Vision System for Enhancing the Interaction of Blind and Low Vision Users with Their Surroundings
  • Automatic Defect Detection in Sewer Network Using Deep Learning Based Object Detector
  • Facial Expression Recognition with Quarantine Face Masks Using a Synthetic Dataset Generator
  • A Global Multi-Temporal Dataset with STGAN Baseline for Cloud and Cloud Shadow Removal
  • Climbing with Virtual Mentor by Means of Video-Based Motion Analysis
  • Unsupervised Domain Adaptation for Video Violence Detection in the Wild
  • Emotion Based Music Visualization with Fractal Arts
  • Application of Particle Detection Methods to Solve Particle Overlapping Problems
  • Handling Data Heterogeneity in Federated Learning with Global Data Distribution