Image processing Books
Springer Computer Vision ECCV 2024
Book SynopsisMutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection.- Self-Supervised Video Copy Localization with Regional Token Representation.- Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models.- RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF.- Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture.- ControlLLM: Augment Language Models with Tools by Searching on Graphs.- UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction.- DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors.- Vamos: Versatile Action Models for Video Understanding.- Prioritized Semantic Learning for Zero-shot Instance Navigation.- RoadPainter: Points Are Ideal Navigators for Topology transformER.- FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis.- Can OOD Object Detectors Learn from Foundation Models?.- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion.- MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo.- Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training.- Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation.- Real-data-driven 2000 FPS Color Video from Mosaicked Chromatic Spikes.- Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging.- TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts.- RadEdit: stress-testing biomedical vision models via diffusion image editing.- SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow.- AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution.- Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion.- Towards Real-world Event-guided Low-light 
Video Enhancement and Deblurring.- Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation.- TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks.
£64.99
Springer Computer Vision ECCV 2024
Book SynopsisUpper-body Hierarchical Graph for Skeleton Based Emotion Recognition in Assistive Driving.- Fine-Grained Scene Graph Generation via Sample-Level Bias Prediction.- Exploring Guided Sampling of Conditional GANs.- MotionChain: Conversational Motion Controllers via Multimodal Prompts.- Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition.- Latent Guard: a Safety Framework for Text-to-image Generation.- MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion.- TCC-Det: Temporarily consistent cues for weakly-supervised 3D detection.- OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection.- FoundPose: Unseen Object Pose Estimation with Foundation Features.- Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation.- Kalman-Inspired Feature Propagation for Video Face Super-Resolution.- Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models.- VideoMamba: State Space Model for Efficient Video Understanding.- SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging.- Heterogeneous Graph Learning for Scene Graph Prediction in 3D Point Clouds.- Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving.- Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models.- Deep Cost Ray Fusion for Sparse Depth Video Completion.- GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection.- DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video.- GraspXL: Generating Grasping Motions for Diverse Objects at Scale.- Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models.- Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models.- JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation.- 
Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals.- Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection.
£64.99
Springer Cancer Prevention Detection and Intervention
Book Synopsis: Classification and characterization.- Multi-center ovarian tumor classification using hierarchical transformer-based multiple-instance learning.- FoTNet Enables Preoperative Differentiation of Malignant Brain Tumors with Deep Learning.- Classification of Endoscopy and Video Capsule Images using Hybrid Model.- Multimodal Deep Learning-based Prediction of Immune Checkpoint Inhibitor Efficacy in Brain Metastases.- Seeing More with Less: Meta-Learning and Diffusion Models for Tumor Characterization in Low-data Settings.- Performance Evaluation of Deep Learning and Transformer Models Using Multimodal Data for Breast Cancer Classification.- Detection and Segmentation.- On undesired emergent behaviors in compound prostate cancer detection systems.- Optimizing Multi-Expert Consensus for Classification and Precise Localization of Barrett's Neoplasia.- Automated Hepatocellular Carcinoma Analysis in Multi-Phase CT with Deep Learning.- Refining deep learning segmentation maps with a local thresholding approach: application to liver surface nodularity quantification in CT.- Uncertainty-Aware Deep Learning Classification for MRI-based Prostate Cancer Detection.- Generalized Polyp Detection from Colonoscopy frames Using proposed EDF-YOLO8 Network.- AI-Assisted Laryngeal Examination System.- UltraWeak: Enhancing Breast Ultrasound Cancer Detection with Deformable DETR and Weak Supervision.- SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling.- Cancer/Early cancer detection, treatment, and survival prognosis.- AI Age Discrepancy: A Novel Parameter for Frailty Assessment in Kidney Tumor Patients.- Deep Neural Networks for Predicting Recurrence and Survival in Patients with Esophageal Cancer After Surgery.- Treatment efficacy prediction of focused ultrasound therapies using multi-parametric magnetic resonance imaging.- SurRecNet: A Multi-Task Model with Integrating MRI and Diagnostic Descriptions for
Rectal Cancer Survival Analysis.- Improved prediction of recurrence after prostate cancer radiotherapy using multimodal data and in silico simulations.- AutoDoseRank: Automated Dosimetry-informed Segmentation Ranking for Radiotherapy.- SurvCORN: Survival Analysis with Conditional Ordinal Ranking Neural Network.
£49.99
Springer Computer Vision ECCV 2024
Book SynopsisSLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking.- Tensorial template matching for fast cross-correlation with rotations and its application for tomography.- FreeAugment: Data Augmentation Search Across All Degrees of Freedom.- Learning Representations of Satellite Images From Metadata Supervision.- I2-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM.- FlashTex: Fast Relightable Mesh Texturing with LightControlNet.- GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence.- ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling.- PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance.- SOS: Segment Object System for Open-World Instance Segmentation With Object Priors.- Lagrangian Hashing for Compressed Neural Field Representations.- EDformer: Transformer-Based Event Denoising Across Varied Noise Levels.- Foster Adaptivity and Balance in Learning with Noisy Labels.- MetaAug: Meta-Data Augmentation for Post-Training Quantization.- Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis.- Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach.- Unleashing the Power of Prompt-driven Nucleus Instance Segmentation.- Gaze Target Detection Based on Head-Local-Global Coordination.- 3DSA:Multi-View 3D Human Pose Estimation With 3D Space Attention Mechanisms.- Toward Tiny and High-quality Facial Makeup with Data Amplify Learning.- An Economic Framework for 6-DoF Grasp Detection.- GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction.- Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning.- AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer.- Multi-Label Cluster Discrimination for Visual Representation Learning.- Plan, Posture and Go: Towards Open-vocabulary Text-to-Motion 
Generation.- DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion.
£71.99
Springer Computer Vision ECCV 2024
Book SynopsisCLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks.- Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering.- Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds.- A New Dataset and Framework for Real-World Blurred Images Super-Resolution.- AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization.- RISurConv: Rotation Invariant Surface Attention-Augmented Convolutions for 3D Point Cloud Classification and Segmentation.- StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models.- Bidirectional Uncertainty-Based Active Learning for Open-Set Annotation.- Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective.- Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation.- SeiT++: Masked Token Modeling Improves Storage-efficient Training.- Rectify the Regression Bias in Long-Tailed Object Detection.- MagicEraser: Erasing Any Objects via Semantics-Aware Control.- Reliable Spatial-Temporal Voxels For Multi-Modal Test-Time Adaptation.- Stable Preference: Redefining training paradigm of human preference model for Text-to-Image Synthesis.- SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images.- NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model.- Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities.- Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers.- Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification.- 3D Small Object Detection with Dynamic Spatial Pruning.- STSP: Spatial-Temporal Subspace Projection for Video Class-incremental Learning.- Transferable 3D Adversarial Shape Completion using Diffusion Models.- OmniSat: Self-Supervised Modality Fusion for Earth Observation.- Distilling Diffusion 
Models into Conditional GANs.- Semantically Guided Representation Learning For Action Anticipation.- MemBN: Robust Test-Time Adaptation via Batch Norm with Statistics Memory.
£64.99
Springer Computer Vision ECCV 2024
Book SynopsisFREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions.- ScanTalk: 3D Talking Heads from Unregistered Scans.- Controllable Navigation Instruction Generation with Chain of Thought Prompting.- GiT: Towards Generalist Vision Transformer through Universal Language Interface.- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention.- A Cephalometric Landmark Regression Method based on Dual-encoder for High-resolution X-ray Image.- Exploring the Feature Extraction and Relation Modeling For Light-Weight Transformer Tracking.- LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment.- You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation.- Gaussian Grouping: Segment and Edit Anything in 3D Scenes.- CoMo: Controllable Motion Generation through Language Guided Pose Code Editing.- MegaScenes: Scene-Level View Synthesis at Scale.- SuperGaussian: Repurposing Video Models for 3D Super Resolution.- Towards Model-Agnostic Dataset Condensation by Heterogeneous Models.- Goldfish: Vision-Language Understanding of Arbitrarily Long Videos.- MeshFeat: Multi-Resolution Features for Neural Fields on Meshes.- Decoupling Common and Unique Representations for Multimodal Self-supervised Learning.- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training.- Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation.- 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction.- Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models.- D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction.- Combining Generative and Geometry Priors for Wide-Angle Portrait Correction.- RealViformer: Investigating Attention for Real-World Video Super-Resolution.- Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution.- Decomposed Vector-Quantized 
Variational Autoencoder for Human Grasp Generation.- UniFS: Universal Few-shot Instance Perception with Point Representations.
£71.99
Springer Computer Vision ECCV 2024
Book Synopsis: SemanticHuman-HD: High Resolution Semantic disentangled 3D Human Generation.- CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians.- Monocular Occupancy Prediction for Scalable Indoor Scenes.- Visual Grounding for Object-Level Generalization in Reinforcement Learning.- 3DEgo: 3D Editing on the Go!.- Efficient Depth-Guided Urban View Synthesis.- Probabilistic Weather Forecasting with Deterministic Guidance-based Diffusion Model.- Domain-adaptive Video Deblurring via Test-time Blurring.- Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures.- NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving.- OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing.- Progressive Pretext Task Learning for Human Trajectory Prediction.- Hyperion: A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM.- Isomorphic Pruning for Vision Models.- Attention Prompting on Image for Large Vision-Language Models.- Learning Cross-hand Policies of High-DOF Reaching and Grasping.- Reprojection Errors as Prompts for Efficient Scene Coordinate Regression.- Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning.- Long-Tail Temporal Action Segmentation with Group-wise Temporal Logit Adjustment.- REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models.- DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing.- VideoClusterNet: Self-Supervised and Adaptive Face Clustering for Videos.- Unveiling Privacy Risks in Stochastic Neural Networks Training: Effective Image Reconstruction from Gradients.- Controlling the World by Sleight of Hand.- Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack.- Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection.-
Cross-Domain Learning for Video Anomaly Detection with Limited Supervision.
£64.99
Springer Foundation Models for General Medical AI
Book Synopsis.- FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification..- The Importance of Downstream Networks in Digital Pathology Foundation Models..- Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation..- Navigating Data Scarcity using Foundation Models: A Benchmark of Few-Shot and Zero-Shot Learning Approaches in Medical Imaging..- AutoEncoder-Based Feature Transformation with Multiple Foundation Models in Computational Pathology..- OSATTA: One-Shot Automatic Test Time Augmentation for Domain Adaptation..- Automating MedSAM by Learning Prompts with Weak Few-Shot Supervision..- SAT-Morph: Unsupervised Deformable Medical Image Registration using Vision Foundation Models with Anatomically Aware Text Prompt..- Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs..- D- Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions..- Optimal Prompting in SAM for Few-Shot and Weakly Supervised Medical Image Segmentation..- UniCrossAdapter: Multimodal Adaptation of CLIP for Radiology Report Generation..- TUMSyn: A Text-Guided Generalist model for Customized Multimodal MR Image Synthesis..- SAMU: An Efficient and Promptable Foundation Model for Medical Image Segmentation..- Anatomical Embedding-Based Training Method for Medical Image Segmentation Foundation Models..- Boosting Vision-Language Models for Histopathology Classification: Predict all at once..- MAGDA: Multi-agent guideline-driven diagnostic assistance.
£44.99
Springer Biomedical Image Registration
Book Synopsis: Architectures.- multiGradICON A Foundation Model for Multimodal Medical Image Registration.- XSynthMorph Generative Guided Deformation for Unsupervised Ill Posed Volumetric Recovery.- Large Deformation Registration with A Confidence guided Network.- Unleashing Registration Diffusion Models for Synthetic Paired 3D Training Data.- Feedback Attention for Unsupervised Cardiac Motion Estimation in 3D Echocardiography.- Learning Intra Patient Liver Registration with Graph Cross Attention.- Mamba Catch The Hype Or Rethink What Really Helps for Image Registration.- Robustness.- Assessing the Robustness of Image Registration Models Under Domain Shifts with Learnable Input Images.- Challenging the Robustness of Image Registration Similarity Metrics with Adversarial Attacks.- Comparative Study on Co Registration Techniques for Diffusion Weighted Breast MRI and Improved ADC Mapping.- A Learning Free Approach to Mitigate Abnormal Deformations in Medical Image Registration.- Deformable MRI Sequence Registration for AI based Prostate Cancer Diagnosis.- Atlas Fusion.- SINA Sharp Implicit Neural Atlases by Joint Optimisation of Representation and Deformation.- A Novel Fusion of CT MRI and US Images Based on Depth Camera and Electromagnetic Tracking.- Deep Learning Multi Channel Structural and Diffusion Tensor Neonatal Image Registration.- Registration by Regression RbR a Framework for Interpretable and Flexible Atlas Registration.- Diffusion Model Based Hierarchical Registration Framework for Whole Body Image.- Feature Similarity Learning.- Unsupervised Similarity Learning for Image Registration with Energy Based Models.- Segmentation by registration enabled SAM prompt engineering using five reference images.- Electron Microscopy Image Registration with Twin Axial Transformer and Progressive Training.- General Vision Encoder Features as Guidance in Medical Image Registration.- Rigid Single Slice in Volume registration via rotation equivariant 2D 3D feature matching.- A
Self Supervised Image Registration Approach for Measuring Local Response Patterns in Metastatic Ovarian Cancer.- CAR Contrast Agnostic Deformable Medical Image Registration with Contrast Invariant Latent Regularization.- Efficiency.- High Performance Groupwise Cortical Surface Registration with Multimodal Surface Matching.- Optimising Region of Interest Registration for Multiple Tissue Whole Slide Images.- Automatic Registration of SHG and H E Images with Feature based Initial Alignment and Intensity based Instance Optimization Contribution to the COMULIS Challenge.- Towards Fast and Accurate Non rigid Liver Fusion.
£59.99
Springer Computer Vision ECCV 2024
Book Synopsis: MRSP: Learn Multi-Representations of Single Primitive for Compositional Zero-Shot Learning.- Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs.- TrafficNight: An Aerial Multimodal Benchmark For Nighttime Vehicle Surveillance.- Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing.- Towards Open Domain Text-Driven Synthesis of Multi-Person Motions.- Generative End-to-End Autonomous Driving.- Learning to Distinguish Samples for Generalized Category Discovery.- COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset as a Vision-Language Benchmark.- PILoRA: Prototype Guided Incremental LoRA for Federated Class-Incremental Learning.- Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem.- WBP: Training-time Backdoor Attacks through Hardware-based Weight Bit Poisoning.- Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice.- Encapsulating Knowledge in One Prompt.- Cross-Input Certified Training for Universal Perturbations.- Visual Relationship Transformation.- Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data.- Delving into Adversarial Robustness on Document Tampering Localization.- Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing.- Confidence-Based Iterative Generation for Real-World Image Super-Resolution.- Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy.- Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection.- Seeing Faces in Things: A Model and Dataset for Pareidolia.- Cocktail Universal Adversarial Attack on Deep Neural Networks.- Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering.- AMD: Automatic Multi-step Distillation of Large-scale Vision Models.- FairViT: Fair Vision Transformer via Adaptive Masking.- TrojVLM: Backdoor Attack Against Vision Language Models.
£64.99
Springer Computer Vision ECCV 2024
Book SynopsisGeoCalib: Learning Single-image Calibration with Geometric Optimization.- 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation.- Semicalibrated Relative Pose from an Affine Correspondence and Monodepth.- Global Structure-from-Motion Revisited.- MobileNetV4: Universal Models for the Mobile Ecosystem.- Gravity-aligned Rotation Averaging with Circular Regression.- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation.- Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments.- Quanta Video Restoration.- Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models.- CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model.- ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image.- POCA: Post-training Quantization with Temporal Alignment for Codec Avatars.- HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts.- Finding Meaning in Points: Weakly Supervised Semantic Segmentation for Event Cameras.- Unsupervised Dense Prediction using Differentiable Normalized Cuts.- Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training.- Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization.- AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion.- Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers.- EINet: Point Cloud Completion via Extrapolation and Interpolation.- Personalized Video Relighting With an At-Home Light Stage.- Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction.- A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks.- SPIRE: Semantic Prompt-Driven Image Restoration.- Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images.- HiT-SR: Hierarchical 
Transformer for Efficient Image Super-Resolution.
£71.99
Springer Artificial Intelligence over Infrared Images for Medical Applications
Book Synopsis.- Thermal Radiomics for Early Detection of Diabetic Foot Ulcers Using Infrared Thermography..- Reverse Circular Logarithmic LBP for Diabetic Foot Ulcer Detection..- CNN Transformer for the Automated Detection of Rheumatoid Arthritis in Hand Thermal Images..- Generative Artificial Intelligence Approaches for Synthesizing High-Fidelity Breast Thermal Images..- About the validity of using DCGANs for data augmentation in breast thermography segmentation..- 3D-CNN for Breast Cancer Detection on Angular IR Images..- Evaluating Radiomics Feature Reduction for Thyroid Nodule Segmentation in Thermal Imaging..- Assessment of range of motion before and after hamstring percussion therapy using thermography and CNN..- The Effects of Balance and Strength on Thermal Heatmap..- The Future of Herpes Zoster Care: AI-Powered Thermal Imaging for Accurate Diagnosis and PHN Prediction.
£49.49
Springer Medical Image Computing and Computer Assisted Intervention MICCAI 2024 Workshops
Book Synopsis: LDTM Workshop.- Disease Progression Modelling and Stratification for detecting sub-trajectories in the natural history of pathologies: application to Parkinson's Disease trajectory modelling.- Back to the Future: Challenges of Sparse and Irregular Medical Image Time Series.- Individualized multi-horizon MRI trajectory prediction for Alzheimer's Disease.- Towards Longitudinal Characterization of Multiple Sclerosis Atrophy Employing SynthSeg Framework and Normative Modeling.- SegHeD: Segmentation of Heterogeneous Data for Multiple Sclerosis Lesions with Anatomical Constraints.- Longitudinal Segmentation of MS Lesions via Temporal Difference Weighting.- Registration of Longitudinal Liver Examinations for Tumor Progress Assessment.- Tracking lesion evolution using a Boundary Enhanced Approach for MS change segmentation (BEAMS).- A Radiological-based Coordinate System for the Human Body: A Proof-of-Concept.- MMMI-ML4MHD Workshop.- Language Models Meet Anomaly Detection for Better Interpretability and Generalizability.- A Diffusion Model Embedded WCSAU-Net for 3D MRI Brain Tumor Segmentation.- Predicting Human Brain States with Transformer.- Modality Image Quality Prediction for Time-Resolved CT from Breathing Signals.- RATNUS: Rapid, Automatic Thalamic Nuclei Segmentation using Multimodal MRI inputs.- HyperMM: Robust Multimodal Learning with Varying-sized Inputs.- EMIT: H&E to Multiplex-immunohistochemistry Image Translation with Dual-Branch Pix2pix Generator.- Physics-Informed Latent Diffusion for Multimodal Brain MRI Synthesis.- ML-CDS Workshop.- MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis.- Predicting Stroke through Retinal Graphs and Multimodal Self-supervised Learning.- Multimodality for Diagnosis of Asian Choroidal Vasculopathy: Results from a Novel Dataset and Deep-learning Experiments.- Multimodality Frequency Feature Customized Learning for Pediatric Ventricular Septal Defects Identification.
£49.99
Springer Pattern Recognition
Book Synopsis.- Clustering and Segmentation..- PARMESAN: Parameter-Free Memory Search and Transduction for Dense Prediction Tasks..- A State-of-the-Art Cutting Plane Algorithm for Clique Partitioning..- Self-Supervised Semantic Segmentation from Audio-Visual Data..- BTSeg: Barlow Twins Regularization for Domain Adaptation in Semantic Segmentation..- Learning Techniques..- FullCert: Deterministic End-to-End Certification for Training and Inference of Neural Networks..- Self-Masking Networks for Unsupervised Adaptation..- A Theoretical Formulation on the Use of Multiple Positive Views in Contrastive Learning.- Decoupling of neural network calibration measures..- Examining Common Paradigms in Multi-Task Learning..- DIAGen: Semantically Diverse Image Augmentation with Generative Models for Few-Shot Learning..- Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval ...- Anomaly Detection with Conditioned Denoising Diffusion Models..- Medical and Biological Applications..- SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network..- Foundation Models Permit Retinal Layer Segmentation Across OCT Devices..- Correlation Clustering of Organoid Images..- Animal Identification with Independent Foreground and Background Modeling..- Robust Tumor Segmentation with Hyperspectral Imaging and Graph Neural Networks..- Bigger Isn’t Always Better: Towards a General Prior for Medical Image Reconstruction..- Uncertainty and Explainability..- Latent Diffusion Counterfactual Explanations..- Enhancing Surface Neural Implicits with Curvature-Guided Sampling and Uncertainty-Augmented Representations..- Uncertainty Voting Ensemble for Imbalanced Deep Regression..- Analytical Uncertainty-Based Loss Weighting in Multi-Task Learning.
£59.99
Springer Pattern Recognition
Book Synopsis.- Modelling of Faces and Shapes..- 360° Volumetric Portrait Avatar..- How Do You Perceive My Face? Recognizing Facial Expressions in Multi-Modal Context by Modeling Mental Representations..- A Latent Implicit 3D Shape Model for Multiple Levels of Detail..- Image Generation and Reconstruction..- Coloring the Past: Neural Historical Monuments Reconstruction from Archival Photography..- Expanding the Image Embedding Space for Language-Free Text-to-Face Image Generation..- Towards synthetic generation of realistic wooden logs..- 3D Analysis and Synthesis..- G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles..- LiFCal: Online Light Field Camera Calibration via Bundle Adjustment..- CARLA Drone: Monocular 3D Object Detection from a Different Perspective..- Robust 3D Gaussian Splatting for Novel View Synthesis in Presence of Distractors..- DynaPix SLAM: A Pixel-Based Dynamic Visual SLAM Approach..- Leveraging Image Matching Toward End-to-End Relative Camera Pose Regression..- Erasing the Ephemeral: Joint Camera Refinement and Transient Object Removal for Street View Synthesis..- Physically Plausible Object Pose Refinement in Cluttered Scenes..- Gaussian Splatting in Style..- Video Analysis..- Bounding Boxes and Probabilistic Graphical Models: Video Anomaly Detection Simplified..- STAR: Screen Time and Actor Recognition in Video Content..- Photogrammetry and Remote Sensing..- Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery..- Worldwide High-fidelity Road Extraction from Aerial and Satellite Imagery enabled by Low-fidelity OpenStreetMap Labels..- SenPa-MAE: Sensor Parameter Aware Multi-Satellite Masked Autoencoder for Multispectral Earth Observation Imagery..- PuzzleBoard: A new Camera Calibration Pattern with Position Encoding.
£59.99
Springer Computational Diffusion MRI
Book Synopsis- Super-Resolution of Diffusion-Weighted Images via TDI-Conditioned Diffusion Model..- Diffusion-Based Gray-White Matter Mapping for Quantitative Tractography in Glioma Patients..- Ground-truth effects in learning-based fiber orientation distribution estimation in neonatal brains..- Synthesizing 3D axon morphology: springs are all we need..- Randomly COMMITting: Iterative Convex Optimization for Microstructure-Informed Tractography..- AID-DTI: Accelerating High-fidelity Diffusion Tensor Imaging with Detail-preserving Model-based Deep Learning..- Multi-dimensional Parameter Space Exploration for Streamline-specific Tractography..- Cross-domain Fiber Cluster Shape Analysis for Language Performance Cognitive Score Prediction..- Can Transfer Learning Improve Supervised Segmentation of White Matter Bundles in Glioma Patients..- Image Quality Transfer of Diffusion MRI Guided By High-Resolution Structural MRI..- QID2: An Image-Conditioned Diffusion Model for Q-space Up-sampling of DWI Data..- Ts-FWE: Token-Aware Single-shell Free Water Estimation for Brain Diffusion MRI..- Assessing Early Motor System Degeneration in the Spinal Cord of ALS Patients Using Diffusion MRI: An Exploratory Study..- RobNODDI: Robust NODDI Parameter Estimation with Adaptive Sampling under Continuous Representation..- Introducing QuantConn: Overcoming challenging diffusion acquisitions with harmonization..- Learning Low-Rank Tensor Approximation for GPU-based Tractography..- Deep multivariate autoencoder for capturing complexity in Brain Structure and Behaviour Relationships..- Heritability and Genetic Correlations Along the Corticospinal Tract..- Corpus Callosum Parcellation Methods: What Can Tractography Tell Us About Them?.
£48.44
Springer Intelligent and Efficient Video Moment Localization
Book SynopsisChapter 1: Introduction.- Chapter 2: Semantic Enhanced Video Moment Localization.- Chapter 3: Semantic Alignment Video Moment Localization.- Chapter 4: Semantic Pruning Video Moment Localization.- Chapter 5: Semantic Collaborative Video Moment Localization.- Chapter 6: Weakly-Supervised Video Moment Localization.- Chapter 7: Efficient Hashing based Video Moment Localization.- Chapter 8: Research Frontiers.
£133.48
£191.90
£68.40
£68.40
£66.49
£66.49
£66.49
Springer MICCAI Challenges 2024 ToothFairy 3DTeethLand STS LNCS
Book SynopsisToothFairy2: Multi-Structure Segmentation in CBCT Volumes.- Inferior Alveolar Nerve Segmentation in CBCT Images Using Connectivity-based Selective Re-training.- Scaling nnU-Net for CBCT Segmentation.- DiENTeS: Dynamic ENTity Segmentation with Local-Global Transformers.- Enhanced Multi-Structure Segmentation in CBCT Images with Adaptive Structure Optimization.- Weakly-Supervised Convolutional Neural Networks for Inferior Alveolar Nerve Segmentation in CBCT images.- A Multi-Axial Network for Oral Structural Segmentation.- Automatic Multi-Structure Segmentation in Cone Beam Computed Tomography Volumes Using Deep Encoder-Decoder Architectures.- Video Foundation Model for Medical 3D Segmentation.- STS: Semi-supervised Teeth Segmentation.- A Two-Stage Semi-Supervised nnU-Net Model for Automated Tooth Segmentation in Panoramic X-ray Images.- Two-Stage Semi-Supervised nnU-Net Framework for Tooth Segmentation in CBCT Images.- SemiT-SAM: Building a Visual Foundation Model for Tooth Instance Segmentation on Panoramic Radiographs.- Multi-stage Dental Visual Detection Based on YOLOv8: Dental 3D CBCT.- Efficient Semi-Supervised Tooth Instance Segmentation in Panoramic X-rays Using ResUnet50 and SAM Networks.- DAE-Net: Dual Attention Embedding-based Tooth Instance Segmentation Approach for Panoramic X-ray Images.- A Self-Training Pipeline for Semi-Supervised 2D Teeth Instance Segmentation.- Deformable Inherent Consistent Learning Network for Accurate Tooth Segmentation in Dental Panoramic Radiographs.- Semi-Supervised 2D Dental Image Segmentation via Cross Teaching Network.- A Novel Two-Stage Approach for 3D Dental Tooth Instance Segmentation.- 3DTeethLand24: 3D Teeth Landmarks Detection Challenge.- A Two-Stage Framework with Dual-Branch Network for End-to-End 3D Tooth Landmark Detection.- Leveraging Point Transformers for Detecting Anatomical Landmarks in Digital Dentistry.- ToothInstanceNet: Comprehensive Information from Intra-Oral Scans by Integration of Large-Context and High-Resolution Predictions.
£104.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisYCB-Ev 1.1: Event-vision dataset for 6DoF object pose estimation.- PCR-99: A Practical Method for Point Cloud Registration with 99 Percent Outliers.- Robust Single Rotation Averaging Revisited.- LanPose: Language-Instructed 6D Object Pose Estimation for Robotic Assembly.- SABER-6D: Shape Representation Based Implicit Object Pose Estimation.- FruitBin: a tunable large-scale dataset for advancing 6D pose estimation in fruit bin-picking automation.- Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation?.- What's Wrong with the Absolute Trajectory Error?.- MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation.- KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction.- LVG-SfM: Learning-based View-Graph generation for robust on-the-fly SfM.- Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces.- PoTATO: A Dataset for Analyzing Polarimetric Traces of Afloat Trash Objects.- EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment.- Object Pose Estimation Using Implicit Representation For Transparent Objects.- TRICKY 2024 Challenge on Monocular Depth from Images of Specular and Transparent Surfaces.- ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet.- Automatic Generation of Fashion Images using Prompting in Generative Machine Learning Models.- Fashion Attribute Extraction Under an Evolving Ontology.- Capturing and modeling real cloth deformations for virtual garment design.- MDiFF: Exploiting Multimodal Score-based Diffusion Models for New Fashion Product Performance Forecasting.- Deep Armocromia: A Novel Dataset for Face Seasonal Color Analysis and Classification.- DIVA: Deep Indic Virtual Apparel Try-On.- Garment Attribute Manipulation with Multi-level Attention.- Machine Learning-Driven Marketing Personas for the Luxury Fashion Market.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisSelf-supervised disentangled representation learning of artistic style through Neural Style Transfer.- Similar paintings retrieval from individual and multiple poses.- NeAT: Neural Artistic Tracing for high resolution Style Transfer.- DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer.- Analysis of Hybrid Compositions in Animation Film with Weakly Supervised Learning.- BackFlip: The Impact of Local and Global Data Augmentations on Artistic Image Aesthetic Assessment.- Cultural Heritage 3D Reconstruction with Diffusion Networks.- Context-Infused Visual Grounding for Art.- ColorwAI: Generative Colorways of Textiles through GAN and Diffusion Disentanglement.- Novel Artistic Scene-Centric Datasets for Effective Transfer Learning in Fragrant Spaces.- Evaluating Usability and Engagement of Large Language Models in Virtual Reality for Traditional Scottish Curling.- EUFCC-CIR: a composed image retrieval dataset for GLAM collections.- µgat: Improving Single-Page Document Parsing by Providing Multi-Page Context.- Pixels of Faith: Exploiting Visual Saliency to Detect Religious Image Manipulation.- Automatic Die Studies for Ancient Numismatics.- San Vitale Challenge: Automatic Reconstruction of Ancient Colored Glass Windows.- The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives.- An approach for dataset extension for object detection in artworks using open-vocabulary models.- A Data-Centric Module for Neural Rendering.- Structured Analysis of Alphabets in Historical Handwritten Ciphers.- Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans..- HybridFormer: Bridging Local and Global Spatio-Temporal Dynamics for Efficient Skeleton-Based Action Recognition..- MPL: Lifting 3D Human Pose from Multi-view 2D Poses..- THP3D: Text-Driven Multi-Granularity 3D Human Parsing..- ROMEO: Revisiting Optimization Methods for Reconstructing 3D Human-Object Interaction Models From Images..- Leveraging key-points Encoded Human Pose Images for Human Activity Recognition..- Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity..- Multi-Camera Industrial Open-Set Person Re-Identification and Tracking..- Enhanced Action Quality Assessment with Two-Stream Pose and Video Feature Integration..- PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation..- Enhancing Gait Recognition: Data Augmentation via Physics-Based Biomechanical Simulation..- Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB..- VATE: a Large Scale Multimodal Spontaneous Dataset for Affective Evaluation..- Guidelines for Query and Gallery Image Extraction in Person Re-Identification Systems..- Pose-independent 3D Anthropometry from Sparse Data..- Exploring 3D Face Reconstruction and Fusion Methods for Face Verification: A Case-Study in Video Surveillance..- FlexControl: Flexible and Efficient Full-Body Controllable Text-to-Motion Generation..- Coarse to Fine Human Mesh Recovery with Transformers..- Motion Reconstruction via Human Anatomy Diffusion from Sparse Tracking..- A vision-based framework for human behavior understanding in industrial assembly lines..- Boosting Pose Estimators via Cross-Representation Distillation..- Upper-Body Pose-based Gaze Estimation for Privacy-Preserving 3D Gaze Target Detection.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- AirLetters: An Open Video Dataset of Characters Drawn in the Air..- RegionGrasp: A Novel Task for Contact Region Controllable Hand Grasp Generation..- Generative Hierarchical Temporal Transformer for Hand Pose and Action Modeling..- Adaptive Multi-Modal Control of Digital Human Hand Synthesis Using a Region-Aware Cycle Loss..- Conditional Hand Image Generation using Latent Space Supervision in Random Variable Variational Autoencoders..- ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild..- EMAG: Ego-motion Aware and Generalizable 2D Hand Forecasting from Egocentric Videos..- Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents..- RGMIM: Region-Guided Masked Image Modeling for Learning Meaningful Representations from X-Ray Images..- A Biologically-inspired Approach to Biomedical Image Segmentation..- Human-based Low-Level Visual Processing Neural Network for Image Segmentation..- Glia Cell Inspired Reinforcement Learning Agent for Neural Network Optimization..- Growing Deep Neural Network Considering with Similarity between Neurons..- Representation Learning in a Decomposed Encoder Design for Bio-inspired Hebbian Learning..- Reducing Catastrophic Forgetting in Online Class Incremental Learning Using Self-Distillation..- What makes a face look like a hat: Decoupling low-level and high-level visual properties with image triplets..- ScanDDM: Generalised Zero-Shot Neuro-Dynamical Modelling of Goal-Directed Attention..- Limited but consistent gains in adversarial robustness by co-training object recognition models with human EEG..- Online Learning via Memory: Retrieval-Augmented Detector Adaptation..- AHMF: Adaptive Hybrid-Memory-Fusion Model for Driver Attention Prediction..- Accuracy Improvement of Cell Image Segmentation Using Feedback Former..- Variable resolution improves visual question answering under a limited pixel budget..- MOSAIC: Skeleton-based human motion recognition with compositional representations..- Adapting Large Language Model for Cross-Subject Semantic Decoding from Video-Stimulated fMRI..- A System 1 and System 2 Perspective on Continual Learning for Practical Implementation..- Connectivity-Inspired Network for Context-Aware Recognition..- Generalizability analysis of deep learning predictions of human brain responses to augmented and semantically novel visual stimuli.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- MMA-MRNNet: Harnessing Multiple Models of Affect and Dynamic Masked RNN for Precise Facial Expression Intensity Estimation..- ToddlerAct: A Toddler Action Recognition Dataset for Gross Motor Development Assessment..- 7th abaw competition: Multi-task learning and compound expression recognition..- Are Visual-Language Models Effective in Action Recognition? A Comparative Study..- Textualized and Feature-based Models for Compound Multimodal Emotion Recognition in the Wild..- Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints..- Single Image 3D Human Pose Estimation Using Sequential Joint Group Generation..- MVP: Multimodal Emotion Recognition based on Video and Physiological Signals..- Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech..- Massively Multi-Person 3D Human Motion Forecasting with Scene Context..- TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans..- Improving face generation quality and prompt following with synthetic captions..- Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings..- rPPG-SysDiaGAN: Systolic-Diastolic Feature Localization in rPPG Using Generative Adversarial Network with Multi-Domain Discriminator..- MST-KD: Multiple Specialized Teachers Knowledge Distillation for Fair Face Recognition..- Predicting Emotions in Interpersonal Interaction Videos: I Know What You Feel..- Multi-Task Affective Behaviour Analysis based on MT-EmotiNet Models..- Smoothing Predictions of Multi-Task EmotiNet Models for Compound Facial Expression Recognition..- ABAW7 Challenge: A Facial Affect Recognition Approach based on Transformer Encoder and Multilayer Perceptron..- Compound Expression Recognition via Curriculum Learning..- Boundary Matching and Refinement Network with Cross-modal Contrastive Learning for Temporal Moment Localization..- Enhancing Facial Expression Recognition through Dual-Direction Attention Mixed Feature Networks: Application to 7th ABAW Challenge..- Introducing Gating and Context into Temporal Action Detection..- Better Spanish Emotion Recognition In-the-wild: Bringing Attention to Deep Spectrum Voice Analysis..- Monitoring Viewer Attention During Online Ads..- Affective Behaviour Analysis via Progressive Learning..- Are We Friends? End-to-End Prediction of Child Rapport in Guided Play..- Affective Behavior Analysis using Task-adaptive and AU-assisted Graph..- Ig3D: Integrating 3D Face Representations in Facial Expression Inference.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval..- Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models..- Boundary Attention: Learning curves, corners, junctions and grouping..- Deep Learning Meets Satellite Images - An Evaluation on Handcrafted and Learning-based Features for Multi-date Satellite Stereo Images..- Image Color Consistency in Datasets: The Smooth-TPS3D Method..- Unsupervised Video Summarization: A Reconstruction Model with Proximal Gradient Methods..- TF-OCM: Training Free Optimal Community Matching for Domain Generalized Few Shot Learning..- Perspective-Equivariance for Unsupervised Imaging with Camera Geometry..- Reliable Probabilistic Human Trajectory Prediction for Autonomous Applications..- Calibration of Network Confidence for Unsupervised Domain Adaptation Using Estimated Accuracy..- Logit disagreement: OoD Detection with Bayesian Neural Networks..- Can Your Generative Model Detect Out-of-Distribution Covariate Shift?..- Multi-label out-of-distribution detection via evidential learning..- UTrack: Multi-Object Tracking with Uncertain Detections..- TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection..- Sanity Checks for Explanation Uncertainty..- Sources of Uncertainty in 3D Scene Reconstruction..- The BRAVO Semantic Segmentation Challenge Results in UNCV2024..- Maximally Separated Active Learning..- Hyperbolic Metric Learning for Visual Outlier Detection..- A Bottom-Up Approach to Class-Agnostic Image Segmentation..- Adversarial Attacks on Hyperbolic Networks..- Hyperbolic Learning with Multimodal Large Language Models..- Embedding Geometries of Contrastive Language-Image Pre-Training..- Learning Multi-Manifold Embedding for Out-Of-Distribution Detection..- ProxyDR: Deep Hyperspherical Metric Learning with Distance Ratio-Based Formulation..- Backward-Compatible Aligned Representations via Orthogonal Transformation Layer.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- DeepClean: Machine Unlearning on the Cheap by Resetting Privacy Sensitive Weights using the Fisher Diagonal..- Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models..- Aligning Vision Language Models with Contrastive Learning..- Open-set object detection: towards unified problem formulation and benchmarking..- Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts..- SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation..- Online Stochastic Optimization for Data with Temporal Dependencies..- A Lost Opportunity for Vision-Language Models: A Comparative Study of Online Test-Time Adaptation for Vision-Language Models..- OSSA: Unsupervised One-Shot Style Adaptation..- ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer..- Open-set Plankton Recognition..- Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation?..- On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes..- Source-Free Domain Adaptation for YOLO Object Detection..- Task-Specific Adaptation of Segmentation Foundation Model via Prompt Learning..- Utilizing Class-Agnostic Point-to-Box Regressors as Object Proposal Generators..- Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective..- Improving Generalization in Visual Reasoning via Self-Ensemble..- BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation..- Image Translation with Kernel Prediction Networks for Semantic Segmentation..- Robust fine-tuning and adaptation of zero-shot models via adaptive weight-space ensembling..- Robustness to Spurious Correlation: A Comprehensive Review.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- Fine-tuning a Multiple Instance Learning Feature Extractor with Masked Context Modelling and Knowledge Distillation..- Advancing Medical Radiograph Representation Learning: A Hybrid Pretraining Paradigm with Multilevel Semantic Granularity..- Can virtual staining for high-throughput screening generalize?..- SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical Images..- A Good Feature Extractor Is All You Need for Weakly Supervised Pathology Slide Classification..- Boosting Medical Image Registration Network Inherently via Collaborative Learning..- Genetic Information Analysis of Age-Related Macular Degeneration Fellow Eye Using Multi-Modal Selective ViT..- CHOTA: A Higher Order Accuracy Metric for Cell Tracking..- Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification..- Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images..- BATseg: Boundary-aware Multiclass Spinal Cord Tumor Segmentation on 3D MRI Scans..- Affinity-VAE: incorporating prior knowledge in representation learning from scientific images..- Towards the Discovery of Down Syndrome Brain Biomarkers Using Generative Models..- Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis..- SS-MIL: Attention-Based Selective Correlated Multiple Instance Learning for Whole Slide Image Classification..- MicroSSIM: Improved Structured Similarity for Comparing Microscopy Data..- Generalized Segmentation for Maxillary Sinus and Mandibular Canal in Dental Panoramic X-rays..- MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation..- NCT-CRC-HE: Not All Histopathological Datasets Are Equally Useful..- Tracking one-in-a-million: Large-scale benchmark for microbial single-cell tracking with experiment-aware robustness metrics..- A Novel Approach to Linking Histology Images with DNA Methylation.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisValeo4Cast: A Modular Approach to End-to-End Forecasting.- AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data.- Autonomous Drone-Person Tracking and Following in Uniform Appearance Scenarios.- Continual Reinforcement Learning with Implicit Generative Replay for Autonomous Driving.- Self-supervised Road Accident Anticipation with Non-decreasing Danger.- 3D Object Detection and Tracking Refinement with Ensemble Methods and Spatiotemporal Filtering.- Conditional Unscented Autoencoders for Trajectory Prediction.- Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation.- TrackLidFormer: a Transformer-based Approach for Occluded Object Tracking.- Good Data Is All Imitation Learning Needs.- What Matters in Autonomous Driving Anomaly Detection: A Weakly Supervised Horizon.- High Dynamic Range Modulo Imaging for Robust Object Detection in Autonomous Driving.- RLNet: Adaptive Fusion of 4D Radar and Lidar for 3D Object Detection.- Improving Online Source-Free Domain Adaptation for Object Detection by Unsupervised Data Acquisition.- AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving.- On Camera and LiDAR Positions in End-to-End Autonomous Driving.- ProGBA: Prompt Guided Bayesian Augmentation for Zero-shot Domain Adaptation.- ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable.- Loop Mining Large-Scale Unlabeled Data for Corner Case Detection in Autonomous Driving.- HumanSim: Human-Like Multi-Agent Novel Driving Simulation for Corner Case Generation.- Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D Visual Grounding.- RoSA Dataset: Road Construct zone Segmentation for Autonomous Driving.- A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection.- The Second Visual Object Tracking Segmentation VOTS2024 Challenge Results.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisMulti-agent Collaborative Perception for Robotic Fleet: A Systematic Review.- RP3D: A Roadside Perception Framework for 3D Object Detection via Multi-View Sensor Fusion.- StreamLTS: Query-based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection.- GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest.- SC-Track: State Transition and Constrained Non-negative Matrix Factorization for Multi-Camera Multi-Target Tracking.- Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones.- VICooper: A Practical Vehicle-Infrastructure Cooperative Perception Framework for Autonomous Driving.- MEDCO: Medical Education Copilots Based on A Multi-Agent Framework.- V2X-Based Decentralized Singular Value Decomposition in Dynamic Vehicular Environment.- LLaMAPed: Multi-modal Pedestrian Crossing Intention Prediction.- Optimization of Layer Skipping and Frequency Scaling for Convolutional Neural Networks under Latency Constraint.- An Infrastructure-based Localization Method for Articulated Vehicles.- HEAD: A Bandwidth-Efficient Cooperative Perception Approach for Heterogeneous Connected and Autonomous Vehicles.- Rethinking the Role of Infrastructure in Collaborative Perception.- Empowering Autonomous Shuttles with Next-Generation Infrastructure.- MAPPO-PIS: A Multi-Agent Proximal Policy Optimization Method with Prior Intent Sharing for CAVs' Cooperative Decision-Making.- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version).- iIPPC-V2X: Multi-modality Fusion Perception System for Cooperative Vehicle Infrastructure System with Self-supervised Learning.- Non-verbal Interaction and Interface with a Quadruped Robot using Body and Hand Gestures: Design and User Experience Evaluation.- Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisWild Berry image dataset collected in Finnish forests and peatlands using drones.- Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks.- A Framework for Enhanced Decision Support in Digital Agriculture Using Explainable Machine Learning.- Lincoln's Annotated Spatio-Temporal Strawberry Dataset (LAST-Straw).- 3D Phenotyping of Canopy Occupation Volume as a Major Predictor for Canopy Photosynthesis in Rice (Oryza sativa L.).- Retrieval of sun-induced plant fluorescence in the O2-A absorption band from DESIS imagery.- Unsupervised Tomato Split Anomaly Detection using Hyperspectral Imaging and Variational Autoencoders.- KAN You See It? KANs and Sentinel for Effective and Explainable Crop Field Segmentation.- RoWeeder: Unsupervised Weed Mapping through Crop-Row Detection.- Consolidation of symbolic instances using sensor data via tracklet merging for long-term monitoring of crops.- Automated Generation of Accurate, Compact and Focused Crop and Weed Segmentation Models.- Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection.- Towards Auto-Generated Ground Truth for Evaluation of Perception Systems in Agriculture.- AgriBench: A Hierarchical Agriculture Benchmark for Multimodal Large Language Models.- Deep Learning Based Growth Modeling of Plant Phenotypes.- A simple approach to pavement cell segmentation.- Enhancing weed detection performance by means of GenAI-based image augmentation.- SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture.- Robust UDA for Crop and Weed Segmentation: Multi-Scale Attention and Style-Adaptive Techniques.- Ordinal-Meta Learning for Fine-grained Fruit Quality Prediction.- Beyond Annotations: Efficient Wheat Head Segmentation Using L-Systems, Game Engines, and Student-Teacher Models.- Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisDiffusion-based Light Field Synthesis.- Diffusion-Promoted HDR Video Reconstruction.- Lightweight Deep Learning Model for Defective Pixel Detection and Recovery from the Image Sensors.- IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts.- Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models.- MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance.- RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content.- Detecting Forged Sentinel-2 Images Through Parallax-Based Cloud Analysis.- PRISM: Progressive Restoration for Scene Graph-based Image Manipulation.- DAVIDE: Depth-Aware Video Deblurring.- RenDetNet: Weakly-supervised Shadow Detection with Shadow Caster Verification.- Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion model.- Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications.- QSD: Query-Selection Denoising score for Image Editing in Latent Diffusion Model.- PDB Unet: A spatio temporal video Fixed Pattern Noise removal network.- Reversible and Cascaded Lightweight Colour Constancy: Jointly Addressing Illumination Correction and White Balance.- Hybrid Spatial-spectral Neural Network for Hyperspectral Image Denoising.- Solving Inverse Problem With Unspecified Forward Operator Using Diffusion Models.- A Disentangled Approach to Predict the Aesthetic Outcomes of Breast Cancer Treatment.- LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model.- Pushing Joint Image Denoising and Classification to the Edge.- Self-Supervised HDR Imaging from Motion and Exposure Cues.- Closer to Ground Truth: Realistic Shape and Appearance Labeled Data Generation for Unsupervised Underwater Image Segmentation.- Edge-aware Consistent Stereo Video Depth Estimation.- Low-Cost Stereoscopic Optical-Coding Design for Depth Estimation Using End-to-End Optimization.- 360U-Former: HDR Illumination Estimation with Panoramic Adapted Vision Transformers.- Satellite Image Dehazing Via Masked Image Modeling and Jigsaw Transformation.- UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- TONO: a synthetic dataset for face image compliance to ISO/ICAO standard..- Improving Post-Earthquake Crack Detection using Semi-Synthetic Generated Images..- DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition..- Neural Transcoding Vision Transformers for EEG-to-fMRI Synthesis..- RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models..- NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation..- Synthetic to Authentic: Transferring Realism to 3D Face Renderings for Boosting Face Recognition..- Time-Resolved MNIST Dataset for Single-Photon Recognition..- NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images..- Training and Benchmarking Leukocyte Sub-types Classification Methods with Synthetic Images..- DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling..- Contextual Knowledge Pursuit for Faithful Visual Synthesis..- SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models..- Diffusion-based Synthetic Dataset Generation for Egocentric 3D Human Pose Estimation..- BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models..- A CycleGAN Model to Synthesize Missing and Unpaired MRI Sequences for Under-Represented Multiple Sclerosis Lesions..- The Impact of Balancing Real and Synthetic Data on Accuracy and Fairness in Face Recognition..- DreamTexture: High-Fidelity Synthetic 3D Data Generation through Decoupled Geometry and Texture Synthesis..- Control+Shift: Generating Controllable Distribution Shifts..- Comparative Analysis of Synthetic and Real Melanoma Images in AI-Driven Diagnosis..- How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recognition..- Synthetic Generation of Dermatoscopic Images with GAN and Closed-Form Factorization..- FABRIC: Personalizing Diffusion Models with Iterative Feedback..- TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisFew-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting.- On Scaling Up 3D Gaussian Splatting Training.- AEPnP: A Less-constrained EPnP Solver for Pose Estimation with Anisotropic Scaling.- Scalable Indoor Novel-View Synthesis using Drone-Captured 360 Imagery with 3D Gaussian Splatting.- Space3D-Bench: Spatial 3D Question Answering Benchmark.- VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field.- NeRF-Supervised Feature Point Detection and Description.- Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks.- Real-Time 2nd-order Gaze Metrics.- Normalized Validity Scores for DNNs in Regression based Eye Feature Extraction and Real-Time Models for the Raspberry Pi.- Helios: An extremely low power event-based gesture recognition for always-on smart eyewear.- CondSeg: Ellipse Estimation of Pupil and Iris via Conditioned Segmentation.- Towards Unsupervised Eye-Region Segmentation for Eye Tracking.- Towards Low-power, High-frequency Gaze Direction Tracking with an Event-camera.- Towards Resource-aware Visual Inertial SLAM.- Evaluating Human Pose Estimation Algorithms for Resource-Constrained Smart Eyewear Device.- Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses With TinyissimoYOLO.- Towards Real-Time Online Egocentric Action Recognition on Smart Eyewear.- High-frequency near-eye ground truth for event-based eye tracking.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- Landmark-Based Screening: Femoral Head Coverage and Graf Classification in Infant Developmental Dysplasia of the Hip..- MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition..- One-Shot Image Restoration..- Medical Image Segmentation with SAM-generated Annotations..- Manipulating and Mitigating Generative Model Biases without Retraining..- Fake or JPEG? Revealing Common Biases in Generated Image Detection Datasets..- Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image Generative Models..- A semiotic methodology for assessing the compositional effectiveness of generative text-to-image models (Midjourney and DALL·E)..- A Framework for Critical Evaluation of Text-to-Image Models: Integrating Art Historical Analysis, Artistic Exploration, and Critical Prompt Engineering..- Civiverse: A Dataset for Analyzing User Engagement with Open-Source TTI-Models..- Exploring the Boundaries of Content Moderation in Text-to-Image Generation..- Rethinking HTG Evaluation: Bridging Generation and Recognition..- Evaluation Framework for Feedback Generation Methods in Skeletal Movement Assessment..- FaceOracle: Chat with a Face Image Oracle..- Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors..- How to Squeeze An Explanation Out of Your Model..- How were you created? Explaining synthetic face images generated by diffusion models..- Frequency Matters: Explaining Biases of Face Recognition in the Frequency Domain..- How green is continual learning, really? Analyzing the energy consumption in continual training of vision foundation models..- Architecture-Agnostic Unsupervised Gradient Regularization For Parameter-Efficient Transfer Learning..- Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution..- Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis..- Improved Baselines for Data-efficient Perceptual Augmentation of LLMs..- Watt for What: Rethinking Deep Learning’s Energy-Performance Relationship.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisGenerating Binary Species Range Maps.- WildFusion: Individual Animal Identification with Calibrated Similarity Fusion.- Towards Zero-Shot Camera Trap Image Categorization.- Larval Hostplant Prediction from Luehdorfia japonica Image using Multi-label ABN.- Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation.- Underwater Uncertainty: A Multi-Annotator Image Dataset for Benthic Habitat Classification.- Deep Learning for Automated Shark Detection and Biometrics Without Keypoints.- Improving in situ real-time classification of long-tail marine plankton images for ecosystem studies.- KAN-Mixer: Kolmogorov-Arnold Networks for Gene Expression Prediction in Plant Species.- Multi-Scale and Multimodal Species Distribution Modelling.- Semantic Segmentation of Benthic Classes in Reef Environments using a Large Vision Transformer.- POLO - Point-based, multi-class animal detection.- Mining Field Data for Tree Species Recognition at Scale.- MaskSDM: Adaptive species distribution modeling through data masking.- Fine-tuning for Bird Sound Classification: An Empirical Study.- Multimodal Fusion Strategies for Mapping Biophysical Landscape Features.- I-Design: Personalized LLM Interior Designer.- NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training.- GeoTransfer : Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning.- DiVR: incorporating context from diverse VR scenes for human trajectory prediction.- Skeleton-Aware Motion Retargeting Using Masked Pose Modeling.- LucidDreaming: Controllable Object-Centric 3D Generation.- BehAVE: Behaviour Alignment of Video Game Encodings.- Collaborative Control for Geometry-Conditioned PBR Image Generation.- Real-Time Neural Cloth Deformation using a Compact Latent Space and a Latent Vector Predictor.- Level Up Your Tutorials: VLMs for Game Tutorials Quality Assessment.- Across-Game Engagement Modelling via Few-Shot Learning.- 
Hand2Any: Hand-to-Any Motion Mapping with Few-Shot User Adaptation for Avatar Manipulation.- SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers.- PlaMo: Plan and Move in Rich 3D Physical Environments.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisCEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding.- Event-based Motion Deblurring with Dual Channel Attention.- Optimal OnTheFly Feedback Control of Event Sensors.- EventSleep: Sleep Activity Recognition with Event Cameras.- ES-PTAM: Event-based Stereo Parallel Tracking and Mapping.- Lossy Encoding of Time-aggregated Neuromorphic Vision Sensor Data based on Point Cloud Compression.- Drone Detection Using a Low-Power Neuromorphic Virtual Tripwire.- Event Stream Super Resolution using Sigma Delta Neural Network.- Tracking-Assisted Object Detection with Event Cameras.- MouseSIS: A Frames-and-Events Dataset for Space-Time Instance Segmentation of Mice.- HUE Dataset: High-Resolution Event and Frame Sequences for Low-Light Vision.- Millisecond-latency Visual Fault-buttons using Event-cameras.- Neuromorphic Facial Analysis with Cross-Modal Supervision.- Evaluating Image-Based Face and Eye Tracking with Event Cameras.- Scaling Up Resonate-and-Fire Networks for Fast Deep Learning.- Neuromorphic Drone Detection: an Event-RGB Multimodal Approach.- Pushing the boundaries of event subsampling in event-based video classification using CNNs.- Vibration Vision: Real-Time Machinery Fault Diagnosis with Event Cameras.- S-ROPE: Spectral Frame Representation of Periodic Events.- Autobiasing Event Cameras.- Recent Event Camera Innovations: A Survey.- EvDownsampling: A Robust Method For Downsampling Event Camera Data.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisModelling the Distribution of Human Motion for Sign Language Assessment.- Enhancing Human-Robot Collaborative Search through Efficient Space Sharing with On-demand Interaction.- Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models.- Hand Gesture Recognition using Dual Graph Hierarchical Edges Representation and Graph Transformer Network.- BurnSafe: Automatic Assistive Tool for Burn Severity Assessment by Semantic Segmentation.- DiffSign: AI-Assisted Generation of Customizable Sign Language Videos With Enhanced Realism.- Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents.- Multi-view Pose Fusion for Occlusion-Aware 3D Human Pose Estimation.- HAVANA: Hierarchical stochastic neighbor embedding for Accelerated Video ANnotAtions.- Aligning Object Detector Bounding Boxes with Human Preference.- GSK-C2F: Graph Skeleton Modelization for Action Segmentation and Recognition using a Coarse-to-Fine strategy.- Machine Learning Approaches for Analyzing Physiological Data in Remote Patient Monitoring.- OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare.- VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis.- Video Editing for Video Retrieval.- REST–HANDS: Rehabilitation with Egocentric Vision using Smartglasses for Treatment of Hands after Surviving Stroke.- Towards Wearable Multi-Modal Human Activity Recognition with Deep Fusion Networks.- Segmenting Object Affordances: Reproducibility and Sensitivity to Scale.- Target-Oriented Object Grasping via Multimodal Human Guidance.- A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning.- ExeChecker: Where Did I Go Wrong?.- Assistive Visual Tool: Enhancing Safe Navigation with Video Remapping in AR Headsets.- OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation.- BodyShapeGPT: SMPL Body Shape Manipulation with LLMs.- 
Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance.- MCRE: Multimodal Conditional Representation and Editing for Text-Motion Generation.- Towards motion from video diffusion models.- N Heads Are Better Than One: Exploring Theoretical Performance Bounds of 3D Face Reconstruction Methods.- GECO: GPT-Driven Estimation of 3D Human-Scene Contact in the Wild.- MI-NeRF: Learning a Single NeRF for Multiple Identities.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisFALCON: Fair Active Learning for Content Moderation.- Generalizing Fairness to Generative Language Models via Reformulation of Non-discrimination Criteria.- Beyond the Surface: A Comprehensive Analysis of Implicit Bias in Vision-Language Models.- Fairness of AI Systems in the Legal Context.- DebiasPI: Inference-time Debiasing by Prompt Iteration of a Text-to-Image Generative Model.- Fairness Under Cover: Evaluating the Impact of Occlusions on Demographic Bias in Facial Recognition.- Prompt and Prejudice.- Localization-Guided Supervision for Robust Medical Image Classification by Vision Transformers.- Top-GAP: Integrating Size Priors in CNNs for more Interpretability, Robustness, and Bias Mitigation.- Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers.- An Investigation on The Position Encoding in Vision-Based Dynamics Prediction.- What could go wrong? Discovering and describing failure modes in computer vision.- Image-guided topic modeling for interpretable privacy classification.- Integrating Local and Global Interpretability for Deep Concept-Based Reasoning Models.- From Flexibility to Manipulation: The Slippery Slope of XAI Evaluation.- Feature Contribution in Monocular Depth Estimation.- Concept-Based Explanations in Computer Vision: Where Are We and Where Could We Go?.- Explanation Alignment: Quantifying the Correctness of Model Reasoning At Scale.- Detect Fake with Fake: Leveraging Synthetic Data-driven Representation for Synthetic Image Detection.- Incremental and Decremental Continual Learning for Privacy-preserving Video Recognition.- Exploring Strengths and Weaknesses of Super-Resolution Attack in Deepfake Detection.- Are CLIP features all you need for Universal Synthetic Image Origin Attribution?.- GLoFool: global enhancements and local perturbations to craft adversarial images.- Evolution of Detection Performance throughout the Online Lifespan of Synthetic Images.- Your diffusion model is an implicit synthetic image detector.- The Phantom Menace: Unmasking Privacy Leakages in Vision-Language Models.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- On the Application of Egocentric Computer Vision to Industrial Inspection.- NeuroSymbolic Visual Transform based on Logic Tensor Network for Defect Detection.- Multimodal computer vision techniques for wooden utility pole density estimation with contact-free sensing.- Dynamic Label Injection for Imbalanced Industrial Defect Segmentation.- XAI-guided Insulator Anomaly Detection for Imbalanced Datasets.- Exploring Multi-modal Neural Scene Representations With Applications on Thermal Imaging.- Foreground-Aware Knowledge Distillation for Enhanced Damage Detection.- AnomalyFactory: Regard Anomaly Generation as Unsupervised Anomaly Localization.- Interactive Explainable Anomaly Detection for Industrial Settings.- DAS3D: Dual-modality Anomaly Synthesis for 3D Anomaly Detection.- SQUAD: Scalar Quantized representation learning for Unsupervised Anomaly Detection and localization.- Deep Unsupervised Segmentation of Log Point Clouds.- A Computer Vision System for Automatic Edge Detection of Magnetic Grain Profile.- Find the Assembly Mistakes: Error Segmentation for Industrial Applications.- EM Based Nano-Scale Defect Analysis in Semiconductor Manufacturing for Advanced IC Nodes.- On The Relationship between Visual Anomaly-free and Anomalous Representations.- DIE-VIS: an Automated Visual Inspection System for Cardboard Box Manufacturing.- When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections.- AnomalousPatchCore: Exploring the Use of Anomalous Samples in Industrial Anomaly Detection.- Self-supervised Models are Strong Industrial Few-shot Classification Learners.- Hyperspectral Imaging and Computer Vision Based Remote Monitoring of SO2 Emissions in Maritime Vessels.- Temporal-consistent CAMs for Weakly Supervised Video Segmentation in Waste Sorting.- Sequential PatchCore: Anomaly Detection for Surface Inspection using Synthetic Impurities.- SplatPose+: Real Time Image-Based Pose-Agnostic 3D Anomaly Detection.- BBD-Polyp: Weakly Supervised Polyp Segmentation via Bounding Box and Depth Map.- ENSTRECT: A Stage-based Approach to 2.5D Structural Damage Detection.- An Augmentation-based Model Re-adaptation Framework for Robust Image Segmentation.- Meta Learning-Driven Iterative Refinement for Robust Anomaly Detection in Industrial Inspection.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisBridging Text and Image for Artist Style Transfer via Contrastive Learning.- Magic-Me: Identity-Specific Video Customized Diffusion.- Alfie: Democratising RGBA Image Generation With No $$$.- ComiCap: A VLM pipeline for dense captioning of Comic Panels.- Making Images from Images: Tightly Constrained Parallel Denoising.- ArCSEM: Artistic Colorization of SEM Images via Gaussian Splatting.- DreamWalk: Style Space Exploration using Diffusion Guidance.- An Art-centric perspective on AI-based content moderation of nudity.- Evaluation of Illustration Generators with Domain-Specific Representations.- Unlocking Comics: The AI4VA Dataset for Visual Understanding.- Art2Mus: Bridging Visual Arts and Music through Cross-Modal Generation.- Art Forgery Detection using Kolmogorov Arnold and Convolutional Neural Networks.- Sketch & Paint: Stroke-by-Stroke Evolution of Visual Artworks.- Storytelling Video Generation with Retrieval Augmentation and Character Consistency.- MACGaussian: Robust 3D Gaussian Splatting from sparse input views using high-precision Measurement-Arm-Camera (MAC) capture.- xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations.- VQA-Driven Facet-Level Texture Segmentation in 3D Surfaces.- Khattat: Enhancing Readability and Concept Representation of Semantic Typography.- Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning.
£123.49
Springer Computer Vision and Image Processing
£123.49
Springer Computer Vision and Image Processing
£123.49