Image processing Books
Springer Computer Vision ECCV 2024
Book SynopsisKernel Diffusion: An Alternate Approach to Blind Deconvolution.- MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty.- Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning.- Bidirectional Progressive Transformer for Interaction Intention Anticipation.- Reinforcement Learning Meets Visual Odometry.- Bucketed Ranking-based Losses for Efficient Training of Object Detectors.- Robustness Tokens: Towards Adversarial Robustness of Transformers.- RSL-BA: Rolling Shutter Line Bundle Adjustment.- DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images.- DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation.- Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models.- N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields.- ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction.- PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments.- Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph.- Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision.- ReCON: Training-Free Acceleration for Text-to-Image Synthesis with Retrieval of Concept Prompt Trajectories.- AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval.- TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models.- 3D Hand Sequence Recovery from Real Blurry Images and Event Stream.- GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation.- Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection.- StyleCity: Large-Scale 3D Urban Scenes Stylization.- ViG-Bias: Visually Grounded Bias Discovery and Mitigation.- DiffBIR: Toward Blind Image Restoration with Generative Diffusion Prior.- Assessing Sample Quality via the Latent Space of Generative Models.- Relightable Neural Actor with Intrinsic Decomposition and Pose Control.
£64.99
Springer Computer Vision ECCV 2024
Book SynopsisHyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions.- InstructGIE: Towards Generalizable Image Editing.- HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation.- Navigating Text-to-Image Generative Bias across Indic Languages.- Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning.- CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models.- Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation.- VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation.- A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation.- Towards Scene Graph Anticipation.- Non-Line-of-Sight Estimation of Fast Human Motion with Slow Scanning Imagers.- Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding.- NePhi: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration.- Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models.- Image Manipulation Detection With Implicit Neural Representation and Limited Supervision.- Scalar Function Topology Divergence: Comparing Topology of 3D Objects.- Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks.- Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models.- DeTra: A Unified Model for Object Detection and Trajectory Forecasting.- ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems.- Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction.- Common Sense Reasoning for Deep Fake Detection.- Let the Avatar Talk using Texts without Paired Training Data.- NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields.- GOEmbed: Gradient Origin Embeddings for Representation Agnostic 3D Feature Learning.- Causal Subgraphs and Information Bottlenecks: Redefining OOD Robustness in Graph Neural Networks.- AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale.
£71.99
Springer Computer Vision ECCV 2024
Book SynopsisMONTAGE: Monitoring Training for Attribution of Generative Diffusion Models.- Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations.- Watching it in Dark: A Target-aware Representation Learning Framework for High-Level Vision Tasks in Low Illumination.- Self-supervised visual learning from interactions with objects.- OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation.- BAFFLE: A Baseline of Backpropagation-Free Federated Learning.- Sequential Representation Learning via Static-Dynamic Conditional Disentanglement.- OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects.- 3R-INN: How to be climate friendly while consuming/delivering videos?.- Rethinking Deep Unrolled Model for Accelerated MRI Reconstruction.- Towards Robust Full Low-bit Quantization of Super Resolution Networks.- Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking.- Diverse Text-to-3D Synthesis with Augmented Text Embedding.- Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation.- LLMCO4MR: LLMs-aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang.- Model Breadcrumbs: Scaling Multi-Task Model Merging with Sparse Masks.- AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems.- iHuman: Instant Animatable Digital Humans From Monocular Videos.- SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation.- Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier.- Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering.- Solving the inverse problem of microscopy deconvolution with a residual Beylkin-Coifman-Rokhlin neural network.- Face Reconstruction Transfer Attack as Out-of-Distribution Generalization.- FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models.- Deep Diffusion Image Prior for Efficient OOD Adaptation in 3D Inverse Problems.- Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation.- PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects.
£71.99
Springer Computer Vision ECCV 2024
Book SynopsisIs Retain Set All You Need in Machine Unlearning? Restoring Performance of Unlearned Models with Out-Of-Distribution Images.- Octopus: Embodied Vision-Language Programmer from Environmental Feedback.- FunQA: Towards Surprising Video Comprehension.- 4D Contrastive Superflows are Dense 3D Representation Learners.- ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation.- Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos.- Robust Fitting on a Gate Quantum Computer.- H-V2X: A Large Scale Highway Dataset for BEV Perception.- Learning Camouflaged Object Detection from Noisy Pseudo Label.- Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance.- Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions.- CLR-GAN: Improving GANs Stability and Quality via Consistent Latent Representation and Reconstruction.- Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence.- PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts.- Motion Mamba: Efficient and Long Sequence Motion Generation.- Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis.- Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance.- A Direct Approach to Viewing Graph Solvability.- CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization.- SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving.- ZeST: Zero-Shot Material Transfer from a Single Image.- 3D Congealing: 3D-Aware Image Alignment in the Wild.- SMooDi: Stylized Motion Diffusion Model.- ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs.- SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion.- WordRobe: Text-Guided Generation of Textured 3D Garments.- Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation.
£104.49
Springer Computer Vision ECCV 2024
Book SynopsisLGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.- Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization.- RAW-Adapter: Adapting Pretrained Visual Model to Camera RAW Images.- SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic.- AFreeCA: Annotation-Free Counting for All.- Adversarially Robust Distillation by Reducing the Student-Teacher Variance Gap.- LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation.- Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion.- Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration.- GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation.- PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery.- Sapiens: Foundation for Human Vision Models.- Linearly Controllable GAN: Unsupervised Feature Categorization and Decomposition for Image Generation and Manipulation.- Generating Human Interaction Motions in Scenes with Text Control.- NOVUM: Neural Object Volumes for Robust Object Classification.- Align before Collaborate: Mitigating Feature Misalignment for Robust Multi-Agent Perception.- HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects.- SAIR: Learning Semantic-aware Implicit Representation.- ColorMNet: A Memory-based Deep Spatial-Temporal Feature Propagation Network for Video Colorization.- UNIC: Universal Classification Models via Multi-teacher Distillation.- Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation.- Eliminating Warping Shakes for Unsupervised Online Video Stitching.- Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models.- Merlin: Empowering Multimodal LLMs with Foresight Minds.- ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders.- E.T. the Exceptional Trajectory: Text-to-camera-trajectory generation with character awareness.- OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding.
£66.49
Springer Computer Vision ECCV 2024
Book SynopsisWalker: Self-supervised Multiple Object Tracking by Walking on Temporal Object Appearance Graphs.- Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition.- DiffiT: Diffusion Vision Transformers for Image Generation.- WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation.- GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding.- FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis.- FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection.- SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs.- ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities.- MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?.- See and Think: Embodied Agent in Virtual Environment.- PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects.- Bridging the Gap Between Human Motion and Action Semantics via Kinematics Phrases.- VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding.- Masked Angle-Aware Autoencoder for Remote Sensing Images.- Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm.- MultiGen: Zero-shot Image Generation from Multi-modal Prompts.- GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths.- Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning.- SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis.- Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets.- FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition.- Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting.- UniCode : Learning a Unified Codebook for Multimodal Large Language Models.- When Do We Not Need Larger Vision Models?.- GVGEN: Text-to-3D Generation with Volumetric Representation.- Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model.
£66.49
Springer Computer Vision ECCV 2024
Book SynopsisCoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing.- Noise-assisted Prompt Learning for Image Forgery Detection and Localization.- Data Collection-free Masked Video Modeling.- Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model.- Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization.- AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation.- SEED: A Simple and Effective 3D DETR in Point Clouds.- AEDNet: Adaptive Embedding and Multiview-Aware Disentanglement for Point Cloud Completion.- Synergy of Sight and Semantics: Visual Intention Understanding with CLIP.- Intrinsic Single-Image HDR Reconstruction.- T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning.- Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification.- Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching.- BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models.- Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.- DATENeRF: Depth-Aware Text-based Editing of NeRFs.- XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution.- ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting.- Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery.- LaRa: Efficient Large-Baseline Radiance Fields.- Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement.- MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment.- Grounding Language Models for Visual Entity Recognition.- ELSE: Efficient Deep Neural Network Inference through Line-based Sparsity Exploration.- DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation.- DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation.- TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos.
£71.99
Springer Computer Vision ECCV 2024
Book SynopsisMutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection.- Self-Supervised Video Copy Localization with Regional Token Representation.- Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models.- RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF.- Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture.- ControlLLM: Augment Language Models with Tools by Searching on Graphs.- UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction.- DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors.- Vamos: Versatile Action Models for Video Understanding.- Prioritized Semantic Learning for Zero-shot Instance Navigation.- RoadPainter: Points Are Ideal Navigators for Topology transformER.- FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis.- Can OOD Object Detectors Learn from Foundation Models?.- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion.- MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo.- Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training.- Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation.- Real-data-driven 2000 FPS Color Video from Mosaicked Chromatic Spikes.- Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging.- TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts.- RadEdit: stress-testing biomedical vision models via diffusion image editing.- SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow.- AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution.- Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion.- Towards Real-world Event-guided Low-light Video Enhancement and Deblurring.- Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation.- TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks.
£64.99
Springer Computer Vision ECCV 2024
Book SynopsisUpper-body Hierarchical Graph for Skeleton Based Emotion Recognition in Assistive Driving.- Fine-Grained Scene Graph Generation via Sample-Level Bias Prediction.- Exploring Guided Sampling of Conditional GANs.- MotionChain: Conversational Motion Controllers via Multimodal Prompts.- Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition.- Latent Guard: a Safety Framework for Text-to-image Generation.- MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion.- TCC-Det: Temporarily consistent cues for weakly-supervised 3D detection.- OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection.- FoundPose: Unseen Object Pose Estimation with Foundation Features.- Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation.- Kalman-Inspired Feature Propagation for Video Face Super-Resolution.- Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models.- VideoMamba: State Space Model for Efficient Video Understanding.- SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging.- Heterogeneous Graph Learning for Scene Graph Prediction in 3D Point Clouds.- Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving.- Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models.- Deep Cost Ray Fusion for Sparse Depth Video Completion.- GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection.- DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video.- GraspXL: Generating Grasping Motions for Diverse Objects at Scale.- Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models.- Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models.- JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation.- Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals.- Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection.
£64.99
Springer Cancer Prevention Detection and Intervention
Book SynopsisClassification and characterization.- Multi-center ovarian tumor classification using hierarchical transformer-based multiple-instance learning.- FoTNet Enables Preoperative Differentiation of Malignant Brain Tumors with Deep Learning.- Classification of Endoscopy and Video Capsule Images using Hybrid Model.- Multimodal Deep Learning-based Prediction of Immune Checkpoint Inhibitor Efficacy in Brain Metastases.- Seeing More with Less: Meta-Learning and Diffusion Models for Tumor Characterization in Low-data Settings.- Performance Evaluation of Deep Learning and Transformer Models Using Multimodal Data for Breast Cancer Classification.- Detection and Segmentation.- On undesired emergent behaviors in compound prostate cancer detection systems.- Optimizing Multi-Expert Consensus for Classification and Precise Localization of Barrett's Neoplasia.- Automated Hepatocellular Carcinoma Analysis in Multi-Phase CT with Deep Learning.- Refining deep learning segmentation maps with a local thresholding approach: application to liver surface nodularity quantification in CT.- Uncertainty-Aware Deep Learning Classification for MRI-based Prostate Cancer Detection.- Generalized Polyp Detection from Colonoscopy frames Using proposed EDF-YOLO8 Network.- AI-Assisted Laryngeal Examination System.- UltraWeak: Enhancing Breast Ultrasound Cancer Detection with Deformable DETR and Weak Supervision.- SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling.- Cancer/Early cancer detection, treatment, and survival prognosis.-AI Age Discrepancy: A Novel Parameter for Frailty Assessment in Kidney Tumor Patients.- Deep Neural Networks for Predicting Recurrence and Survival in Patients with Esophageal Cancer After Surgery.- Treatment efficacy prediction of focused ultrasound therapies using multi-parametric magnetic resonance imaging.- SurRecNet: A Multi-Task Model with Integrating MRI and Diagnostic Descriptions for Rectal Cancer Survival Analysis.- Improved prediction of recurrence after prostate cancer radiotherapy using multimodal data and in silico simulations.- AutoDoseRank: Automated Dosimetry-informed Segmentation Ranking for Radiotherapy.- SurvCORN: Survival Analysis with Conditional Ordinal Ranking Neural Network.
£49.99
Springer Computer Vision ECCV 2024
Book SynopsisSLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking.- Tensorial template matching for fast cross-correlation with rotations and its application for tomography.- FreeAugment: Data Augmentation Search Across All Degrees of Freedom.- Learning Representations of Satellite Images From Metadata Supervision.- I2-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM.- FlashTex: Fast Relightable Mesh Texturing with LightControlNet.- GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence.- ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling.- PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance.- SOS: Segment Object System for Open-World Instance Segmentation With Object Priors.- Lagrangian Hashing for Compressed Neural Field Representations.- EDformer: Transformer-Based Event Denoising Across Varied Noise Levels.- Foster Adaptivity and Balance in Learning with Noisy Labels.- MetaAug: Meta-Data Augmentation for Post-Training Quantization.- Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis.- Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach.- Unleashing the Power of Prompt-driven Nucleus Instance Segmentation.- Gaze Target Detection Based on Head-Local-Global Coordination.- 3DSA:Multi-View 3D Human Pose Estimation With 3D Space Attention Mechanisms.- Toward Tiny and High-quality Facial Makeup with Data Amplify Learning.- An Economic Framework for 6-DoF Grasp Detection.- GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction.- Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning.- AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer.- Multi-Label Cluster Discrimination for Visual Representation Learning.- Plan, Posture and Go: Towards Open-vocabulary Text-to-Motion Generation.- DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion.
£71.99
Springer Computer Vision ECCV 2024
Book SynopsisCLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks.- Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering.- Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds.- A New Dataset and Framework for Real-World Blurred Images Super-Resolution.- AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization.- RISurConv: Rotation Invariant Surface Attention-Augmented Convolutions for 3D Point Cloud Classification and Segmentation.- StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models.- Bidirectional Uncertainty-Based Active Learning for Open-Set Annotation.- Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective.- Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation.- SeiT++: Masked Token Modeling Improves Storage-efficient Training.- Rectify the Regression Bias in Long-Tailed Object Detection.- MagicEraser: Erasing Any Objects via Semantics-Aware Control.- Reliable Spatial-Temporal Voxels For Multi-Modal Test-Time Adaptation.- Stable Preference: Redefining training paradigm of human preference model for Text-to-Image Synthesis.- SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images.- NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model.- Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities.- Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers.- Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification.- 3D Small Object Detection with Dynamic Spatial Pruning.- STSP: Spatial-Temporal Subspace Projection for Video Class-incremental Learning.- Transferable 3D Adversarial Shape Completion using Diffusion Models.- OmniSat: Self-Supervised Modality Fusion for Earth Observation.- Distilling Diffusion Models into Conditional GANs.- Semantically Guided Representation Learning For Action Anticipation.- MemBN: Robust Test-Time Adaptation via Batch Norm with Statistics Memory.
£64.99
Springer Computer Vision ECCV 2024
Book SynopsisFREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions.- ScanTalk: 3D Talking Heads from Unregistered Scans.- Controllable Navigation Instruction Generation with Chain of Thought Prompting.- GiT: Towards Generalist Vision Transformer through Universal Language Interface.- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention.- A Cephalometric Landmark Regression Method based on Dual-encoder for High-resolution X-ray Image.- Exploring the Feature Extraction and Relation Modeling For Light-Weight Transformer Tracking.- LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment.- You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation.- Gaussian Grouping: Segment and Edit Anything in 3D Scenes.- CoMo: Controllable Motion Generation through Language Guided Pose Code Editing.- MegaScenes: Scene-Level View Synthesis at Scale.- SuperGaussian: Repurposing Video Models for 3D Super Resolution.- Towards Model-Agnostic Dataset Condensation by Heterogeneous Models.- Goldfish: Vision-Language Understanding of Arbitrarily Long Videos.- MeshFeat: Multi-Resolution Features for Neural Fields on Meshes.- Decoupling Common and Unique Representations for Multimodal Self-supervised Learning.- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training.- Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation.- 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction.- Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models.- D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction.- Combining Generative and Geometry Priors for Wide-Angle Portrait Correction.- RealViformer: Investigating Attention for Real-World Video Super-Resolution.- Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution.- Decomposed Vector-Quantized Variational Autoencoder for Human Grasp Generation.- UniFS: Universal Few-shot Instance Perception with Point Representations.
£71.99
Springer Computer Vision ECCV 2024
Book SynopsisSemanticHuman-HD: High Resolution Semantic disentangled 3D Human Generation.- CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians.- Monocular Occupancy Prediction for Scalable Indoor Scenes.- Visual Grounding for Object-Level Generalization in Reinforcement Learning.- 3DEgo: 3D Editing on the Go!.- Efficient Depth-Guided Urban View Synthesis.- Probabilistic Weather Forecasting with Deterministic Guidance-based Diffusion Model.- Domain-adaptive Video Deblurring via Test-time Blurring.- Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures.- NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving.- OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing.- Progressive Pretext Task Learning for Human Trajectory Prediction.- Hyperion A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM.- Isomorphic Pruning for Vision Models.- Attention Prompting on Image for Large Vision-Language Models.- Learning Cross-hand Policies of High-DOF Reaching and Grasping.- Reprojection Errors as Prompts for Efficient Scene Coordinate Regression.- Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning.- Long-Tail Temporal Action Segmentation with Group-wise Temporal Logit Adjustment.- REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models.- DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing.- VideoClusterNet: Self-Supervised and Adaptive Face Clustering for Videos.- Unveiling Privacy Risks in Stochastic Neural Networks Training: Effective Image Reconstruction from Gradients.- Controlling the World by Sleight of Hand.- Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack.- Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection.- Cross-Domain Learning for Video Anomaly Detection with Limited Supervision.
£64.99
Springer Foundation Models for General Medical AI
Book Synopsis.- FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification..- The Importance of Downstream Networks in Digital Pathology Foundation Models..- Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation..- Navigating Data Scarcity using Foundation Models: A Benchmark of Few-Shot and Zero-Shot Learning Approaches in Medical Imaging..- AutoEncoder-Based Feature Transformation with Multiple Foundation Models in Computational Pathology..- OSATTA: One-Shot Automatic Test Time Augmentation for Domain Adaptation..- Automating MedSAM by Learning Prompts with Weak Few-Shot Supervision..- SAT-Morph: Unsupervised Deformable Medical Image Registration using Vision Foundation Models with Anatomically Aware Text Prompt..- Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs..- D- Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions..- Optimal Prompting in SAM for Few-Shot and Weakly Supervised Medical Image Segmentation..- UniCrossAdapter: Multimodal Adaptation of CLIP for Radiology Report Generation..- TUMSyn: A Text-Guided Generalist model for Customized Multimodal MR Image Synthesis..- SAMU: An Efficient and Promptable Foundation Model for Medical Image Segmentation..- Anatomical Embedding-Based Training Method for Medical Image Segmentation Foundation Models..- Boosting Vision-Language Models for Histopathology Classification: Predict all at once..- MAGDA: Multi-agent guideline-driven diagnostic assistance.
£44.99
Springer Biomedical Image Registration
Book SynopsisArchitectures.- multiGradICON A Foundation Model for Multimodal Medical Image Registration.- XSynthMorph Generative Guided Deformation for Unsupervised Ill Posed Volumetric Recovery.- Large Deformation Registration with A Confidence guided Network.- Unleashing Registration Diffusion Models for Synthetic Paired 3D Training Data.- Feedback Attention for Unsupervised Cardiac Motion Estimation in 3D Echocardiography.- Learning Intra Patient Liver Registration with Graph Cross Attention.- Mamba Catch The Hype Or Rethink What Really Helps for Image Registration.- Robustness.- Assessing the Robustness of Image Registration Models Under Domain Shifts with Learnable Input Images.- Challenging the Robustness of Image Registration Similarity Metrics with Adversarial Attacks.- Comparative Study on Co Registration Techniques for Diffusion Weighted Breast MRI and Improved ADC Mappin.- A Learning Free Approach to Mitigate Abnormal Deformations in Medical Image Registration.- Deformable MRI Sequence Registration for AI based Prostate Cancer Diagnosis.- Atlas Fusion.- SINA Sharp Implicit Neural Atlases by Joint Optimisation of Representation and Deformation.- A Novel Fusion of CT MRI and US Images Based on Depth Camera and Electromagnetic Tracking.- Deep Learning Multi Channel Structural and Diffusion Tensor Neonatal Image Registration.- Registration by Regression RbR a Framework for Interpretable and Flexible Atlas Registration.- Diffusion Model Based Hierarchical Registration Framework for Whole Body Image.- Feature Similarity Learning.- Unsupervised Similarity Learning for Image Registration with Energy Based Models.- Segmentation by registration enabled SAM prompt engineering using five reference images.- Electron Microscopy Image Registration with Twin Axial Transformer and Progressive Training.- General Vision Encoder Features as Guidance in Medical Image Registration.- Rigid Single Slice in Volume registration via rotation equivariant 2D 3D feature matching.- A Self Supervised Image Registration Approach for Measuring Local Response Patterns in Metastatic Ovarian Cancer.- CAR Contrast Agnostic Deformable Medical Image Registration with Contrast Invariant Latent Regularization.- Efficiency.- High Performance Groupwise Cortical Surface Registration with Multimodal Surface Matching.- Optimising Region of Interest Registration for Multiple Tissue Whole Slide Images.- Automatic Registration of SHG and H E Images with Feature based Initial Alignment and Intensity based Instance Optimization Contribution to the COMULIS Challenge.- Towards Fast and Accurate Non rigid Liver Fusion.
£59.99
Springer Computer Vision ECCV 2024
Book SynopsisMRSP: Learn Multi-Representations of Single Primitive for Compositional Zero-Shot Learning.- Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs.- TrafficNight : An Aerial Multimodal Benchmark For Nighttime Vehicle Surveillance.- Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing.- Towards Open Domain Text-Driven Synthesis of Multi-Person Motions.- Generative End-to-End Autonomous Driving.- Learning to Distinguish Samples for Generalized Category Discovery.- COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset a Vision-Language Benchmark.- PILoRA: Prototype Guided Incremental LoRA for Federated Class-Incremental Learning.- Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem.- WBP: Training-time Backdoor Attacks through Hardware-based Weight Bit Poisoning.- Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice.- Encapsulating Knowledge in One Prompt.- Cross-Input Certified Training for Universal Perturbations.- Visual Relationship Transformation.- Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data.- Delving into Adversarial Robustness on Document Tampering Localization.- Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing.- Confidence-Based Iterative Generation for Real-World Image Super-Resolution.- Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy.- Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection.- Seeing Faces in Things: A Model and Dataset for Pareidolia.- Cocktail Universal Adversarial Attack on Deep Neural Networks.- Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering.- AMD: Automatic Multi-step Distillation of Large-scale Vision Models.- FairViT: Fair Vision Transformer via Adaptive Masking.- TrojVLM: Backdoor Attack Against Vision Language Models.
£64.99
Springer Computer Vision ECCV 2024
Book SynopsisGeoCalib: Learning Single-image Calibration with Geometric Optimization.- 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation.- Semicalibrated Relative Pose from an Affine Correspondence and Monodepth.- Global Structure-from-Motion Revisited.- MobileNetV4: Universal Models for the Mobile Ecosystem.- Gravity-aligned Rotation Averaging with Circular Regression.- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation.- Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments.- Quanta Video Restoration.- Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models.- CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model.- ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image.- POCA: Post-training Quantization with Temporal Alignment for Codec Avatars.- HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts.- Finding Meaning in Points: Weakly Supervised Semantic Segmentation for Event Cameras.- Unsupervised Dense Prediction using Differentiable Normalized Cuts.- Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training.- Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization.- AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion.- Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers.- EINet: Point Cloud Completion via Extrapolation and Interpolation.- Personalized Video Relighting With an At-Home Light Stage.- Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction.- A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks.- SPIRE: Semantic Prompt-Driven Image Restoration.- Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images.- HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution.
£71.99
£44.99
Springer Artificial Intelligence over Infrared Images for Medical Applications
Book Synopsis.- Thermal Radiomics for Early Detection of Diabetic Foot Ulcers Using Infrared Thermography..- Reverse Circular Logarithmic LBP for Diabetic Foot Ulcer Detection..- CNN Transformer for the Automated Detection of Rheumatoid Arthritis in Hand Thermal Images..- Generative Artificial Intelligence Approaches for Synthesizing High-Fidelity Breast Thermal Images..- About the validity of using DCGANs for data augmentation in breast thermography segmentation..- 3D-CNN for Breast Cancer Detection on Angular IR Images..- Evaluating Radiomics Feature Reduction for Thyroid Nodule Segmentation in Thermal Imaging..- Assessment of range of motion before and after hamstring percussion therapy using thermography and CNN..- The Effects of Balance and Strength on Thermal Heatmap..- The Future of Herpes Zoster Care: AI-Powered Thermal Imaging for Accurate Diagnosis and PHN Prediction.
£49.49
Springer Medical Image Computing and Computer Assisted Intervention MICCAI 2024 Workshops
Book SynopsisLDTM Workshop.- Disease Progression Modelling and Stratification for detecting sub-trajectories in the natural history of pathologies: application toParkinson's Disease trajectory modelling.- Back to the Future: Challenges of Sparse and Irregular Medical Image Time Series.- Individualized multi-horizon MRI trajectory prediction for Alzheimer's Disease.- Toward, for the Alzheimer's Disease Neuroimaging Initiative Towards Longitudinal Characterization of Multiple Sclerosis Atrophy Employing SynthSeg Framework and Normative Modeling.- BachCuadraSegHeD: Segmentation of Heterogeneous Data for Multiple SclerosisLesions with Anatomical Constraints.- Longitudinal Segmentation of MS Lesions via Temporal Difference Weighting .- Registration of Longitudinal Liver Examinations for Tumor ProgressAssessment.- Tracking lesion evolution using a Boundary Enhanced Approach for MS change segmentation (BEAMS).- A Radiological-based Coordinate System for the Human Body: A Proof-of-Concept.- MMMI-ML4MHD Workshop.- Language Models Meet Anomaly Detection for Better Interpretabilityand Generalizability.- A Diffusion Model Embedded WCSAU-Net for 3D MRI Brain Tumor Segmentation.- Predicting Human Brain States with Transformer .- Modality Image Quality Prediction for Time-Resolved CT fromBreathing Signals.- RATNUS: Rapid, Automatic Thalamic Nuclei Segmentation using Multimodal MRI inputs.- HyperMM : Robust Multimodal Learning with Varying-sized Inputs.- EMIT: H&E to Multiplex-immunohistochemistry Image Translation with Dual-Branch Pix2pix Generator.- Physics-Informed Latent Diffusion for Multimodal Brain MRI Synthesis.- ML-CDS Workshop.- MedPromptX: Grounded Multimodal Prompting for Chest X-rayDiagnosis.- Predicting Stroke through Retinal Graphs and Multimodal Self-supervised Learning.- Multimodality for Diagnosis of Asian Choroidal Vasculopathy: Resultsfrom a Novel Dataset and Deep-learning Experiments.- Multimodality Frequency Feature Customized Learning for PediatricVentricular Septal Defects Identification.
£49.99
Springer Pattern Recognition
Book Synopsis.- Clustering and Segmentation..- PARMESAN: Parameter-Free Memory Search and Transduction for Dense Prediction Tasks..- A State-of-the-Art Cutting Plane Algorithm for Clique Partitioning..- Self-Supervised Semantic Segmentation from Audio-Visual Data..- BTSeg: Barlow Twins Regularization for Domain Adaptation in Semantic Segmentation..- Learning Techniques..- FullCert: Deterministic End-to-End Certification for Training and Inference of Neural Networks..- Self-Masking Networks for Unsupervised Adaptation..- A Theoretical Formulation on the Use of Multiple Positive Views in Contrastive Learning.- Decoupling of neural network calibration measures..- Examining Common Paradigms in Multi-Task Learning..- DIAGen: Semantically Diverse Image Augmentation with Generative Models for Few-Shot Learning..- Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval ...- Anomaly Detection with Conditioned Denoising Diffusion Models..- Medical and Biological Applications..- SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network..- Foundation Models Permit Retinal Layer Segmentation Across OCT Devices..- Correlation Clustering of Organoid Images..- Animal Identification with Independent Foreground and Background Modeling..- Robust Tumor Segmentation with Hyperspectral Imaging and Graph Neural Networks..- Bigger Isn’t Always Better: Towards a General Prior for Medical Image Reconstruction..- Uncertainty and Explainability..- Latent Diffusion Counterfactual Explanations..- Enhancing Surface Neural Implicits with Curvature-Guided Sampling and Uncertainty-Augmented Representations..- Uncertainty Voting Ensemble for Imbalanced Deep Regression..- Analytical Uncertainty-Based Loss Weighting in Multi-Task Learning.
£59.99
Springer Pattern Recognition
Book Synopsis.- Modelling of Faces and Shapes..- 360° Volumetric Portrait Avatar..- How Do You Perceive My Face? Recognizing Facial Expressions in Multi-Modal Context by Modeling Mental Representations..- A Latent Implicit 3D Shape Model for Multiple Levels of Detail..- Image Generation and Reconstruction..- Coloring the Past: Neural Historical Monuments Reconstruction from Archival Photography..- Expanding the Image Embedding Space for Language-Free Text-to-Face Image Generation..- Towards synthetic generation of realistic wooden logs..- 3D Analysis and Sythesis..- G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles..- LiFCal: Online Light Field Camera Calibration via Bundle Adjustment..- CARLA Drone: Monocular 3D Object Detection from a Different Perspective..- Robust 3D Gaussian Splatting for Novel View Synthesis in Presence of Distractors..- DynaPix SLAM: A Pixel-Based Dynamic Visual SLAM Approach..- Leveraging Image Matching Toward End-to-End Relative Camera Pose Regression..- Erasing the Ephemeral: Joint Camera Refinement and Transient Object Removal for Street View Synthesis..- Physically Plausible Object Pose Refinement in Cluttered Scenes..- Gaussian Splatting in Style..- Video Analysis..- Bounding Boxes and Probabilistic Graphical Models: Video Anomaly Detection Simplified..- STAR: Screen Time and Actor Recognition in Video Content..- Photogrammetry and Remote Sensing..- Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery..- Worldwide High-fidelity Road Extraction from Aerial and Satellite Imagery enabled by Low-fidelity OpenStreetMap Labels..- SenPa-MAE: Sensor Parameter Aware Multi-Satellite Masked Autoencoder for Multispectral Earth Observation Imagery..- PuzzleBoard: A new Camera Calibration Pattern with Position Encoding.
£59.99
Springer Computational Diffusion MRI
Book Synopsis- Super-Resolution of Diffusion-Weighted Images via TDI-Conditioned Diffusion Model..- Diffusion-Based Gray-White Matter Mapping for Quantitative Tractography in Glioma Patients..- Ground-truth effects in learning-based fiber orientation distribution estimation in neonatal brains..- Synthesizing 3D axon morphology: springs are all we need..- Randomly COMMITting: Iterative Convex Optimization for Microstructure-Informed Tractography..- AID-DTI: Accelerating High-fidelity Diffusion Tensor Imaging with Detail-preserving Model-based Deep Learning..- Multi-dimensional Parameter Space Exploration for Streamline-specific Tractography..- Cross-domain Fiber Cluster Shape Analysis for Language Performance Cognitive Score Prediction..- Can Transfer Learning Improve Supervised Segmentation of White Matter Bundles in Glioma Patients..- Image Quality Transfer of Diffusion MRI Guided By High-Resolution Structural MRI..- QID2: An Image-Conditioned Diffusion Model for Q-space Up-sampling of DWI Data..- Ts-FWE: Token-Aware Single-shell Free Water Estimation for Brain Diffusion MRI..- Assessing Early Motor System Degeneration in the Spinal Cord of ALS Patients Using Diffusion MRI: An Exploratory Study..- RobNODDI: Robust NODDI Parameter Estimation with Adaptive Sampling under Continuous Representation..- Introducing QuantConn: Overcoming challenging diffusion acquisitions with harmonization..- Learning Low-Rank Tensor Approximation for GPU-based Tractography..- Deep multivariate autoencoder for capturing complexity in Brain Structure and Behaviour Relationships..- Heritability and Genetic Correlations Along the Corticospinal Tract..- Corpus Callosum Parcellation Methods: What Can Tractography Tell Us About Them?.
£48.44
Springer Intelligent and Efficient Video Moment Localization
Book SynopsisChapter 1: Introduction.- Chapter 2: Semantic Enhanced Video Moment Localization.- Chapter 3: Semantic Alignment Video Moment Localization.- Chapter 4: Semantic Pruning Video Moment Localization.- Chapter 5: Semantic Collaborative Video Moment Localization.- Chapter 6: Weakly-Supervised Video Moment Localization.- Chapter 7: Efficient Hashing based Video Moment Localization.- Chapter 8: Research Frontiers.
£133.48
£191.90
£68.40
£68.40
£66.49
£66.49
£66.49
Springer MICCAI Challenges 2024 ToothFairy 3DTeethLand STS LNCS
Book SynopsisToothFairy2: Multi-Structure Segmentation in CBCT Volumes.-Inferior Alveolar Nerve Segmentation in CBCT Images Using Connectivity-based Selective Re-training.- Scaling nnU-Net for CBCT Segmentation.- DiENTeS: Dynamic ENTity Segmentation with Local-Global Transformers.- Enhanced Multi-Structure Segmentation in CBCT Images with Adaptive Structure Optimization.- Weakly-Supervised Convolutional Neural Networks for Inferior Alveolar Nerve Segmentation in CBCT images.- A Multi-Axial Network for Oral Structural Segmentation.- Automatic Multi-Structure Segmentation in Cone Beam Computed Tomography Volumes Using Deep Encoder-Decoder Architectures.- Video Foundation Model for Medical 3D Segmentation.-STS: Semi-supervised Teeth Segmentation.-A Two-Stage Semi-Supervised nnU-Net Model for Automated Tooth Segmentation in Panoramic X-ray Images.- Two-Stage Semi-Supervised nnU-Net Framework for Tooth Segmentation in CBCT Images.- SemiT-SAM: Building a Visual Foundation Model for Tooth Instance Segmentation on Panoramic Radiographs.- Multi-stage Dental Visual Detection Based on YOLOv8: Dental 3D CBCT.- Efficient Semi-Supervised Tooth Instance Segmentation in Panoramic X-rays Using ResUnet50 and SAM Networks.- DAE-Net: Dual Attention Embedding-based Tooth Instance Segmentation Approach for Panoramic X-ray Images.- A Self-Training Pipeline for Semi-Supervised 2D Teeth Instance Segmentation.- Deformable Inherent Consistent Learning Network for Accurate Tooth Segmentation in Dental Panoramic Radiographs.- Semi-Supervised 2D Dental Image Segmentation via Cross Teaching Network.- A Novel Two-Stage Approach for 3D Dental Tooth Instance Segmentation.- 3DTeethLand24: 3D Teeth Landmarks Detection Challenge.-A Two-Stage Framework with Dual-Branch Network for End-to-End 3D Tooth Landmark Detection.- Leveraging Point Transformers for Detecting Anatomical Landmarks in Digital Dentistry.- ToothInstanceNet: Comprehensive Information from Intra-Oral Scans by Integration of Large-Context and High-Resolution Predictions.
£104.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisYCB-Ev 1.1: Event-vision dataset for 6DoF object pose estimation.- PCR-99: A Practical Method for Point Cloud Registration with 99 Percent Outliers.- Robust Single Rotation Averaging Revisited.- LanPose: Language-Instructed 6D Object Pose Estimation for Robotic Assembly.- SABER-6D: Shape Representation Based Implicit Object Pose Estimation.- FruitBin: a tunable large-scale dataset for advancing 6D pose estimation in fruit bin-picking automation.- Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation?.- What's Wrong with the Absolute Trajectory Error?.- MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation.- KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction.- LVG-SfM: Learning-based View-Graph generation for robust on-the-fly SfM.- Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces.- PoTATO: A Dataset for Analyzing Polarimetric Traces of Afloat Trash Objects.- EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment.- Object Pose Estimation Using Implicit Representation For Transparent Objects.- TRICKY 2024 Challenge on Monocular Depth from Images of Specular and Transparent Surfaces.- ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet.- Automatic Generation of Fashion Images using Prompting in Generative Machine Learning Models.- Fashion Attribute Extraction Under an Evolving Ontology.- Capturing and modeling real cloth deformations for virtual garment design.- MDiFF: Exploiting Multimodal Score-based Diffusion Models for New Fashion Product Performance Forecasting.- Deep Armocromia: A Novel Dataset for Face Seasonal Color Analysis and Classification.- DIVA: Deep Indic Virtual Apparel Try-On.- Garment Attribute Manipulation with Multi-level Attention.- Machine Learning-Driven Marketing Personas for the Luxury Fashion Market.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisSelf-supervised disentangled representation learning of artistic style through Neural Style Transfer.- Similar paintings retrieval from individual and multiple poses.- NeAT: Neural Artistic Tracing for high resolution Style Transfer.- DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer.- Analysis of Hybrid Compositions in Animation Film with Weakly Supervised Learning.- BackFlip: The Impact of Local and Global Data Augmentations on Artistic Image Aesthetic Assessment.- Cultural Heritage 3D Reconstruction with Diffusion Networks.- Context-Infused Visual Grounding for Art.- ColorwAI: Generative Colorways of Textiles through GAN and Diffusion Disentanglement.- Novel Artistic Scene-Centric Datasets for Effective Transfer Learning in Fragrant Spaces.- Evaluating Usability and Engagement of Large Language Models in Virtual Reality for Traditional Scottish Curling.- EUFCC-CIR: a composed image retrieval dataset for GLAM collections.- µgat: Improving Single-Page Document Parsing by Providing Multi-Page Context.- Pixels of Faith: Exploiting Visual Saliency to Detect Religious Image Manipulation.- Automatic Die Studies for Ancient Numismatics.- San Vitale Challenge: Automatic Reconstruction of Ancient Colored Glass Windows.- The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives.- An approach for dataset extension for object detection in artworks using open-vocabulary models.- A Data-Centric Module for Neural Rendering.- Structured Analysis of Alphabets in Historical Handwritten Ciphers.- Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans..- HybridFormer: Bridging Local and Global Spatio-Temporal Dynamics for Efficient Skeleton-Based Action Recognition..- MPL: Lifting 3D Human Pose from Multi-view 2D Poses..- THP3D: Text-Driven Multi-Granularity 3D Human Parsing..- ROMEO: Revisiting Optimization Methods for Reconstructing 3D HumanObject Interaction Models From Images..- Leveraging key-points Encoded Human Pose Images for Human Activity Recognition..- Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity..- Multi-Camera Industrial Open-Set Person Re-Identification and Tracking..- Enhanced Action Quality Assessment with Two-Stream Pose and Video Feature Integration..- PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation..- Enhancing Gait Recognition: Data Augmentation via Physics-Based Biomechanical Simulation..- Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB..- VATE: a Large Scale Multimodal Spontaneous Dataset for Affective Evaluation..- Guidelines for Query and Gallery Image Extraction in Person ReIdentification Systems..- Pose-independent 3D Anthropometry from Sparse Data..- Exploring 3D Face Reconstruction and Fusion Methods for Face Verification: A Case-Study in Video Surveillance..- FlexControl: Flexible and Efficient Full-Body Controllable Text-to-Motion Generation..- Coarse to Fine Human Mesh Recovery with Transformers..- Motion Reconstruction via Human Anatomy Diffusion from Sparse Tracking..- A vision-based framework for human behavior understanding in industrial assembly lines..- Boosting Pose Estimators via Cross-Representation Distillation..- Upper-Body Pose-based Gaze Estimation for Privacy-Preserving 3D Gaze Target Detection.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- AirLetters: An Open Video Dataset of Characters Drawn in the Air..- RegionGrasp: A Novel Task for Contact Region Controllable Hand Grasp Generation..- Generative Hierarchical Temporal Transformer for Hand Pose and Action Modeling..- Adaptive Multi-Modal Control of Digital Human Hand Synthesis Using a Region-Aware Cycle Loss..- Conditional Hand Image Generation using Latent Space Supervision in Random Variable Variational Autoencoders..- ChildPlay-Hand: A Dataset of Hand Manipulations in the Wild..- EMAG: Ego-motion Aware and Generalizable 2D Hand Forecasting from Egocentric Videos..- Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents..- RGMIM: Region-Guided Masked Image Modeling for Learning Meaningful Representations from X-Ray Images..- A Biologically-inspired Approach to Biomedical Image Segmentation..- Human-based Low-Level Visual Processing Neural Network for Image Segmentation..- Glia Cell Inspired Reinforcement Learning Agent for Neural Network Optimization..- Growing Deep Neural Network Considering with Similarity between Neurons..- Representation Learning in a Decomposed Encoder Design for Bio-inspired Hebbian Learning..- Reducing Catastrophic Forgetting in Online Class Incremental Learning Using Self-Distillation..- What makes a face look like a hat: Decoupling low-level and high-level visual properties with image triplets..- ScanDDM: Generalised Zero-Shot Neuro-Dynamical Modelling of GoalDirected Attention..- Limited but consistent gains in adversarial robustness by co-training object recognition models with human EEG..- Online Learning via Memory: Retrieval-Augmented Detector Adaptation..- AHMF: Adaptive Hybrid-Memory-Fusion Model for Driver Attention Prediction..- Accuracy Improvement of Cell Image Segmentation Using Feedback Former..- Variable resolution improves visual question answering under a limited pixel budget..- MOSAIC: Skeleton-based human motion recognition with compositional representations..- Adapting Large Language Model for Cross-Subject Semantic Decoding from Video-Stimulated fMRI..- A System 1 and System 2 Perspective on Continual Learning for Practical Implementation..- Connectivity-Inspired Network for Context-Aware Recognition..- Generalizability analysis of deep learning predictions of human brain responses to augmented and semantically novel visual stimuli.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- MMA-MRNNet: Harnessing Multiple Models of Affect and Dynamic Masked RNN for Precise Facial Expression Intensity Estimation..- ToddlerAct: A Toddler Action Recognition Dataset for Gross Motor Development Assessment..- 7th abaw competition: Multi-task learning and compound expression recognition..- Are Visual-Language Models Effective in Action Recognition? A Comparative Study..- Textualized and Feature-based Models for Compound Multimodal Emotion Recognition in the Wild..- Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints..- Single Image 3D Human Pose Estimation Using Sequential Joint Group Generation..- MVP: Multimodal Emotion Recognition based on Video and Physiological Signals..- Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech..- Massively Multi-Person 3D Human Motion Forecasting with Scene Context..- TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans..- Improving face generation quality and prompt following with synthetic captions..- Tracking Virtual Meetings in the Wild: Re-identification in MultiParticipant Virtual Meetings..- rPPG-SysDiaGAN: Systolic-Diastolic Feature Localization in rPPG Using Generative Adversarial Network with Multi-Domain Discriminator..- MST-KD: Multiple Specialized Teachers Knowledge Distillation for Fair Face Recognition..- Predicting Emotions in Interpersonal Interaction Videos: I Know What You Feel..- Multi-Task Affective Behaviour Analysis based on MT-EmotiNet Models..- Smoothing Predictions of Multi-Task EmotiNet Models for Compound Facial Expression Recognition..- ABAW7 Challenge: A Facial Affect Recognition Approach based on Transformer Encoder and Multilayer Perceptron..- Compound Expression Recognition via Curriculum Learning..- Boundary Matching and Refinement Network with Cross-modal Contrastive Learning for Temporal Moment Localization..- Enhancing Facial Expression Recognition through Dual-Direction Attention Mixed Feature Networks: Application to 7th ABAW Challenge..- Introducing Gating and Context into Temporal Action Detection..- Better Spanish Emotion Recognition In-the-wild: Bringing Attention to Deep Spectrum Voice Analysis..- Monitoring Viewer Attention During Online Ads..- Affective Behaviour Analysis via Progressive Learning..- Are We Friends? End-to-End Prediction of Child Rapport in Guided Play..- Affective Behavior Analysis using Task-adaptive and AU-assisted Graph..- Ig3D: Integrating 3D Face Representations in Facial Expression Inference.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval..- Rethinking Sparse Lexical Representations for Image Retrieval in the Ageof Rising Multi-Modal Large Language Models..- Boundary Attention: Learning curves, corners, junctions and grouping..- Deep Learning Meets Satellite Images - An Evaluation on Handcrafted andLearning-based Features for Multi-date Satellite Stereo Images..- Image Color Consistency in Datasets: The Smooth-TPS3D Method..- Unsupervised Video Summarization: A Reconstruction Model with ProximalGradient Methods..- TF-OCM: Training Free Optimal Community Matching for Domain Generalized Few Shot Learning..- Perspective-Equivariance for Unsupervised Imaging with Camera Geometry..- Reliable Probabilistic Human Trajectory Prediction for Autonomous Applications..- Calibration of Network Confidence for Unsupervised Domain AdaptationUsing Estimated Accuracy..- Logit disagreement: OoD Detection with Bayesian Neural Networks..- Can Your Generative Model Detect Out-of-Distribution Covariate Shift?..- Multi-label out-of-distribution detection via evidential learning..- UTrack: Multi-Object Tracking with Uncertain Detections..- TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection..- Sanity Checks for Explanation Uncertainty..- Sources of Uncertainty in 3D Scene Reconstruction..- The BRAVO Semantic Segmentation Challenge Results in UNCV2024..- Maximally Separated Active Learning..- Hyperbolic Metric Learning for Visual Outlier Detection..- A Bottom-Up Approach to Class-Agnostic Image Segmentation..- Adversarial Attacks on Hyperbolic Networks..- Hyperbolic Learning with Multimodal Large Language Models..- Embedding Geometries of Contrastive Language-Image Pre-Training..- Learning Multi-Manifold Embedding for Out-Of-Distribution Detection.- ProxyDR: Deep Hyperspherical Metric Learning with Distance Ratio-BasedFormulation..- Backward-Compatible Aligned Representations via Orthogonal Transformation Layer.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- DeepClean: Machine Unlearning on the Cheap by Resetting Privacy Sensitive Weights using the Fisher Diagonal..- Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models..- Aligning Vision Language Models with Contrastive Learning..- Open-set object detection: towards unified problem formulation and benchmarking..- Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts..- SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation..- Online Stochastic Optimization for Data with Temporal Dependencies..- A Lost Opportunity for Vision-Language Models: A Comparative Study of Online Test-Time Adaptation for Vision-Language Models..- OSSA: Unsupervised One-Shot Style Adaptation..- ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer..- Open-set Plankton Recognition..- Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation?..- On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes..- Source-Free Domain Adaptation for YOLO Object Detection..- Task-Specific Adaptation of Segmentation Foundation Model via Prompt Learning..- Utilizing Class-Agnostic Point-to-Box Regressors as Object Proposal Generators..- Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective..- Improving Generalization in Visual Reasoning via Self-Ensemble..- BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation..- Image Translation with Kernel Prediction Networks for Semantic Segmentation..- Robust fine-tuning and adaptation of zero-shot models via adaptive weightspace ensembling..- Robustness to Spurious Correlation: A Comprehensive Review.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- Fine-tuning a Multiple Instance Learning Feature Extractor with Masked Context Modelling and Knowledge Distillation..- Advancing Medical Radiograph Representation Learning: A Hybrid Pretraining Paradigm with Multilevel Semantic Granularity..- Can virtual staining for high-throughput screening generalize?..- SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical Images..- A Good Feature Extractor Is All You Need for Weakly Supervised Pathology Slide Classification..- Boosting Medical Image Registration Network Inherently via Collaborative Learning..- Genetic Information Analysis of Age-Related Macular Degeneration Fellow Eye Using Multi-Modal Selective ViT..- CHOTA: A Higher Order Accuracy Metric for Cell Tracking..- Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification..- Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images..- BATseg: Boundary-aware Multiclass Spinal Cord Tumor Segmentation on 3D MRI Scans..- Affinity-VAE: incorporating prior knowledge in representation learning from scientific images..- Towards the Discovery of Down Syndrome Brain Biomarkers Using Generative Models..- Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis..- SS-MIL: Attention-Based Selective Correlated Multiple Instance Learning for Whole Slide Image Classification..- MicroSSIM: Improved Structured Similarity for Comparing Microscopy Data..- Generalized Segmentation for Maxillary Sinus and Mandibular Canal in Dental Panoramic X-rays..- MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation..- NCT-CRC-HE: Not All Histopathological Datasets Are Equally Useful..- Tracking one-in-a-million: Large-scale benchmark for microbial single-cell tracking with experiment-aware robustness metrics..- A Novel Approach to Linking Histology Images with DNA Methylation.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisValeo4Cast: A Modular Approach to End-to-End Forecasting.- AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data.- Autonomous Drone-Person Tracking and Following in Uniform Appearance Scenarios.- Continual Reinforcement Learning with Implicit Generative Replay for Autonomous Driving.- Self-supervised Road Accident Anticipation with Non-decreasing Danger.- 3D Object Detection and Tracking Refinement with Ensemble Methods and Spatiotemporal Filtering.- Conditional Unscented Autoencoders for Trajectory Prediction.- Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation.- TrackLidFormer: a Transformer-based Approach for Occluded Object Tracking.- Good Data Is All Imitation Learning Needs.- What Matters in Autonomous Driving Anomaly Detection: A Weakly Supervised Horizon.- High Dynamic Range Modulo Imaging for Robust Object Detection in Autonomous Driving.- RLNet: Adaptive Fusion of 4D Radar and Lidar for 3D Object Detection.- Improving Online Source-Free Domain Adaptation for Object Detection by Unsupervised Data Acquisition.- AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving.- On Camera and LiDAR Positions in End-to-End Autonomous Driving.- ProGBA: Prompt Guided Bayesian Augmentation for Zero-shot Domain Adaptation.- ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable.- Loop Mining Large-Scale Unlabeled Data for Corner Case Detection in Autonomous Driving.- HumanSim: Human-Like Multi-Agent Novel Driving Simulation for Corner Case Generation.- Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D Visual Grounding.- RoSA Dataset: Road Construct zone Segmentation for Autonomous Driving.- A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection.- The Second Visual Object Tracking Segmentation VOTS2024 Challenge Results.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisMulti-agent Collaborative Perception for Robotic Fleet: A Systematic Review.- RP3D: A Roadside Perception Framework for 3D Object Detection via Multi-View Sensor Fusion.- StreamLTS: Query-based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection.- GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest.- SC-Track: State Transition and Constrained Non-negative Matrix Factorization for Multi-Camera Multi-Target Tracking.- Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones.- VICooper: A Practical Vehicle-Infrastructure Cooperative Perception Framework for Autonomous Driving.- MEDCO: Medical Education Copilots Based on A Multi-Agent Framework.- V2X-Based Decentralized Singular Value Decomposition in Dynamic Vehicular Environment.- LLaMAPed: Multi-modal Pedestrian Crossing Intention Prediction.- Optimization of Layer Skipping and Frequency Scaling for Convolutional Neural Networks under Latency Constraint.- An Infrastructure-based Localization Method for Articulated Vehicles.- HEAD: A Bandwidth-Efficient Cooperative Perception Approach for Heterogeneous Connected and Autonomous Vehicles.- Rethinking the Role of Infrastructure in Collaborative Perception.- Empowering Autonomous Shuttles with Next-Generation Infrastructure.- MAPPO-PIS: A Multi-Agent Proximal Policy Optimization Method with Prior Intent Sharing for CAVs' Cooperative Decision-Making.- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version).- iIPPC-V2X: Multi-modality Fusion Perception System for Cooperative Vehicle Infrastructure System with Self-supervised Learning.- Non-verbal Interaction and Interface with a Quadruped Robot using Body and Hand Gestures: Design and User Experience Evaluation.- Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisWild Berry image dataset collected in Finnish forests and peatlands using drones.- Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks.- A Framework for Enhanced Decision Support in Digital Agriculture Using Explainable Machine Learning.- Lincoln's Annotated Spatio-Temporal Strawberry Dataset (LAST-Straw).- 3D Phenotyping of Canopy Occupation Volume as a Major Predictor for Canopy Photosynthesis in Rice (Oryza sativa L.).- Retrieval of sun-induced plant fluorescence in the O2-A absorption band from DESIS imagery.- Unsupervised Tomato Split Anomaly Detection using Hyperspectral Imaging and Variational Autoencoders.- KAN You See It? KANs and Sentinel for Effective and Explainable Crop Field Segmentation.- RoWeeder: Unsupervised Weed Mapping through Crop-Row Detection.- Consolidation of symbolic instances using sensor data via tracklet merging for long-term monitoring of crops.- Automated Generation of Accurate, Compact and Focused Crop and Weed Segmentation Models.- Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection.- Towards Auto-Generated Ground Truth for Evaluation of Perception Systems in Agriculture.- AgriBench: A Hierarchical Agriculture Benchmark for Multimodal Large Language Models.- Deep Learning Based Growth Modeling of Plant Phenotypes.- A simple approach to pavement cell segmentation.- Enhancing weed detection performance by means of GenAI-based image augmentation.- SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture.- Robust UDA for Crop and Weed Segmentation: Multi-Scale Attention and Style-Adaptive Techniques.- Ordinal-Meta Learning for Fine-grained Fruit Quality Prediction.- Beyond Annotations: Efficient Wheat Head Segmentation Using L-Systems, Game Engines, and Student-Teacher Models.- Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisDiffusion-based Light Field Synthesis.- Diffusion-Promoted HDR Video Reconstruction.- Lightweight Deep Learning Model for Defective Pixel Detection and Recovery from the Image Sensors.- IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts.- Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models.- MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance.- RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content.- Detecting Forged Sentinel-2 Images Through Parallax-Based Cloud Analysis.- PRISM: Progressive Restoration for Scene Graph-based Image Manipulation.- DAVIDE: Depth-Aware Video Deblurring.- RenDetNet: Weakly-supervised Shadow Detection with Shadow Caster Verification.- Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion model.- Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications.- QSD: Query-Selection Denoising score for Image Edit-ing in Latent Diffusion Model.- PDB Unet: A spatio temporal video Fixed Pattern Noise removal network.- Reversible and Cascaded Lightweight Colour Constancy: Jointly Addressing Illumination Correction and White Balance.- Hybrid Spatial-spectral Neural Network for Hyperspectral Image Denoising.- Solving Inverse Problem With Unspecified Forward Operator Using Diffusion Models.- A Disentangled Approach to Predict the Aesthetic Outcomes of Breast Cancer Treatment.- LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model.- Pushing Joint Image Denoising and Classification to the Edge.- Self-Supervised HDR Imaging from Motion and Exposure Cues.- Closer to Ground Truth: Realistic Shape and Appearance Labeled Data Generation for Unsupervised Underwater Image Segmentation.- Edge-aware Consistent Stereo Video Depth Estimation.- Low-Cost Stereoscopic Optical-Coding Design for Depth Estimation Using End-to-End Optimization.- 360U-Former: HDR Illumination Estimation with Panoramic Adapted Vision Transformers.- Satellite Image Dehazing Via Masked Image Modeling and Jigsaw Transformation.- UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment.
£123.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- TONO: a synthetic dataset for face image compliance to ISO/ICAO standard..- mproving Post-Earthquake Crack Detection using Semi-Synthetic Gener ated Images..- DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition..- Neural Transcoding Vision Transformers for EEG-to-fMRI Synthesis..- RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models..- NeRFmentation: NeRF-based Augmentation for Monocular Depth Estima tion..- Synthetic to Authentic: Transferring Realism to 3D Face Renderings for Boosting Face Recognition..- Time-Resolved MNIST Dataset for Single-Photon Recognition..- NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Hu man Pose Estimation in Top-View Fisheye Images..- Training and Benchmarking Leukocyte Sub-types Classification Methods with Synthetic Images..- DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling..- Contextual Knowledge Pursuit for Faithful Visual Synthesis..- SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models..- Diffusion-based Synthetic Dataset Generation for Egocentric 3D Human Pose Estimation..- BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabil ities in Pretrained Diffusion Models..- A CycleGAN Model to Synthesize Missing and Unpaired MRI Sequences for Under-Represented Multiple Sclerosis Lesions..- The Impact of Balancing Real and Synthetic Data on Accuracy and Fairness in Face Recognition..- DreamTexture: High-Fidelity Synthetic 3D Data Generation through De coupled Geometry and Texture Synthesis..- Control+Shift: Generating Controllable Distribution Shifts..- Comparative Analysis of Synthetic and Real Melanoma Images in AI-Driven Diagnosis..- How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recog nition..- Synthetic Generation of Dermatoscopic Images with GAN and Closed-Form Factorization..- FABRIC: Personalizing Diffusion Models with Iterative Feedback..- TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisFew-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting.- On Scaling Up 3D Gaussian Splatting Training.- AEPnP: A Less-constrained EPnP Solver for Pose Estimation with Anisotropic Scaling.- Scalable Indoor Novel-View Synthesis using Drone-Captured 360 Imagery with 3D Gaussian Splatting.- Space3D-Bench: Spatial 3D Question Answering Benchmark.- VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field.- NeRF-Supervised Feature Point Detection and Description.- Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks.- Real-Time 2nd-order Gaze Metrics.- Normalized Validity Scores for DNNs in Regression based Eye Feature Extraction and Real-Time Models for the Raspberry Pi.- Helios: An extremely low power event-based gesture recognition for always-on smart eyewear.- CondSeg: Ellipse Estimation of Pupil and Iris via Conditioned Segmentation.- Towards Unsupervised Eye-Region Segmentation for Eye Tracking.- Towards Low-power, High-frequency Gaze Direction Tracking with an Event-camera.- Towards Resource-aware Visual Inertial SLAM.- Evaluating Human Pose Estimation Algorithms for Resource-Constrained Smart Eyewear Device.- Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses With TinyissimoYOLO.- Towards Real-Time Online Egocentric Action Recognition on Smart Eye-wear.- High-frequency near-eye ground truth for event-based eye tracking.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book Synopsis.- Landmark-Based Screening: Femoral Head Coverage and Graf Classificationin Infant Developmental Dysplasia of the Hip..- MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition..- One-Shot Image Restoration..- Medical Image Segmentation with SAM-generated Annotations..- Manipulating and Mitigating Generative Model Biases without Retraining..- Fake or JPEG? Revealing Common Biases in Generated Image DetectionDatasets..- Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image GenerativeModels..- A semiotic methodology for assessing the compositional effectiveness of generativetext-to-image models (Midjourney and DALLoE)..- A Framework for Critical Evaluation of Text-to-Image Models: IntegratingArt Historical Analysis, Artistic Exploration, and Critical Prompt Engineering..- Civiverse: A Dataset for Analyzing User Engagement with Open-SourceTTI-Models..- Exploring the Boundaries of Content Moderation in Text-to-Image Generation..- Rethinking HTG Evaluation: Bridging Generation and Recognition..- Evaluation Framework for Feedback Generation Methods in Skeletal MovementAssessment..- FaceOracle: Chat with a Face Image Oracle..- Makeup-Guided Facial Privacy Protection via Untrained Neural NetworkPriors..- How to Squeeze An Explanation Out of Your Model..- How were you created? Explaining synthetic face images generated by diffusionmodels..- Frequency Matters: Explaining Biases of Face Recognition in the FrequencyDomain..- How green is continual learning, really? Analyzing the energy consumptionin continual training of vision foundation models..- Architecture-Agnostic Unsupervised Gradient Regularization ForParameter-Efficient Transfer Learning..- Foundation Model or Finetune? Evaluation of few-shot semantic segmentationfor river pollution..- Personalizing Multimodal Large Language Models for Image Captioning: AnExperimental Analysis..- Improved Baselines for Data-efficient Perceptual Augmentation of LLMs..- Watt for What: Rethinking Deep Learning’s Energy-Performance Relationship.
£66.49
Springer Computer Vision ECCV 2024 Workshops
Book SynopsisGenerating Binary Species Range Maps.- WildFusion: Individual Animal Identification with Calibrated Similarity Fusion.- Towards Zero-Shot Camera Trap Image Categorization.- Larval Hostplant Prediction from Luehdorfia japonica Image using Multi-label ABN.- Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation.- Underwater Uncertainty: A Multi-Annotator Image Dataset for Benthic Habitat Classification.- Deep Learning for Automated Shark Detection and Biometrics Without Keypoints.- Improving in situ real-time classification of long-tail marine plankton images for ecosystem studies.- KAN-Mixer: Kolmogorov-Arnold Networks for Gene Expression Prediction in Plant Species.- Multi-Scale and Multimodal Species Distribution Modelling.- Semantic Segmentation of Benthic Classes in Reef Environments using a Large Vision Transformer.- POLO - Point-based, multi-class animal detection.- Mining Field Data for Tree Species Recognition at Scale.- MaskSDM: Adaptive species distribution modeling through data masking.- Fine-tuning for Bird Sound Classification: An Empirical Study.- Multimodal Fusion Strategies for Mapping Biophysical Landscape Features.- I-Design: Personalized LLM Interior Designer.- NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training.- GeoTransfer : Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning.- DiVR: incorporating context from diverse VR scenes for human trajectory prediction.- Skeleton-Aware Motion Retargeting Using Masked Pose Modeling.- LucidDreaming: Controllable Object-Centric 3D Generation.- BehAVE: Behaviour Alignment of Video Game Encodings.- Collaborative Control for Geometry-Conditioned PBR Image Generation.- Real-Time Neural Cloth Deformation using a Compact Latent Space and a Latent Vector Predictor.- Level Up Your Tutorials: VLMs for Game Tutorials Quality Assessment.- Across-Game Engagement Modelling via Few-Shot Learning.- Hand2Any: Hand-to-Any Motion Mapping with Few-Shot User Adaptation for Avatar Manipulation.- SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers.- PlaMo: Plan and Move in Rich 3D Physical Environments.
£123.49