Description

Book Synopsis

Depth-guided NeRF Training via Earth Mover's Distance.- INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding.- DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks.- Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time.- Diagnosing and Re-learning for Balanced Multimodal Learning.- Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration.- Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders.- BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion.- SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views.- MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning.- Discovering Unwritten Visual Classifiers with Large Language Models.- LITA: Language Instructed Temporal-Localization Assistant.- MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain.- Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs.- Bridging the Pathology Domain Gap: Efficiently Adapting CLIP for Pathology Image Analysis with Limited Labeled Data.- AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation.- CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection.- SAH-SCI: Self-Supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging.- Minimalist Vision with Freeform Pixels.- All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation.- LatentEditor: Text Driven Local Editing of 3D Scenes.- Single-Photon 3D Imaging with Equi-Depth Photon Histograms.- Asynchronous Bioplausible Neuron for Spiking Neural Networks for Event-Based Vision.- Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models.- POET: Prompt Offset Tuning for Continual Human Action Adaptation.- Domain Generalization of 3D Object Detection by Density-Resampling.- IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers.

Computer Vision ECCV 2024

    Product form

    £59.99

    Includes FREE delivery

    Order before 4pm today for delivery by Mon 15 Jun 2026.

    A Paperback by Aleš Leonardis

    15 in stock


      View other formats and editions of Computer Vision ECCV 2024 by Aleš Leonardis

      Publisher: Springer
      Publication Date: 31/10/2024
      ISBN13: 9783031730382, 978-3031730382
      ISBN10:

      Description

      Book Synopsis

      Depth-guided NeRF Training via Earth Mover's Distance.- INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding.- DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks.- Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time.- Diagnosing and Re-learning for Balanced Multimodal Learning.- Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration.- Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders.- BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion.- SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views.- MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning.- Discovering Unwritten Visual Classifiers with Large Language Models.- LITA: Language Instructed Temporal-Localization Assistant.- MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain.- Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs.- Bridging the Pathology Domain Gap: Efficiently Adapting CLIP for Pathology Image Analysis with Limited Labeled Data.- AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation.- CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection.- SAH-SCI: Self-Supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging.- Minimalist Vision with Freeform Pixels.- All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation.- LatentEditor: Text Driven Local Editing of 3D Scenes.- Single-Photon 3D Imaging with Equi-Depth Photon Histograms.- Asynchronous Bioplausible Neuron for Spiking Neural Networks for Event-Based Vision.- Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models.- POET: Prompt Offset Tuning for Continual Human Action Adaptation.- Domain Generalization of 3D Object Detection by Density-Resampling.- IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers.

      Recently viewed products

      © 2026 Book Curl

        • American Express
        • Apple Pay
        • Diners Club
        • Discover
        • Google Pay
        • Maestro
        • Mastercard
        • PayPal
        • Shop Pay
        • Union Pay
        • Visa

        Login

        Forgot your password?

        Don't have an account yet?
        Create account