Description

Book Synopsis

SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
InterFusion: Text-Driven Generation of 3D Human-Object Interaction
GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition
NeRF-XL: NeRF at Any Scale with Multi-GPU
CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
Compositional Substitutivity of Visual Reasoning for Visual Question Answering
LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models
DNI: Dilutional Noise Initialization for Diffusion Video Editing
Two-Stage Video Shadow Detection via Temporal-Spatial Adaption
Towards Physical World Backdoor Attacks against Skeleton Action Recognition
SAM-guided Graph Cut for 3D Instance Segmentation
Fully Authentic Visual Question Answering Dataset from Online Communities
Active Generation for Image Classification
FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors
Learning Local Pattern Modularization for Point Cloud Reconstruction from Unseen Classes
Understanding Multi-compositional learning in Vision and Language models via Category Theory
FedRA: A Random Allocation Strategy for Federated Tuning to Unleash the Power of Heterogeneous Clients
Panel-Specific Degradation Representation for Raw Under-Display Camera Image Restoration
Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image
Diffusion-Guided Weakly Supervised Semantic Segmentation
Weakly-Supervised Spatio-Temporal Video Grounding with Variational Cross-Modal Alignment
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image
Segment and Recognize Anything at Any Granularity

Computer Vision ECCV 2024

    £64.99

    A Paperback by Aleš Leonardis

    15 in stock


      Publisher: Springer
      Publication Date: 27/11/2024
      ISBN13: 9783031731945

