Description

Book Synopsis

SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark.- AttnZero: Efficient Attention Discovery for Vision Transformers.- Auto-GAS: Automated Proxy Discovery for Training-free Generative Architecture Search.- Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search.- UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation.- TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning.- Spectral Subsurface Scattering for Material Classification.- nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding.- Dynamic Neural Radiance Field From Defocused Monocular Video.- PiTe: Pixel-Temporal Alignment for Large Video-Language Model.- CarFormer: Self-Driving with Learned Object-Centric Representations.- FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models.- Plain-Det: A Plain Multi-Dataset Object Detector.- Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation.- Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation.- Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching.- Text-Guided Video Masked Autoencoder.- Diffusion Models for Open-Vocabulary Segmentation.- Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation.- EvSign: Sign Language Recognition and Translation with Streaming Events.- QUAR-VLA: Vision-Language-Action Model for Quadruped Robots.- Zero-shot Object Counting with Good Exemplars.- TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering.- SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds.- PartSTAD: 2D-to-3D Part Segmentation Task Adaptation.- FutureDepth: Learning to Predict the Future Improves Video Depth Estimation.- LLM as Copilot for Coarse-grained Vision-and-Language Navigation.

Computer Vision ECCV 2024

    Product form

    £64.99

    Includes FREE delivery

    Order before 4pm today for delivery by Mon 15 Jun 2026.

    A Paperback by Aleš Leonardis

    15 in stock


      View other formats and editions of Computer Vision ECCV 2024 by Aleš Leonardis

      Publisher: Springer
      Publication Date: 30/10/2024
      ISBN13: 9783031726514, 978-3031726514
      ISBN10:

      Description

      Book Synopsis

      SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark.- AttnZero: Efficient Attention Discovery for Vision Transformers.- Auto-GAS: Automated Proxy Discovery for Training-free Generative Architecture Search.- Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search.- UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation.- TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning.- Spectral Subsurface Scattering for Material Classification.- nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding.- Dynamic Neural Radiance Field From Defocused Monocular Video.- PiTe: Pixel-Temporal Alignment for Large Video-Language Model.- CarFormer: Self-Driving with Learned Object-Centric Representations.- FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models.- Plain-Det: A Plain Multi-Dataset Object Detector.- Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation.- Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation.- Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching.- Text-Guided Video Masked Autoencoder.- Diffusion Models for Open-Vocabulary Segmentation.- Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation.- EvSign: Sign Language Recognition and Translation with Streaming Events.- QUAR-VLA: Vision-Language-Action Model for Quadruped Robots.- Zero-shot Object Counting with Good Exemplars.- TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering.- SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds.- PartSTAD: 2D-to-3D Part Segmentation Task Adaptation.- FutureDepth: Learning to Predict the Future Improves Video Depth Estimation.- LLM as Copilot for Coarse-grained Vision-and-Language Navigation.

      Recently viewed products

      © 2026 Book Curl

        • American Express
        • Apple Pay
        • Diners Club
        • Discover
        • Google Pay
        • Maestro
        • Mastercard
        • PayPal
        • Shop Pay
        • Union Pay
        • Visa

        Login

        Forgot your password?

        Don't have an account yet?
        Create account