Matches in SemOpenAlex for { <https://semopenalex.org/work/W3206836360> ?p ?o ?g. }
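The quad pattern in the heading above can be written out as a SPARQL query. The following is a minimal sketch, not the service's own query: the helper name `build_quad_query` is hypothetical, and it assumes the store exposes these statements through a `GRAPH ?g` clause. The `@default` marker on each match suggests the default graph, so on some stores the `GRAPH` wrapper may need to be dropped.

```python
def build_quad_query(subject_iri: str) -> str:
    """Return a SPARQL SELECT query for every (?p, ?o, ?g) about subject_iri.

    Hypothetical helper for illustration; it only builds the query text
    and does not contact any endpoint.
    """
    # Reject characters that are not allowed inside a SPARQL IRI reference.
    if any(ch in subject_iri for ch in '<>"{}|^`\\ '):
        raise ValueError(f"not a valid IRI reference: {subject_iri!r}")
    return (
        "SELECT ?p ?o ?g WHERE {\n"
        f"  GRAPH ?g {{ <{subject_iri}> ?p ?o . }}\n"
        "}"
    )


WORK_IRI = "https://semopenalex.org/work/W3206836360"
query = build_quad_query(WORK_IRI)
print(query)
```

Each row returned by such a query would correspond to one line of the listing: predicate, object, and the graph the statement belongs to.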
- W3206836360 abstract "Recently, DETR and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating performance as good as that of previous complex hand-crafted detectors. However, their performance on Video Object Detection (VOD) has not been well explored. In this paper, we present TransVOD, an end-to-end video object detection model based on a spatial-temporal Transformer architecture. The goal of this paper is to streamline the pipeline of VOD, effectively removing the need for many hand-crafted components for feature aggregation, e.g., optical flow, recurrent neural networks, relation networks. Besides, benefiting from the object query design in DETR, our method does not need complicated post-processing methods such as Seq-NMS or Tubelet rescoring, which keeps the pipeline simple and clean. In particular, we present a temporal Transformer to aggregate both the spatial object queries and the feature memories of each frame. Our temporal Transformer consists of three components: a Temporal Deformable Transformer Encoder (TDTE) to encode multi-frame spatial details, a Temporal Query Encoder (TQE) to fuse object queries, and a Temporal Deformable Transformer Decoder (TDTD) to obtain current frame detection results. These designs boost the strong baseline, Deformable DETR, by a significant margin (3%-4% mAP) on the ImageNet VID dataset. TransVOD yields comparable performance on the ImageNet VID benchmark. We hope our TransVOD can provide a new perspective for video object detection." @default.
- W3206836360 created "2021-10-25" @default.
- W3206836360 creator A5015078046 @default.
- W3206836360 creator A5024097240 @default.
- W3206836360 creator A5024417931 @default.
- W3206836360 creator A5031210225 @default.
- W3206836360 creator A5032618817 @default.
- W3206836360 creator A5035339773 @default.
- W3206836360 creator A5045854934 @default.
- W3206836360 creator A5069553088 @default.
- W3206836360 creator A5084218062 @default.
- W3206836360 creator A5089900108 @default.
- W3206836360 date "2021-10-17" @default.
- W3206836360 modified "2023-10-18" @default.
- W3206836360 title "End-to-End Video Object Detection with Spatial-Temporal Transformers" @default.
- W3206836360 cites W1861492603 @default.
- W3206836360 cites W2108598243 @default.
- W3206836360 cites W2117539524 @default.
- W3206836360 cites W2194775991 @default.
- W3206836360 cites W2336589871 @default.
- W3206836360 cites W2565639579 @default.
- W3206836360 cites W2601564443 @default.
- W3206836360 cites W2898044248 @default.
- W3206836360 cites W2904617485 @default.
- W3206836360 cites W2921015377 @default.
- W3206836360 cites W2962766617 @default.
- W3206836360 cites W2962855257 @default.
- W3206836360 cites W2963091558 @default.
- W3206836360 cites W2963585656 @default.
- W3206836360 cites W2964086649 @default.
- W3206836360 cites W2969727121 @default.
- W3206836360 cites W2982723417 @default.
- W3206836360 cites W2982770724 @default.
- W3206836360 cites W2990578161 @default.
- W3206836360 cites W2996794639 @default.
- W3206836360 cites W3034467781 @default.
- W3206836360 cites W3084874594 @default.
- W3206836360 cites W3092900809 @default.
- W3206836360 cites W3096609285 @default.
- W3206836360 cites W3097550038 @default.
- W3206836360 cites W3100094580 @default.
- W3206836360 cites W3106250896 @default.
- W3206836360 cites W607748843 @default.
- W3206836360 cites W639708223 @default.
- W3206836360 doi "https://doi.org/10.1145/3474085.3475285" @default.
- W3206836360 hasPublicationYear "2021" @default.
- W3206836360 type Work @default.
- W3206836360 sameAs 3206836360 @default.
- W3206836360 citedByCount "33" @default.
- W3206836360 countsByYear W32068363602022 @default.
- W3206836360 countsByYear W32068363602023 @default.
- W3206836360 crossrefType "proceedings-article" @default.
- W3206836360 hasAuthorship W3206836360A5015078046 @default.
- W3206836360 hasAuthorship W3206836360A5024097240 @default.
- W3206836360 hasAuthorship W3206836360A5024417931 @default.
- W3206836360 hasAuthorship W3206836360A5031210225 @default.
- W3206836360 hasAuthorship W3206836360A5032618817 @default.
- W3206836360 hasAuthorship W3206836360A5035339773 @default.
- W3206836360 hasAuthorship W3206836360A5045854934 @default.
- W3206836360 hasAuthorship W3206836360A5069553088 @default.
- W3206836360 hasAuthorship W3206836360A5084218062 @default.
- W3206836360 hasAuthorship W3206836360A5089900108 @default.
- W3206836360 hasBestOaLocation W32068363602 @default.
- W3206836360 hasConcept C104317684 @default.
- W3206836360 hasConcept C111919701 @default.
- W3206836360 hasConcept C118505674 @default.
- W3206836360 hasConcept C119599485 @default.
- W3206836360 hasConcept C127413603 @default.
- W3206836360 hasConcept C153180895 @default.
- W3206836360 hasConcept C154945302 @default.
- W3206836360 hasConcept C165801399 @default.
- W3206836360 hasConcept C185592680 @default.
- W3206836360 hasConcept C2776151529 @default.
- W3206836360 hasConcept C31972630 @default.
- W3206836360 hasConcept C41008148 @default.
- W3206836360 hasConcept C55493867 @default.
- W3206836360 hasConcept C66322947 @default.
- W3206836360 hasConcept C66746571 @default.
- W3206836360 hasConceptScore W3206836360C104317684 @default.
- W3206836360 hasConceptScore W3206836360C111919701 @default.
- W3206836360 hasConceptScore W3206836360C118505674 @default.
- W3206836360 hasConceptScore W3206836360C119599485 @default.
- W3206836360 hasConceptScore W3206836360C127413603 @default.
- W3206836360 hasConceptScore W3206836360C153180895 @default.
- W3206836360 hasConceptScore W3206836360C154945302 @default.
- W3206836360 hasConceptScore W3206836360C165801399 @default.
- W3206836360 hasConceptScore W3206836360C185592680 @default.
- W3206836360 hasConceptScore W3206836360C2776151529 @default.
- W3206836360 hasConceptScore W3206836360C31972630 @default.
- W3206836360 hasConceptScore W3206836360C41008148 @default.
- W3206836360 hasConceptScore W3206836360C55493867 @default.
- W3206836360 hasConceptScore W3206836360C66322947 @default.
- W3206836360 hasConceptScore W3206836360C66746571 @default.
- W3206836360 hasFunder F4320321001 @default.
- W3206836360 hasFunder F4320335777 @default.
- W3206836360 hasLocation W32068363601 @default.
- W3206836360 hasLocation W32068363602 @default.
- W3206836360 hasOpenAccess W3206836360 @default.
- W3206836360 hasPrimaryLocation W32068363601 @default.
- W3206836360 hasRelatedWork W1497101000 @default.