2 code implementations • 25 Mar 2024 • Zicong Fan, Takehiko Ohkawa, Linlin Yang, Nie Lin, Zhishan Zhou, Shihao Zhou, Jiajun Liang, Zhong Gao, Xuanyang Zhang, Xue Zhang, Fei Li, Liu Zheng, Feng Lu, Karim Abou Zeid, Bastian Leibe, Jeongwan On, Seungryul Baek, Aditya Prakash, Saurabh Gupta, Kun He, Yoichi Sato, Otmar Hilliges, Hyung Jin Chang, Angela Yao
We interact with the world with our hands and see it through our own (egocentric) perspective.
1 code implementation • 8 Feb 2024 • Patrick Wienholt, Alexander Hermans, Firas Khader, Behrus Puladi, Bastian Leibe, Christiane Kuhl, Sven Nebelung, Daniel Truhn
This study investigates the application of ordinal regression methods for categorizing disease severity in chest radiographs.
no code implementations • 8 Feb 2024 • Idil Esen Zulfikar, Sabarinath Mahadevan, Paul Voigtlaender, Bastian Leibe
We propose a novel Point-VOS task with a spatio-temporally sparse point-wise annotation scheme that substantially reduces the annotation effort.
1 code implementation • 28 Jan 2024 • Johannes Raufeisen, Kunpeng Xie, Fabian Hörst, Till Braunschweig, Jianning Li, Jens Kleesiek, Rainer Röhrig, Jan Egger, Bastian Leibe, Frank Hölzle, Alexander Hermans, Behrus Puladi
Utilizing this dataset, we compared the performance of Cyto R-CNN to other popular cell segmentation algorithms, including QuPath's built-in algorithm, StarDist and Cellpose.
no code implementations • 8 Dec 2023 • Jonas Schult, Sam Tsai, Lukas Höllein, Bichen Wu, Jialiang Wang, Chih-Yao Ma, Kunpeng Li, Xiaofang Wang, Felix Wimbauer, Zijian He, Peizhao Zhang, Bastian Leibe, Peter Vajda, Ji Hou
Central to our approach is a user-defined 3D semantic proxy room that outlines a rough room layout based on semantic bounding boxes and a textual description of the overall room style.
1 code implementation • 28 Sep 2023 • Kadir Yilmaz, Jonas Schult, Alexey Nekrasov, Bastian Leibe
With this intention, we propose Mask4Former for the challenging task of 4D panoptic segmentation of LiDAR point clouds.
Ranked #1 on 4D Panoptic Segmentation on SemanticKITTI
no code implementations • 18 Aug 2023 • Jonathon Luiten, Georgios Kopanas, Bastian Leibe, Deva Ramanan
We present a method that simultaneously addresses the tasks of dynamic scene novel-view synthesis and six degree-of-freedom (6-DOF) tracking of all dense scene elements.
1 code implementation • 3 Aug 2023 • Alexey Nekrasov, Alexander Hermans, Lars Kuhnert, Bastian Leibe
Our approach centers on an out-of-distribution segmentation model for identifying uncertain regions and a strong generalist segmentation model for anomaly instances segmentation.
no code implementations • 1 Jun 2023 • Yuanwen Yue, Sabarinath Mahadevan, Jonas Schult, Francis Engelmann, Bastian Leibe, Konrad Schindler, Theodora Kontogianni
In an iterative process, the model assigns each data point to an object (or the background), while the user corrects errors in the resulting segmentation and feeds them back into the model.
no code implementations • ICCV 2023 • Amit Kumar Rana, Sabarinath Mahadevan, Alexander Hermans, Bastian Leibe
We introduce a more efficient approach, called DynaMITe, in which we represent user interactions as spatio-temporal queries to a Transformer decoder with a potential to segment multiple object instances in a single iteration.
1 code implementation • 29 Mar 2023 • Karim Abou Zeid, Jonas Schult, Alexander Hermans, Bastian Leibe
Recently, the self-supervised learning framework data2vec has shown inspiring performance for various modalities using a masked student-teacher approach.
Ranked #3 on Few-Shot 3D Point Cloud Classification on ModelNet40 10-way (20-shot) (using extra training data)
3D Part Segmentation Few-Shot 3D Point Cloud Classification +3
1 code implementation • CVPR 2023 • Ali Athar, Alexander Hermans, Jonathon Luiten, Deva Ramanan, Bastian Leibe
A single TarViS model can be trained jointly on a collection of datasets spanning different tasks, and can hot-swap between tasks during inference without any task-specific retraining.
Ranked #2 on Video Panoptic Segmentation on KITTI-STEP (using extra training data)
1 code implementation • 29 Dec 2022 • István Sárándi, Alexander Hermans, Bastian Leibe
Our approach scales to an extreme multi-dataset regime, where we use 28 3D human pose datasets to supervise one model, which outperforms prior work on a range of benchmarks, including the challenging 3D Poses in the Wild (3DPW) dataset.
no code implementations • ICCV 2023 • Ayça Takmaz, Jonas Schult, Irem Kaftan, Mertcan Akçay, Bastian Leibe, Robert Sumner, Francis Engelmann, Siyu Tang
We address this challenge and propose a framework for generating training data of synthetic humans interacting with real 3D scenes.
1 code implementation • 6 Oct 2022 • Jonas Schult, Francis Engelmann, Alexander Hermans, Or Litany, Siyu Tang, Bastian Leibe
Modern 3D semantic instance segmentation approaches predominantly rely on specialized voting mechanisms followed by carefully designed geometric clustering techniques.
Ranked #1 on 3D Instance Segmentation on STPLS3D
3D Instance Segmentation 3D Semantic Instance Segmentation +1
1 code implementation • 29 Sep 2022 • Lars Kreuzberg, Idil Esen Zulfikar, Sabarinath Mahadevan, Francis Engelmann, Bastian Leibe
Our voting-based tracklet generation method followed by geometric feature-based aggregation generates significantly improved panoptic LiDAR segmentation quality when compared to modeling the entire 4D volume using Gaussian probability distributions.
Ranked #4 on 4D Panoptic Segmentation on SemanticKITTI
1 code implementation • 25 Sep 2022 • Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan
Multiple existing benchmarks involve tracking and segmenting objects in video e. g., Video Object Segmentation (VOS) and Multi-Object Tracking and Segmentation (MOTS), but there is little interaction between them due to the use of disparate benchmark datasets and metrics (e. g. J&F, mAP, sMOTSA).
Ranked #4 on Long-tail Video Object Segmentation on BURST-val (using extra training data)
Long-tail Video Object Segmentation Multi-Object Tracking +6
no code implementations • 7 Aug 2022 • Dan Jia, Alexander Hermans, Bastian Leibe
For the 3D object detection task, GHA improves the CenterPoint baseline by +0. 5% mAP on the nuScenes dataset, and the 3DETR baseline by +2. 1% mAP25 and +1. 5% mAP50 on ScanNet.
1 code implementation • 3 Aug 2022 • Diego Paez-Granados, Yujie He, David Gonon, Dan Jia, Bastian Leibe, Kenji Suzuki, Aude Billard
Autonomous navigation in highly populated areas remains a challenging task for robots because of the difficulty in guaranteeing safe interactions with pedestrians in unstructured situations.
1 code implementation • 1 Jun 2022 • Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe
Recently, "Masked Attention" was proposed in which a given object representation only attends to those image pixel features for which the segmentation mask of that object is active.
no code implementations • CVPR 2022 • Yang Liu, Idil Esen Zulfikar, Jonathon Luiten, Achal Dave, Deva Ramanan, Bastian Leibe, Aljoša Ošep, Laura Leal-Taixé
A benchmark that would allow us to perform an apple-to-apple comparison of existing efforts is a crucial first step towards advancing this important research field.
Ranked #3 on Open-World Video Segmentation on BURST-val (using extra training data)
1 code implementation • CVPR 2022 • Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe
Existing state-of-the-art methods for Video Object Segmentation (VOS) learn low-level pixel-to-pixel correspondences between frames to propagate object masks across video.
2 code implementations • WACV 2021 • Christian Schmidt, Ali Athar, Sabarinath Mahadevan, Bastian Leibe
We further show that D2Conv3D out-performs trivial extensions of existing dilated and deformable convolutions to 3D.
1 code implementation • 15 Nov 2021 • Christian Schmidt, Ali Athar, Sabarinath Mahadevan, Bastian Leibe
We further show that D^2Conv3D out-performs trivial extensions of existing dilated and deformable convolutions to 3D.
3 code implementations • 5 Oct 2021 • Alexey Nekrasov, Jonas Schult, Or Litany, Bastian Leibe, Francis Engelmann
Since scene context helps reasoning about object semantics, current works focus on models with large capacity and receptive fields that can fully capture the global context of an input 3D scene.
Ranked #3 on Semantic Segmentation on ScanNet
1 code implementation • 3 Jul 2021 • Dan Jia, Bastian Leibe
In this preliminary work we attempt to apply submanifold sparse convolution to the task of 3D person detection.
no code implementations • 21 Jun 2021 • Dan Jia, Alexander Hermans, Bastian Leibe
Person detection is a crucial task for mobile robots navigating in human-populated environments.
no code implementations • 22 Apr 2021 • Yang Liu, Idil Esen Zulfikar, Jonathon Luiten, Achal Dave, Deva Ramanan, Bastian Leibe, Aljoša Ošep, Laura Leal-Taixé
We hope to open a new front in multi-object tracking research that will hopefully bring us a step closer to intelligent systems that can operate safely in the real world.
1 code implementation • 23 Feb 2021 • Mark Weber, Jun Xie, Maxwell Collins, Yukun Zhu, Paul Voigtlaender, Hartwig Adam, Bradley Green, Andreas Geiger, Bastian Leibe, Daniel Cremers, Aljoša Ošep, Laura Leal-Taixé, Liang-Chieh Chen
The task of assigning semantic classes and track identities to every pixel in a video is called video panoptic segmentation.
no code implementations • CVPR 2021 • Francis Engelmann, Konstantinos Rematas, Bastian Leibe, Vittorio Ferrari
We propose a method to detect and reconstruct multiple 3D objects from a single RGB image.
1 code implementation • 16 Dec 2020 • Dan Jia, Mats Steinweg, Alexander Hermans, Bastian Leibe
Through experiments on the JackRabbot dataset with two detector models, DROW3 and DR-SPAAM, we show that self-supervised detectors, trained or fine-tuned with pseudo-labels, outperform detectors trained only on a different dataset.
no code implementations • 2 Nov 2020 • Paul Voigtlaender, Lishu Luo, Chun Yuan, Yong Jiang, Bastian Leibe
We use a deep convolutional network to automatically create pseudo-labels on a pixel level from much cheaper bounding box annotations and investigate how far such pseudo-labels can carry us for training state-of-the-art VOS approaches.
5 code implementations • 16 Sep 2020 • Jonathon Luiten, Aljosa Osep, Patrick Dendorfer, Philip Torr, Andreas Geiger, Laura Leal-Taixe, Bastian Leibe
Multi-Object Tracking (MOT) has been notoriously difficult to evaluate.
1 code implementation • 26 Aug 2020 • Sabarinath Mahadevan, Ali Athar, Aljoša Ošep, Sebastian Hennen, Laura Leal-Taixé, Bastian Leibe
On the other hand, 3D convolutional networks have been successfully applied for video classification tasks, but have not been leveraged as effectively to problems involving dense per-pixel interpretation of videos compared to their 2D convolutional counterparts and lag behind the aforementioned networks in terms of performance.
Ranked #14 on Unsupervised Video Object Segmentation on DAVIS 2016 val
1 code implementation • 12 Jul 2020 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe
Heatmap representations have formed the basis of human pose estimation systems for many years, and their extension to 3D has been a fruitful line of recent research.
Ranked #1 on 3D Human Pose Estimation on 3D Poses in the Wild Challenge (MPJPE metric)
1 code implementation • 8 Jun 2020 • Markus Knoche, István Sárándi, Bastian Leibe
We address the problem of reposing an image of a human into any desired novel pose.
1 code implementation • 2 May 2020 • Francis Engelmann, Jörg Stückler, Bastian Leibe
In this paper, we propose to use 3D shape and motion priors to regularize the estimation of the trajectory and the shape of vehicles in sequences of stereo images.
2 code implementations • 29 Apr 2020 • Dan Jia, Alexander Hermans, Bastian Leibe
Detecting persons using a 2D LiDAR is a challenging task due to the low information content of 2D range data.
no code implementations • ECCV 2020 • Umer Rafi, Andreas Doering, Bastian Leibe, Juergen Gall
Instead of training the network for estimating keypoint correspondences on video data, it is trained on a large scale image datasets for human pose estimation using self-supervision.
Multi-Person Pose Estimation Multi-Person Pose Estimation and Tracking +1
1 code implementation • CVPR 2020 • Jonas Schult, Francis Engelmann, Theodora Kontogianni, Bastian Leibe
That is, the convolutional kernel weights are mapped to the local surface of a given mesh.
1 code implementation • 30 Mar 2020 • Francis Engelmann, Martin Bokeloh, Alireza Fathi, Bastian Leibe, Matthias Nießner
We show that grouping proposals improves over NMS and outperforms previous state-of-the-art methods on the tasks of 3D object detection and semantic instance segmentation on the ScanNetV2 benchmark and the S3DIS dataset.
Ranked #1 on 3D Semantic Instance Segmentation on ScanNetV2
1 code implementation • ECCV 2020 • Ali Athar, Sabarinath Mahadevan, Aljoša Ošep, Laura Leal-Taixé, Bastian Leibe
In this paper, we propose a different approach that is well-suited to a variety of tasks involving instance segmentation in videos.
Ranked #5 on Unsupervised Video Object Segmentation on DAVIS 2017 (val) (using extra training data)
1 code implementation • 5 Mar 2020 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe
Furthermore, as the image space is decoupled from the heatmap space, the network can learn to reason about joints beyond the image boundary.
Ranked #160 on 3D Human Pose Estimation on Human3.6M
1 code implementation • 15 Jan 2020 • Jonathon Luiten, Idil Esen Zulfikar, Bastian Leibe
UnOVOST even performs competitively with many semi-supervised video object segmentation algorithms even though it is not given any input as to which objects should be tracked and segmented.
1 code implementation • CVPR 2020 • Paul Voigtlaender, Jonathon Luiten, Philip H. S. Torr, Bastian Leibe
We present Siam R-CNN, a Siamese re-detection architecture which unleashes the full power of two-stage object detection approaches for visual object tracking.
Ranked #5 on Object Tracking on COESOT
no code implementations • 2 Nov 2019 • Mark Weber, Jonathon Luiten, Bastian Leibe
We present a novel end-to-end single-shot method that segments countable object instances (things) as well as background regions (stuff) into a non-overlapping panoptic segmentation at almost video frame rate.
1 code implementation • 10 Oct 2019 • Johannes Groß, Aljosa Osep, Bastian Leibe
In this work, we focus on precise 3D track state estimation and propose a learning-based approach for object-centric relative motion estimation of partially observed objects.
1 code implementation • 30 Sep 2019 • Jonathon Luiten, Tobias Fischer, Bastian Leibe
Object tracking and 3D reconstruction are often performed together, with tracking used as input for reconstruction.
1 code implementation • 28 Jul 2019 • Francis Engelmann, Theodora Kontogianni, Bastian Leibe
In a thorough ablation study, we show that the receptive field size is directly related to the performance of 3D point cloud processing tasks, including semantic segmentation and object classification.
Ranked #43 on Semantic Segmentation on S3DIS Area5
no code implementations • 7 Jun 2019 • Kilian Pfeiffer, Alexander Hermans, István Sárándi, Mark Weber, Bastian Leibe
We address the problem of learning a single model for person re-identification, attribute classification, body part segmentation, and pose estimation.
no code implementations • 9 Apr 2019 • Paul Voigtlaender, Jonathon Luiten, Bastian Leibe
Following this paradigm, we present BoLTVOS (Box-Level Tracking for VOS), which consists of an R-CNN detector conditioned on the first-frame bounding box to detect the object of interest, a temporal consistency rescoring algorithm, and a Box2Seg network that converts bounding boxes to segmentation masks.
no code implementations • 3 Apr 2019 • Cathrin Elich, Francis Engelmann, Theodora Kontogianni, Bastian Leibe
A lot of progress was made in the field of object classification and semantic segmentation.
Ranked #4 on 3D Semantic Instance Segmentation on ScanNetV2
3D Instance Segmentation 3D Semantic Instance Segmentation +4
no code implementations • 28 Feb 2019 • Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe
This paper addresses the problem of object discovery from unlabeled driving videos captured in a realistic automotive setting.
3 code implementations • CVPR 2019 • Paul Voigtlaender, Yuning Chai, Florian Schroff, Hartwig Adam, Bastian Leibe, Liang-Chieh Chen
Many of the recent successful methods for video object segmentation (VOS) are overly complicated, heavily rely on fine-tuning on the first frame, and/or are slow, and are hence of limited practical use.
Ranked #1 on Semi-Supervised Video Object Segmentation on YouTube
no code implementations • CVPR 2019 • Paul Voigtlaender, Michael Krause, Aljosa Osep, Jonathon Luiten, Berin Balachandar Gnana Sekar, Andreas Geiger, Bastian Leibe
This paper extends the popular task of multi-object tracking to multi-object tracking and segmentation (MOTS).
Ranked #6 on Multi-Object Tracking on MOTS20
Multi-Object Tracking Multi-Object Tracking and Segmentation +3
1 code implementation • 26 Jan 2019 • Aljosa Osep, Paul Voigtlaender, Mark Weber, Jonathon Luiten, Bastian Leibe
Many high-level video understanding methods require input in the form of object proposals.
no code implementations • 2 Oct 2018 • Francis Engelmann, Theodora Kontogianni, Jonas Schult, Bastian Leibe
In this paper, we present a deep learning architecture which addresses the problem of 3D semantic segmentation of unstructured point clouds.
no code implementations • 19 Sep 2018 • Aljosa Osep, Wolfgang Mehner, Markus Mathias, Bastian Leibe
Most of the current vision-based tracking methods perform tracking in the image domain.
Ranked #22 on Multiple Object Tracking on KITTI Tracking test
no code implementations • 19 Sep 2018 • Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe
We propose to leverage a generic object tracker in order to perform object mining in large-scale unlabeled videos, captured in a realistic automotive setting.
1 code implementation • 13 Sep 2018 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe
In this paper we present our winning entry at the 2018 ECCV PoseTrack Challenge on 3D human pose estimation.
Ranked #223 on 3D Human Pose Estimation on Human3.6M
1 code implementation • 28 Aug 2018 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe
Occlusion is commonplace in realistic human-robot shared environments, yet its effects are not considered in standard 3D human pose estimation benchmarks.
Ranked #234 on 3D Human Pose Estimation on Human3.6M
5 code implementations • 24 Jul 2018 • Jonathon Luiten, Paul Voigtlaender, Bastian Leibe
We address semi-supervised video object segmentation, the task of automatically generating accurate and consistent pixel masks for objects in a video sequence, given the first-frame ground truth annotations.
1 code implementation • 11 May 2018 • Sabarinath Mahadevan, Paul Voigtlaender, Bastian Leibe
Deep learning requires large amounts of training data to be effective.
1 code implementation • 26 Apr 2018 • Stefan Breuers, Lucas Beyer, Umer Rafi, Bastian Leibe
In the past decade many robots were deployed in the wild, and people detection and tracking is an important component of such deployments.
1 code implementation • 6 Apr 2018 • Lucas Beyer, Alexander Hermans, Timm Linder, Kai O. Arras, Bastian Leibe
Detecting humans is a key skill for mobile robots and intelligent vehicles in a large variety of applications.
1 code implementation • 5 Feb 2018 • Francis Engelmann, Theodora Kontogianni, Alexander Hermans, Bastian Leibe
The recently proposed PointNet architecture presents an interesting step ahead in that it can operate on unstructured point clouds, achieving encouraging segmentation results.
1 code implementation • 23 Dec 2017 • Aljoša Ošep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe
We explore object discovery and detector adaptation based on unlabeled video sequences captured from a mobile platform.
no code implementations • 21 Dec 2017 • Aljoša Ošep, Wolfgang Mehner, Paul Voigtlaender, Bastian Leibe
In this paper, we propose a model-free multi-object tracking approach that uses a category-agnostic image segmentation method to track objects.
no code implementations • 28 Jun 2017 • Paul Voigtlaender, Bastian Leibe
We tackle the task of semi-supervised video object segmentation, i. e. segmenting the pixels belonging to an object in the video using the ground truth pixel mask for the first frame.
Ranked #2 on Visual Object Tracking on YouTube-VOS 2018
2 code implementations • 31 May 2017 • Vitaly Kurin, Sebastian Nowozin, Katja Hofmann, Lucas Beyer, Bastian Leibe
Recent progress in Reinforcement Learning (RL), fueled by its combination, with Deep Learning has enabled impressive results in learning to interact with complex virtual environments, yet real-world applications of RL are still scarce.
2 code implementations • 12 May 2017 • Lucas Beyer, Stefan Breuers, Vitaly Kurin, Bastian Leibe
With the rise of end-to-end learning through deep learning, person detectors and re-identification (ReID) models have recently become very strong.
31 code implementations • 22 Mar 2017 • Alexander Hermans, Lucas Beyer, Bastian Leibe
In the past few years, the field of computer vision has gone through a revolution fueled mainly by the advent of large datasets and the adoption of deep convolutional neural networks for end-to-end learning.
Ranked #3 on Person Re-Identification on CUHK03 (Rank-5 metric)
no code implementations • CVPR 2017 • Yevhen Kuznietsov, Jörg Stückler, Bastian Leibe
Supervised deep learning often suffers from the lack of sufficient training data.
no code implementations • 7 Feb 2017 • Anton Kasyanov, Francis Engelmann, Jörg Stückler, Bastian Leibe
Our visual-inertial SLAM system is based on a real-time capable visual-inertial odometry method that provides locally consistent trajectory and map estimates.
2 code implementations • 6 Dec 2016 • David Stutz, Alexander Hermans, Bastian Leibe
As such, and due to their quick adoption in a wide range of applications, appropriate benchmarks are crucial for algorithm selection and comparison.
4 code implementations • CVPR 2017 • Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe
Therefore, additional processing steps have to be performed in order to obtain pixel-accurate segmentation masks at the full image resolution.
Ranked #25 on Real-Time Semantic Segmentation on Cityscapes test
no code implementations • CVPR 2016 • Theodora Kontogianni, Markus Mathias, Bastian Leibe
Abstract In this paper, we address the problem of object discovery in time-varying, large-scale image collections.
no code implementations • 8 Mar 2016 • Lucas Beyer, Alexander Hermans, Bastian Leibe
We propose a Convolutional Neural Network (CNN) based detector for this task.
no code implementations • 18 Sep 2014 • Tobias Weyand, Bastian Leibe
We evaluate how different choices of methods and parameters for the individual pipeline steps affect overall system performance and examine their effects for different query categories such as buildings, paintings or sculptures.
no code implementations • CVPR 2014 • Ilya Kostrikov, Esther Horbert, Bastian Leibe
In this paper, we propose a novel labeling cost for multi- view reconstruction.
no code implementations • CVPR 2013 • Tobias Baumgartner, Dennis Mitzel, Bastian Leibe
Current pedestrian tracking approaches ignore important aspects of human behavior.