Search Results for author: Bastian Leibe

Found 81 papers, 50 papers with code

Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

2 code implementations • 25 Mar 2024 • Zicong Fan, Takehiko Ohkawa, Linlin Yang, Nie Lin, Zhishan Zhou, Shihao Zhou, Jiajun Liang, Zhong Gao, Xuanyang Zhang, Xue Zhang, Fei Li, Liu Zheng, Feng Lu, Karim Abou Zeid, Bastian Leibe, Jeongwan On, Seungryul Baek, Aditya Prakash, Saurabh Gupta, Kun He, Yoichi Sato, Otmar Hilliges, Hyung Jin Chang, Angela Yao

We interact with the world with our hands and see it through our own (egocentric) perspective.

Action Recognition Object +2

257

Paper
Code

An Ordinal Regression Framework for a Deep Learning Based Severity Assessment for Chest Radiographs

1 code implementation • 8 Feb 2024 • Patrick Wienholt, Alexander Hermans, Firas Khader, Behrus Puladi, Bastian Leibe, Christiane Kuhl, Sven Nebelung, Daniel Truhn

This study investigates the application of ordinal regression methods for categorizing disease severity in chest radiographs.

regression

Paper
Code

Point-VOS: Pointing Up Video Object Segmentation

no code implementations • 8 Feb 2024 • Idil Esen Zulfikar, Sabarinath Mahadevan, Paul Voigtlaender, Bastian Leibe

We propose a novel Point-VOS task with a spatio-temporally sparse point-wise annotation scheme that substantially reduces the annotation effort.

Object Semantic Segmentation +3

Paper
Add Code

Cyto R-CNN and CytoNuke Dataset: Towards reliable whole-cell segmentation in bright-field histological images

1 code implementation • 28 Jan 2024 • Johannes Raufeisen, Kunpeng Xie, Fabian Hörst, Till Braunschweig, Jianning Li, Jens Kleesiek, Rainer Röhrig, Jan Egger, Bastian Leibe, Frank Hölzle, Alexander Hermans, Behrus Puladi

Utilizing this dataset, we compared the performance of Cyto R-CNN to other popular cell segmentation algorithms, including QuPath's built-in algorithm, StarDist and Cellpose.

Cell Segmentation Segmentation

Paper
Code

ControlRoom3D: Room Generation using Semantic Proxy Rooms

no code implementations • 8 Dec 2023 • Jonas Schult, Sam Tsai, Lukas Höllein, Bichen Wu, Jialiang Wang, Chih-Yao Ma, Kunpeng Li, Xiaofang Wang, Felix Wimbauer, Zijian He, Peizhao Zhang, Bastian Leibe, Peter Vajda, Ji Hou

Central to our approach is a user-defined 3D semantic proxy room that outlines a rough room layout based on semantic bounding boxes and a textual description of the overall room style.

Paper
Add Code

Mask4Former: Mask Transformer for 4D Panoptic Segmentation

1 code implementation • 28 Sep 2023 • Kadir Yilmaz, Jonas Schult, Alexey Nekrasov, Bastian Leibe

With this intention, we propose Mask4Former for the challenging task of 4D panoptic segmentation of LiDAR point clouds.

Ranked #1 on 4D Panoptic Segmentation on SemanticKITTI

4D Panoptic Segmentation Instance Segmentation +2

Paper
Code

Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis

no code implementations • 18 Aug 2023 • Jonathon Luiten, Georgios Kopanas, Bastian Leibe, Deva Ramanan

We present a method that simultaneously addresses the tasks of dynamic scene novel-view synthesis and six degree-of-freedom (6-DOF) tracking of all dense scene elements.

Dynamic Reconstruction Novel View Synthesis +1

Paper
Add Code

UGainS: Uncertainty Guided Anomaly Instance Segmentation

1 code implementation • 3 Aug 2023 • Alexey Nekrasov, Alexander Hermans, Lars Kuhnert, Bastian Leibe

Our approach centers on an out-of-distribution segmentation model for identifying uncertain regions and a strong generalist segmentation model for anomaly instances segmentation.

Autonomous Driving Instance Segmentation +2

Paper
Code

AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation

no code implementations • 1 Jun 2023 • Yuanwen Yue, Sabarinath Mahadevan, Jonas Schult, Francis Engelmann, Bastian Leibe, Konrad Schindler, Theodora Kontogianni

In an iterative process, the model assigns each data point to an object (or the background), while the user corrects errors in the resulting segmentation and feeds them back into the model.

Binary Classification Interactive Segmentation +2

Paper
Add Code

DynaMITe: Dynamic Query Bootstrapping for Multi-object Interactive Segmentation Transformer

no code implementations • ICCV 2023 • Amit Kumar Rana, Sabarinath Mahadevan, Alexander Hermans, Bastian Leibe

We introduce a more efficient approach, called DynaMITe, in which we represent user interactions as spatio-temporal queries to a Transformer decoder with a potential to segment multiple object instances in a single iteration.

Decoder Instance Segmentation +3

Paper
Add Code

Point2Vec for Self-Supervised Representation Learning on Point Clouds

1 code implementation • 29 Mar 2023 • Karim Abou Zeid, Jonas Schult, Alexander Hermans, Bastian Leibe

Recently, the self-supervised learning framework data2vec has shown inspiring performance for various modalities using a masked student-teacher approach.

Ranked #3 on Few-Shot 3D Point Cloud Classification on ModelNet40 10-way (20-shot) (using extra training data)

3D Part Segmentation Few-Shot 3D Point Cloud Classification +3

Paper
Code

TarViS: A Unified Approach for Target-based Video Segmentation

1 code implementation • CVPR 2023 • Ali Athar, Alexander Hermans, Jonathon Luiten, Deva Ramanan, Bastian Leibe

A single TarViS model can be trained jointly on a collection of datasets spanning different tasks, and can hot-swap between tasks during inference without any task-specific retraining.

Ranked #2 on Video Panoptic Segmentation on KITTI-STEP (using extra training data)

Instance Segmentation Segmentation +4

Paper
Code

Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats

1 code implementation • 29 Dec 2022 • István Sárándi, Alexander Hermans, Bastian Leibe

Our approach scales to an extreme multi-dataset regime, where we use 28 3D human pose datasets to supervise one model, which outperforms prior work on a range of benchmarks, including the challenging 3D Poses in the Wild (3DPW) dataset.

3D Human Pose Estimation Dimensionality Reduction

424

Paper
Code

3D Segmentation of Humans in Point Clouds with Synthetic Data

no code implementations • ICCV 2023 • Ayça Takmaz, Jonas Schult, Irem Kaftan, Mertcan Akçay, Bastian Leibe, Robert Sumner, Francis Engelmann, Siyu Tang

We address this challenge and propose a framework for generating training data of synthetic humans interacting with real 3D scenes.

Instance Segmentation Segmentation +2

Paper
Add Code

Mask3D: Mask Transformer for 3D Semantic Instance Segmentation

1 code implementation • 6 Oct 2022 • Jonas Schult, Francis Engelmann, Alexander Hermans, Or Litany, Siyu Tang, Bastian Leibe

Modern 3D semantic instance segmentation approaches predominantly rely on specialized voting mechanisms followed by carefully designed geometric clustering techniques.

Ranked #1 on 3D Instance Segmentation on STPLS3D

3D Instance Segmentation 3D Semantic Instance Segmentation +1

481

Paper
Code

4D-StOP: Panoptic Segmentation of 4D LiDAR using Spatio-temporal Object Proposal Generation and Aggregation

1 code implementation • 29 Sep 2022 • Lars Kreuzberg, Idil Esen Zulfikar, Sabarinath Mahadevan, Francis Engelmann, Bastian Leibe

Our voting-based tracklet generation method followed by geometric feature-based aggregation generates significantly improved panoptic LiDAR segmentation quality when compared to modeling the entire 4D volume using Gaussian probability distributions.

Ranked #4 on 4D Panoptic Segmentation on SemanticKITTI

4D Panoptic Segmentation Object Proposal Generation +1

Paper
Code

BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video

1 code implementation • 25 Sep 2022 • Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan

Multiple existing benchmarks involve tracking and segmenting objects in video e. g., Video Object Segmentation (VOS) and Multi-Object Tracking and Segmentation (MOTS), but there is little interaction between them due to the use of disparate benchmark datasets and metrics (e. g. J&F, mAP, sMOTSA).

Ranked #4 on Long-tail Video Object Segmentation on BURST-val (using extra training data)

Long-tail Video Object Segmentation Multi-Object Tracking +6

Paper
Code

Global Hierarchical Attention for 3D Point Cloud Analysis

no code implementations • 7 Aug 2022 • Dan Jia, Alexander Hermans, Bastian Leibe

For the 3D object detection task, GHA improves the CenterPoint baseline by +0. 5% mAP on the nuScenes dataset, and the 3DETR baseline by +2. 1% mAP25 and +1. 5% mAP50 on ScanNet.

3D Object Detection Inductive Bias +2

Paper
Add Code

Pedestrian-Robot Interactions on Autonomous Crowd Navigation: Reactive Control Methods and Evaluation Metrics

1 code implementation • 3 Aug 2022 • Diego Paez-Granados, Yujie He, David Gonon, Dan Jia, Bastian Leibe, Kenji Suzuki, Aude Billard

Autonomous navigation in highly populated areas remains a challenging task for robots because of the difficulty in guaranteeing safe interactions with pedestrians in unstructured situations.

Autonomous Navigation

Paper
Code

Differentiable Soft-Masked Attention

1 code implementation • 1 Jun 2022 • Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe

Recently, "Masked Attention" was proposed in which a given object representation only attends to those image pixel features for which the segmentation mask of that object is active.

Object Segmentation +4

Paper
Code

Opening Up Open World Tracking

no code implementations • CVPR 2022 • Yang Liu, Idil Esen Zulfikar, Jonathon Luiten, Achal Dave, Deva Ramanan, Bastian Leibe, Aljoša Ošep, Laura Leal-Taixé

A benchmark that would allow us to perform an apple-to-apple comparison of existing efforts is a crucial first step towards advancing this important research field.

Ranked #3 on Open-World Video Segmentation on BURST-val (using extra training data)

Multi-Object Tracking Object +1

Paper
Add Code

HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images

1 code implementation • CVPR 2022 • Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe

Existing state-of-the-art methods for Video Object Segmentation (VOS) learn low-level pixel-to-pixel correspondences between frames to propagate object masks across video.

Object Semantic Segmentation +2

Paper
Code

D2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos

2 code implementations • WACV 2021 • Christian Schmidt, Ali Athar, Sabarinath Mahadevan, Bastian Leibe

We further show that D2Conv3D out-performs trivial extensions of existing dilated and deformable convolutions to 3D.

Ranked #7 on Unsupervised Video Object Segmentation on DAVIS 2016 val

Multi-Object Tracking and Segmentation Segmentation +5

Paper
Code

D^2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos

1 code implementation • 15 Nov 2021 • Christian Schmidt, Ali Athar, Sabarinath Mahadevan, Bastian Leibe

We further show that D^2Conv3D out-performs trivial extensions of existing dilated and deformable convolutions to 3D.

Segmentation Semantic Segmentation +3

Paper
Code

Mix3D: Out-of-Context Data Augmentation for 3D Scenes

3 code implementations • 5 Oct 2021 • Alexey Nekrasov, Jonas Schult, Or Litany, Bastian Leibe, Francis Engelmann

Since scene context helps reasoning about object semantics, current works focus on models with large capacity and receptive fields that can fully capture the global context of an input 3D scene.

Ranked #3 on Semantic Segmentation on ScanNet

3D Semantic Segmentation

1,223

Paper
Code

Person-MinkUNet: 3D Person Detection with LiDAR Point Cloud

1 code implementation • 3 Jul 2021 • Dan Jia, Bastian Leibe

In this preliminary work we attempt to apply submanifold sparse convolution to the task of 3D person detection.

Human Detection

Paper
Code

2D vs. 3D LiDAR-based Person Detection on Mobile Robots

no code implementations • 21 Jun 2021 • Dan Jia, Alexander Hermans, Bastian Leibe

Person detection is a crucial task for mobile robots navigating in human-populated environments.

Human Detection

Paper
Add Code

Opening up Open-World Tracking

no code implementations • 22 Apr 2021 • Yang Liu, Idil Esen Zulfikar, Jonathon Luiten, Achal Dave, Deva Ramanan, Bastian Leibe, Aljoša Ošep, Laura Leal-Taixé

We hope to open a new front in multi-object tracking research that will hopefully bring us a step closer to intelligent systems that can operate safely in the real world.

Multi-Object Tracking Object

Paper
Add Code

STEP: Segmenting and Tracking Every Pixel

1 code implementation • 23 Feb 2021 • Mark Weber, Jun Xie, Maxwell Collins, Yukun Zhu, Paul Voigtlaender, Hartwig Adam, Bradley Green, Andreas Geiger, Bastian Leibe, Daniel Cremers, Aljoša Ošep, Laura Leal-Taixé, Liang-Chieh Chen

The task of assigning semantic classes and track identities to every pixel in a video is called video panoptic segmentation.

Segmentation Video Panoptic Segmentation

991

Paper
Code

From Points to Multi-Object 3D Reconstruction

no code implementations • CVPR 2021 • Francis Engelmann, Konstantinos Rematas, Bastian Leibe, Vittorio Ferrari

We propose a method to detect and reconstruct multiple 3D objects from a single RGB image.

3D Reconstruction 3D Shape Reconstruction +1

Paper
Add Code

Self-Supervised Person Detection in 2D Range Data using a Calibrated Camera

1 code implementation • 16 Dec 2020 • Dan Jia, Mats Steinweg, Alexander Hermans, Bastian Leibe

Through experiments on the JackRabbot dataset with two detector models, DROW3 and DR-SPAAM, we show that self-supervised detectors, trained or fine-tuned with pseudo-labels, outperform detectors trained only on a different dataset.

Human Detection

155

Paper
Code

Reducing the Annotation Effort for Video Object Segmentation Datasets

no code implementations • 2 Nov 2020 • Paul Voigtlaender, Lishu Luo, Chun Yuan, Yong Jiang, Bastian Leibe

We use a deep convolutional network to automatically create pseudo-labels on a pixel level from much cheaper bounding box annotations and investigate how far such pseudo-labels can carry us for training state-of-the-art VOS approaches.

Object Semantic Segmentation +2

Paper
Add Code

HOTA: A Higher Order Metric for Evaluating Multi-Object Tracking

5 code implementations • 16 Sep 2020 • Jonathon Luiten, Aljosa Osep, Patrick Dendorfer, Philip Torr, Andreas Geiger, Laura Leal-Taixe, Bastian Leibe

Multi-Object Tracking (MOT) has been notoriously difficult to evaluate.

Multi-Object Tracking

894

Paper
Code

Making a Case for 3D Convolutions for Object Segmentation in Videos

1 code implementation • 26 Aug 2020 • Sabarinath Mahadevan, Ali Athar, Aljoša Ošep, Sebastian Hennen, Laura Leal-Taixé, Bastian Leibe

On the other hand, 3D convolutional networks have been successfully applied for video classification tasks, but have not been leveraged as effectively to problems involving dense per-pixel interpretation of videos compared to their 2D convolutional counterparts and lag behind the aforementioned networks in terms of performance.

Ranked #14 on Unsupervised Video Object Segmentation on DAVIS 2016 val

Decoder Segmentation +6

Paper
Code

MeTRAbs: Metric-Scale Truncation-Robust Heatmaps for Absolute 3D Human Pose Estimation

1 code implementation • 12 Jul 2020 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe

Heatmap representations have formed the basis of human pose estimation systems for many years, and their extension to 3D has been a fruitful line of recent research.

Ranked #1 on 3D Human Pose Estimation on 3D Poses in the Wild Challenge (MPJPE metric)

3D Absolute Human Pose Estimation

424

Paper
Code

Reposing Humans by Warping 3D Features

1 code implementation • 8 Jun 2020 • Markus Knoche, István Sárándi, Bastian Leibe

We address the problem of reposing an image of a human into any desired novel pose.

Conditional Image Generation Decoder +2

Paper
Code

SAMP: Shape and Motion Priors for 4D Vehicle Reconstruction

1 code implementation • 2 May 2020 • Francis Engelmann, Jörg Stückler, Bastian Leibe

In this paper, we propose to use 3D shape and motion priors to regularize the estimation of the trajectory and the shape of vehicles in sequences of stereo images.

Pose Estimation

Paper
Code

DR-SPAAM: A Spatial-Attention and Auto-regressive Model for Person Detection in 2D Range Data

2 code implementations • 29 Apr 2020 • Dan Jia, Alexander Hermans, Bastian Leibe

Detecting persons using a 2D LiDAR is a challenging task due to the low information content of 2D range data.

Human Detection

155

Paper
Code

Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos

no code implementations • ECCV 2020 • Umer Rafi, Andreas Doering, Bastian Leibe, Juergen Gall

Instead of training the network for estimating keypoint correspondences on video data, it is trained on a large scale image datasets for human pose estimation using self-supervision.

Multi-Person Pose Estimation Multi-Person Pose Estimation and Tracking +1

Paper
Add Code

DualConvMesh-Net: Joint Geodesic and Euclidean Convolutions on 3D Meshes

1 code implementation • CVPR 2020 • Jonas Schult, Francis Engelmann, Theodora Kontogianni, Bastian Leibe

That is, the convolutional kernel weights are mapped to the local surface of a given mesh.

3D Semantic Segmentation Scene Segmentation

117

Paper
Code

3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance Segmentation

1 code implementation • 30 Mar 2020 • Francis Engelmann, Martin Bokeloh, Alireza Fathi, Bastian Leibe, Matthias Nießner

We show that grouping proposals improves over NMS and outperforms previous state-of-the-art methods on the tasks of 3D object detection and semantic instance segmentation on the ScanNetV2 benchmark and the S3DIS dataset.

Ranked #1 on 3D Semantic Instance Segmentation on ScanNetV2

3D Instance Segmentation 3D Object Detection +3

Paper
Code

STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos

1 code implementation • ECCV 2020 • Ali Athar, Sabarinath Mahadevan, Aljoša Ošep, Laura Leal-Taixé, Bastian Leibe

In this paper, we propose a different approach that is well-suited to a variety of tasks involving instance segmentation in videos.

Ranked #5 on Unsupervised Video Object Segmentation on DAVIS 2017 (val) (using extra training data)

Instance Segmentation Semantic Segmentation +2

153

Paper
Code

Metric-Scale Truncation-Robust Heatmaps for 3D Human Pose Estimation

1 code implementation • 5 Mar 2020 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe

Furthermore, as the image space is decoupled from the heatmap space, the network can learn to reason about joints beyond the image boundary.

Ranked #160 on 3D Human Pose Estimation on Human3.6M

2D Human Pose Estimation 3D Human Pose Estimation +1

Paper
Code

UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking

1 code implementation • 15 Jan 2020 • Jonathon Luiten, Idil Esen Zulfikar, Bastian Leibe

UnOVOST even performs competitively with many semi-supervised video object segmentation algorithms even though it is not given any input as to which objects should be tracked and segmented.

Ranked #2 on Unsupervised Video Object Segmentation on DAVIS 2017 (test-dev)

Object Segmentation +4

Paper
Code

Siam R-CNN: Visual Tracking by Re-Detection

1 code implementation • CVPR 2020 • Paul Voigtlaender, Jonathon Luiten, Philip H. S. Torr, Bastian Leibe

We present Siam R-CNN, a Siamese re-detection architecture which unleashes the full power of two-stage object detection approaches for visual object tracking.

Ranked #5 on Object Tracking on COESOT

Object object-detection +4

217

Paper
Code

Single-Shot Panoptic Segmentation

no code implementations • 2 Nov 2019 • Mark Weber, Jonathon Luiten, Bastian Leibe

We present a novel end-to-end single-shot method that segments countable object instances (things) as well as background regions (stuff) into a non-overlapping panoptic segmentation at almost video frame rate.

Decoder Instance Segmentation +5

Paper
Add Code

AlignNet-3D: Fast Point Cloud Registration of Partially Observed Objects

1 code implementation • 10 Oct 2019 • Johannes Groß, Aljosa Osep, Bastian Leibe

In this work, we focus on precise 3D track state estimation and propose a learning-based approach for object-centric relative motion estimation of partially observed objects.

3D Pose Estimation Motion Estimation +2

Paper
Code

Track to Reconstruct and Reconstruct to Track

1 code implementation • 30 Sep 2019 • Jonathon Luiten, Tobias Fischer, Bastian Leibe

Object tracking and 3D reconstruction are often performed together, with tracking used as input for reconstruction.

3D Reconstruction Multi-Object Tracking +3

174

Paper
Code

Dilated Point Convolutions: On the Receptive Field Size of Point Convolutions on 3D Point Clouds

1 code implementation • 28 Jul 2019 • Francis Engelmann, Theodora Kontogianni, Bastian Leibe

In a thorough ablation study, we show that the receptive field size is directly related to the performance of 3D point cloud processing tasks, including semantic segmentation and object classification.

Ranked #43 on Semantic Segmentation on S3DIS Area5

3D Semantic Segmentation

Paper
Code

Visual Person Understanding through Multi-Task and Multi-Dataset Learning

no code implementations • 7 Jun 2019 • Kilian Pfeiffer, Alexander Hermans, István Sárándi, Mark Weber, Bastian Leibe

We address the problem of learning a single model for person re-identification, attribute classification, body part segmentation, and pose estimation.

Attribute General Classification +3

Paper
Add Code

BoLTVOS: Box-Level Tracking for Video Object Segmentation

no code implementations • 9 Apr 2019 • Paul Voigtlaender, Jonathon Luiten, Bastian Leibe

Following this paradigm, we present BoLTVOS (Box-Level Tracking for VOS), which consists of an R-CNN detector conditioned on the first-frame bounding box to detect the object of interest, a temporal consistency rescoring algorithm, and a Box2Seg network that converts bounding boxes to segmentation masks.

Object One-shot visual object segmentation +4

Paper
Add Code

3D-BEVIS: Bird's-Eye-View Instance Segmentation

no code implementations • 3 Apr 2019 • Cathrin Elich, Francis Engelmann, Theodora Kontogianni, Bastian Leibe

A lot of progress was made in the field of object classification and semantic segmentation.

Ranked #4 on 3D Semantic Instance Segmentation on ScanNetV2

3D Instance Segmentation 3D Semantic Instance Segmentation +4

Paper
Add Code

Large-Scale Object Mining for Object Discovery from Unlabeled Video

no code implementations • 28 Feb 2019 • Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe

This paper addresses the problem of object discovery from unlabeled driving videos captured in a realistic automotive setting.

Clustering Object +1

Paper
Add Code

FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation

3 code implementations • CVPR 2019 • Paul Voigtlaender, Yuning Chai, Florian Schroff, Hartwig Adam, Bastian Leibe, Liang-Chieh Chen

Many of the recent successful methods for video object segmentation (VOS) are overly complicated, heavily rely on fine-tuning on the first frame, and/or are slow, and are hence of limited practical use.

Ranked #1 on Semi-Supervised Video Object Segmentation on YouTube

Object Segmentation +3

76,690

Paper
Code

MOTS: Multi-Object Tracking and Segmentation

no code implementations • CVPR 2019 • Paul Voigtlaender, Michael Krause, Aljosa Osep, Jonathon Luiten, Berin Balachandar Gnana Sekar, Andreas Geiger, Bastian Leibe

This paper extends the popular task of multi-object tracking to multi-object tracking and segmentation (MOTS).

Ranked #6 on Multi-Object Tracking on MOTS20

Multi-Object Tracking Multi-Object Tracking and Segmentation +3

Paper
Add Code

4D Generic Video Object Proposals

1 code implementation • 26 Jan 2019 • Aljosa Osep, Paul Voigtlaender, Mark Weber, Jonathon Luiten, Bastian Leibe

Many high-level video understanding methods require input in the form of object proposals.

Instance Segmentation Object +2

Paper
Code

Know What Your Neighbors Do: 3D Semantic Segmentation of Point Clouds

no code implementations • 2 Oct 2018 • Francis Engelmann, Theodora Kontogianni, Jonas Schult, Bastian Leibe

In this paper, we present a deep learning architecture which addresses the problem of 3D semantic segmentation of unstructured point clouds.

3D Semantic Segmentation Segmentation

Paper
Add Code

Combined Image- and World-Space Tracking in Traffic Scenes

no code implementations • 19 Sep 2018 • Aljosa Osep, Wolfgang Mehner, Markus Mathias, Bastian Leibe

Most of the current vision-based tracking methods perform tracking in the image domain.

Ranked #22 on Multiple Object Tracking on KITTI Tracking test

Multiple Object Tracking Self-Driving Cars

Paper
Add Code

Towards Large-Scale Video Video Object Mining

no code implementations • 19 Sep 2018 • Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe

We propose to leverage a generic object tracker in order to perform object mining in large-scale unlabeled videos, captured in a realistic automotive setting.

Object

Paper
Add Code

Synthetic Occlusion Augmentation with Volumetric Heatmaps for the 2018 ECCV PoseTrack Challenge on 3D Human Pose Estimation

1 code implementation • 13 Sep 2018 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe

In this paper we present our winning entry at the 2018 ECCV PoseTrack Challenge on 3D human pose estimation.

Ranked #223 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation Data Augmentation

114

Paper
Code

How Robust is 3D Human Pose Estimation to Occlusion?

1 code implementation • 28 Aug 2018 • István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe

Occlusion is commonplace in realistic human-robot shared environments, yet its effects are not considered in standard 3D human pose estimation benchmarks.

Ranked #234 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation 3D Pose Estimation +1

114

Paper
Code

PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation

5 code implementations • 24 Jul 2018 • Jonathon Luiten, Paul Voigtlaender, Bastian Leibe

We address semi-supervised video object segmentation, the task of automatically generating accurate and consistent pixel masks for objects in a video sequence, given the first-frame ground truth annotations.

Ranked #39 on Semi-Supervised Video Object Segmentation on DAVIS 2017 (test-dev)

Object One-shot visual object segmentation +3

129

Paper
Code

Iteratively Trained Interactive Segmentation

1 code implementation • 11 May 2018 • Sabarinath Mahadevan, Paul Voigtlaender, Bastian Leibe

Deep learning requires large amounts of training data to be effective.

Interactive Segmentation Object +2

Paper
Code

Detection-Tracking for Efficient Person Analysis: The DetTA Pipeline

1 code implementation • 26 Apr 2018 • Stefan Breuers, Lucas Beyer, Umer Rafi, Bastian Leibe

In the past decade many robots were deployed in the wild, and people detection and tracking is an important component of such deployments.

Attribute

Paper
Code

Deep Person Detection in 2D Range Data

1 code implementation • 6 Apr 2018 • Lucas Beyer, Alexander Hermans, Timm Linder, Kai O. Arras, Bastian Leibe

Detecting humans is a key skill for mobile robots and intelligent vehicles in a large variety of applications.

Human Detection

155

Paper
Code

Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds

1 code implementation • 5 Feb 2018 • Francis Engelmann, Theodora Kontogianni, Alexander Hermans, Bastian Leibe

The recently proposed PointNet architecture presents an interesting step ahead in that it can operate on unstructured point clouds, achieving encouraging segmentation results.

3D Semantic Segmentation Segmentation

Paper
Code

Large-Scale Object Discovery and Detector Adaptation from Unlabeled Video

1 code implementation • 23 Dec 2017 • Aljoša Ošep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe

We explore object discovery and detector adaptation based on unlabeled video sequences captured from a mobile platform.

Autonomous Driving Clustering +3

Paper
Code

Track, then Decide: Category-Agnostic Vision-based Multi-Object Tracking

no code implementations • 21 Dec 2017 • Aljoša Ošep, Wolfgang Mehner, Paul Voigtlaender, Bastian Leibe

In this paper, we propose a model-free multi-object tracking approach that uses a category-agnostic image segmentation method to track objects.

Image Segmentation Multi-Object Tracking +3

Paper
Add Code

Online Adaptation of Convolutional Neural Networks for Video Object Segmentation

no code implementations • 28 Jun 2017 • Paul Voigtlaender, Bastian Leibe

We tackle the task of semi-supervised video object segmentation, i. e. segmenting the pixels belonging to an object in the video using the ground truth pixel mask for the first frame.

Ranked #2 on Visual Object Tracking on YouTube-VOS 2018

Object Segmentation +4

Paper
Add Code

The Atari Grand Challenge Dataset

2 code implementations • 31 May 2017 • Vitaly Kurin, Sebastian Nowozin, Katja Hofmann, Lucas Beyer, Bastian Leibe

Recent progress in Reinforcement Learning (RL), fueled by its combination, with Deep Learning has enabled impressive results in learning to interact with complex virtual environments, yet real-world applications of RL are still scarce.

Imitation Learning Reinforcement Learning (RL)

Paper
Code

Towards a Principled Integration of Multi-Camera Re-Identification and Tracking through Optimal Bayes Filters

2 code implementations • 12 May 2017 • Lucas Beyer, Stefan Breuers, Vitaly Kurin, Bastian Leibe

With the rise of end-to-end learning through deep learning, person detectors and re-identification (ReID) models have recently become very strong.

760

Paper
Code

In Defense of the Triplet Loss for Person Re-Identification

31 code implementations • 22 Mar 2017 • Alexander Hermans, Lucas Beyer, Bastian Leibe

In the past few years, the field of computer vision has gone through a revolution fueled mainly by the advent of large datasets and the adoption of deep convolutional neural networks for end-to-end learning.

Ranked #3 on Person Re-Identification on CUHK03 (Rank-5 metric)

General Classification Metric Learning +1

3,979

Paper
Code

Semi-Supervised Deep Learning for Monocular Depth Map Prediction

no code implementations • CVPR 2017 • Yevhen Kuznietsov, Jörg Stückler, Bastian Leibe

Supervised deep learning often suffers from the lack of sufficient training data.

Monocular Depth Estimation

Paper
Add Code

Keyframe-Based Visual-Inertial Online SLAM with Relocalization

no code implementations • 7 Feb 2017 • Anton Kasyanov, Francis Engelmann, Jörg Stückler, Bastian Leibe

Our visual-inertial SLAM system is based on a real-time capable visual-inertial odometry method that provides locally consistent trajectory and map estimates.

Pose Tracking Simultaneous Localization and Mapping

Paper
Add Code

Superpixels: An Evaluation of the State-of-the-Art

2 code implementations • 6 Dec 2016 • David Stutz, Alexander Hermans, Bastian Leibe

As such, and due to their quick adoption in a wide range of applications, appropriate benchmarks are crucial for algorithm selection and comparison.

Superpixels

387

Paper
Code

Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes

4 code implementations • CVPR 2017 • Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe

Therefore, additional processing steps have to be performed in order to obtain pixel-accurate segmentation masks at the full image resolution.

Ranked #25 on Real-Time Semantic Segmentation on Cityscapes test

Autonomous Driving Image Segmentation +3

276

Paper
Code

Incremental Object Discovery in Time-Varying Image Collections

no code implementations • CVPR 2016 • Theodora Kontogianni, Markus Mathias, Bastian Leibe

Abstract In this paper, we address the problem of object discovery in time-varying, large-scale image collections.

Clustering Image Retrieval +3

Paper
Add Code

DROW: Real-Time Deep Learning based Wheelchair Detection in 2D Range Data

no code implementations • 8 Mar 2016 • Lucas Beyer, Alexander Hermans, Bastian Leibe

We propose a Convolutional Neural Network (CNN) based detector for this task.

Paper
Add Code

Visual Landmark Recognition from Internet Photo Collections: A Large-Scale Evaluation

no code implementations • 18 Sep 2014 • Tobias Weyand, Bastian Leibe

We evaluate how different choices of methods and parameters for the individual pipeline steps affect overall system performance and examine their effects for different query categories such as buildings, paintings or sculptures.

Clustering Landmark Recognition

Paper
Add Code

Probabilistic Labeling Cost for High-Accuracy Multi-View Reconstruction

no code implementations • CVPR 2014 • Ilya Kostrikov, Esther Horbert, Bastian Leibe

In this paper, we propose a novel labeling cost for multi- view reconstruction.

Vocal Bursts Intensity Prediction

Paper
Add Code

Tracking People and Their Objects

no code implementations • CVPR 2013 • Tobias Baumgartner, Dennis Mitzel, Bastian Leibe

Current pedestrian tracking approaches ignore important aspects of human behavior.

Object

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.