no code implementations • 12 Apr 2024 • Kai Kohyama, Shintaro Shiba, Yoshimitsu Aoki
The experimental results show that the proposed method outperforms conventional frame-based methods in the estimation accuracy of both pose and body mesh.
no code implementations • 19 Mar 2024 • Haruya Ishikawa, Takumi Iida, Yoshinori Konishi, Yoshimitsu Aoki
In this work, we address these challenges by leveraging the abundance of unlabeled data available.
1 code implementation • 17 Mar 2024 • Yasufumi Kawano, Yoshimitsu Aoki
Unsupervised and open-vocabulary segmentation, proposed to tackle these issues, faces challenges, including the inability to assign specific class labels to clusters and the necessity of user-provided text queries for guidance.
1 code implementation • 17 Mar 2024 • Yasufumi Kawano, Yoshimitsu Aoki
Semantic segmentation is essential in computer vision for various applications, yet traditional approaches face significant challenges, including the high cost of annotation and extensive training for supervised learning.
2 code implementations • 1 Nov 2023 • Shintaro Shiba, Friedhelm Hamann, Yoshimitsu Aoki, Guillermo Gallego
Schlieren imaging is an optical technique to observe the flow of transparent media, such as air or water, without any particle seeding.
1 code implementation • 19 Apr 2023 • Haruya Ishikawa, Yoshimitsu Aoki
Motivated by the recent development in improving semantic segmentation by incorporating boundaries as auxiliary tasks, we propose a multi-task framework that uses semantic boundary detection (SBD) as an auxiliary task.
1 code implementation • 16 Mar 2023 • Haruya Ishikawa, Yoshimitsu Aoki
With the increase in demand for service robots and automated inspection, agents need to localize themselves in their surrounding environments to achieve more natural communication with humans through shared context.
no code implementations • CVPR 2023 • Yuto Shibata, Yutaka Kawashima, Mariko Isogawa, Go Irie, Akisato Kimura, Yoshimitsu Aoki
Aiming to capture subtle sound changes to reveal detailed pose information, we explicitly extract phase features from the acoustic signals together with typical spectrum features and feed them into our human pose estimation network.
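The idea of pairing typical spectrum features with explicit phase features can be sketched as a short-time Fourier transform that keeps both the log-magnitude and the angle of each bin. This is a minimal, hypothetical feature extractor (the window size, hop, and stacking are assumptions, not the authors' network input):

```python
import numpy as np

def acoustic_features(signal, win=256, hop=128):
    """STFT features: log-magnitude spectrum (typical spectrum feature)
    stacked with the explicit phase of each bin, per frame."""
    n_frames = 1 + (len(signal) - win) // hop
    window = np.hanning(win)
    frames = np.stack(
        [signal[i * hop : i * hop + win] * window for i in range(n_frames)]
    )
    spec = np.fft.rfft(frames, axis=1)
    log_mag = np.log1p(np.abs(spec))  # magnitude (spectrum) features
    phase = np.angle(spec)            # explicit phase features
    return np.concatenate([log_mag, phase], axis=1)

# A 440 Hz tone sampled at 16 kHz, just to exercise the extractor.
sig = np.sin(2 * np.pi * 440 * np.arange(4096) / 16000)
feats = acoustic_features(sig)
print(feats.shape)  # (31, 258): 31 frames, 2 * (256 // 2 + 1) features each
```

Each frame thus carries twice the usual feature width, with the second half preserving the phase information that a magnitude-only spectrogram discards.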
no code implementations • 23 Dec 2022 • Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego
Event cameras are novel bio-inspired sensors that offer advantages over traditional cameras (low latency, high dynamic range, low power, etc.).
1 code implementation • 14 Dec 2022 • Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego
We hope our work opens the door for future applications that unlock the advantages of event cameras.
1 code implementation • 2022 • Yuhi Matsuo, Naofumi Akimoto, Yoshimitsu Aoki
In this paper, we present a large-scale and diverse dataset called fully synthetic document shadow removal dataset (FSDSRD) that does not require capturing documents.
1 code implementation • 20 Jul 2022 • Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego
Event cameras respond to scene dynamics and offer advantages for motion estimation.
1 code implementation • 8 Jul 2022 • Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego
Contrast maximization (CMax) is a framework that provides state-of-the-art results on several event-based computer vision tasks, such as ego-motion or optical flow estimation.
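The core of CMax can be sketched in a few lines: warp events along a candidate motion, accumulate them into an image of warped events, and score the candidate by the image's contrast (variance); the motion that best aligns the events maximizes contrast. This is a toy sketch with a brute-force search over constant flow, not the paper's method or optimizer:

```python
import numpy as np

def contrast_of_warp(events, theta, img_shape):
    """Warp events (t, x, y) back to t=0 along flow theta=(vx, vy)
    and return the variance (contrast) of the warped-event image."""
    t, x, y = events[:, 0], events[:, 1], events[:, 2]
    xw = np.round(x - theta[0] * t).astype(int)
    yw = np.round(y - theta[1] * t).astype(int)
    # Keep only events that land inside the image.
    m = (0 <= xw) & (xw < img_shape[1]) & (0 <= yw) & (yw < img_shape[0])
    iwe = np.zeros(img_shape)
    np.add.at(iwe, (yw[m], xw[m]), 1.0)  # accumulate event counts
    return iwe.var()

# Toy data: events generated by a constant flow (vx=10, vy=0).
rng = np.random.default_rng(0)
t = rng.uniform(0, 1, 500)
x0 = rng.integers(5, 25, 500).astype(float)
y0 = rng.integers(5, 25, 500).astype(float)
events = np.stack([t, x0 + 10 * t, y0], axis=1)

# Contrast peaks at the true flow, so a coarse search recovers it.
candidates = [(v, 0.0) for v in range(0, 21, 5)]
best = max(candidates, key=lambda th: contrast_of_warp(events, th, (30, 40)))
print(best)  # (10, 0.0)
```

At the true flow the warped events collapse back onto their source pixels, producing a sharp, high-variance image; any other candidate blurs the events and lowers the score.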
1 code implementation • CVPR 2022 • Naofumi Akimoto, Yuhi Matsuo, Yoshimitsu Aoki
To improve the properties of the output as a 360-degree image, we also propose a WS-perceptual loss and circular inference.
2 code implementations • 14 Jul 2020 • Yuchi Ishikawa, Seito Kasai, Yoshimitsu Aoki, Hirokatsu Kataoka
Our model architecture consists of a long-term feature extractor and two branches: the Action Segmentation Branch (ASB) and the Boundary Regression Branch (BRB).
Ranked #10 on Action Segmentation on GTEA
1 code implementation • 19 May 2020 • Seito Kasai, Yuchi Ishikawa, Masaki Hayashi, Yoshimitsu Aoki, Kensho Hara, Hirokatsu Kataoka
In this paper, we present a framework that jointly retrieves and spatiotemporally highlights actions in videos by enhancing current deep cross-modal retrieval methods.
no code implementations • CVPR 2020 • Naofumi Akimoto, Huachun Zhu, Yanghua Jin, Yoshimitsu Aoki
We address the problem of soft color segmentation, defined as decomposing a given image into several RGBA layers, each containing only homogeneous color regions.
no code implementations • CVPR 2018 • Tomoyuki Suzuki, Hirokatsu Kataoka, Yoshimitsu Aoki, Yutaka Satoh
In this paper, we propose a novel approach for traffic accident anticipation through (i) Adaptive Loss for Early Anticipation (AdaLEA) and (ii) a large-scale self-annotated incident database for anticipation.
no code implementations • 1 May 2016 • Hirokatsu Kataoka, Masaki Hayashi, Kenji Iwata, Yutaka Satoh, Yoshimitsu Aoki, Slobodan Ilic
Latent Dirichlet allocation (LDA) is used to develop approximations of human motion primitives; these are mid-level representations, and they adaptively integrate dominant vectors when classifying human activities.
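The use of LDA here amounts to treating each clip as a bag of quantized motion "words" and letting the topics act as mid-level motion primitives. A minimal sketch with scikit-learn's `LatentDirichletAllocation` on synthetic count data (the codebook size, clip counts, and topic number below are illustrative assumptions):

```python
import numpy as np
from sklearn.decomposition import LatentDirichletAllocation

# Hypothetical setup: each clip is a bag of quantized motion "words",
# e.g. codebook indices of local motion descriptors.
rng = np.random.default_rng(0)
counts = rng.poisson(2.0, size=(40, 50))  # 40 clips x 50-word vocabulary

# LDA discovers latent "motion primitives" as topics over the vocabulary;
# each clip is then summarized by its mixture of primitives.
lda = LatentDirichletAllocation(n_components=5, random_state=0)
primitive_mix = lda.fit_transform(counts)

print(primitive_mix.shape)  # (40, 5): per-clip primitive weights
```

The per-clip mixture weights are the mid-level representation; a downstream classifier can then integrate the dominant primitives when labeling activities.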
no code implementations • CVPR 2015 • Kiyoshi Matsuo, Yoshimitsu Aoki
Our method is composed of two steps: calculation of the local tangents, and surface reconstruction.