Search Results for author: Mikihiro Tanaka

Found 5 papers, 2 papers with code

Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches

no code implementations • 8 May 2024 • Qing Yu, Mikihiro Tanaka, Kent Fujiwara

To build a cross-modal latent space between 3D human motion and language, acquiring large-scale and high-quality human motion data is crucial.

Human Interaction Recognition • Transfer Learning

Leveraging Image-Text Similarity and Caption Modification for the DataComp Challenge: Filtering Track and BYOD Track

1 code implementation • 23 Oct 2023 • Shuhei Yokoo, Peifei Zhu, Yuchi Ishikawa, Mikihiro Tanaka, Masayoshi Kondo, Hirokatsu Kataoka

Our solution adopts large multimodal models CLIP and BLIP-2 to filter and modify web crawl data, and utilizes external datasets along with a bag of tricks to improve the data quality.

text similarity

Role-Aware Interaction Generation from Textual Description

no code implementations • ICCV 2023 • Mikihiro Tanaka, Kent Fujiwara

We claim that certain interactions, which we call asymmetric interactions, involve a relationship between an actor and a receiver, whose motions significantly differ depending on the assigned role.

Captioning Images with Novel Objects via Online Vocabulary Expansion

no code implementations • 6 Mar 2020 • Mikihiro Tanaka, Tatsuya Harada

In this study, we introduce a low cost method for generating descriptions from images containing novel objects.

Image Captioning • Word Embeddings

Generating Easy-to-Understand Referring Expressions for Target Identifications

2 code implementations • ICCV 2019 • Mikihiro Tanaka, Takayuki Itamochi, Kenichi Narioka, Ikuro Sato, Yoshitaka Ushiku, Tatsuya Harada

Moreover, we regard easily understood sentences as those that humans comprehend correctly and quickly.

Referring Expression
