no code implementations • 27 Mar 2024 • Ting-Kang Yen, Igor Morawski, Shusil Dangi, Kai He, Chung-Yi Lin, Jia-Fong Yeh, Hung-Ting Su, Winston Hsu
However, feature asynchronism and sparsity cause invisible objects due to no relative motion to the camera, posing a significant challenge in the task.
no code implementations • 6 Jan 2024 • ChungYi Lin, Shen-Lung Tung, Hung-Ting Su, Winston H. Hsu
To address the limitations of traffic prediction from location-bound detectors, we present Geographical Cellular Traffic (GCT) flow, a novel data source that leverages the extensive coverage of cellular traffic to capture mobility patterns.
1 code implementation • 7 Aug 2023 • Chien Cheng Chyou, Hung-Ting Su, Winston H. Hsu
Adversarial robustness poses a critical challenge in the deployment of deep learning models for real-world applications.
no code implementations • 7 Apr 2023 • Hung-Ting Su, Yulei Niu, Xudong Lin, Winston H. Hsu, Shih-Fu Chang
Causal Video Question Answering (CVidQA) queries not only association or temporal relations but also causal relations in a video.
1 code implementation • 8 Oct 2022 • Hsin-Ying Lee, Hung-Ting Su, Bing-Chen Tsai, Tsung-Han Wu, Jia-Fong Yeh, Winston H. Hsu
While recent large-scale video-language pre-training made great progress in video question answering, the design of spatial modeling of video-language models is less fine-grained than that of image-language models; existing practices of temporal modeling also suffer from weak and noisy alignment between modalities.
no code implementations • 22 Sep 2022 • Tsung-Han Wu, Hung-Ting Su, Shang-Tse Chen, Winston H. Hsu
Fairness and robustness play vital roles in trustworthy machine learning.
1 code implementation • CVPR 2022 • Kuan-Chih Huang, Tsung-Han Wu, Hung-Ting Su, Winston H. Hsu
Moreover, different from conventional pixel-wise positional encodings, we introduce a novel depth positional encoding (DPE) to inject depth positional hints into transformers.
no code implementations • 4 Dec 2021 • Jia-Fong Yeh, Chi-Ming Chung, Hung-Ting Su, Yi-Ting Chen, Winston H. Hsu
(3) Learning from a different expert.
no code implementations • 29 Nov 2021 • Guan-Rong Lu, Yueh-Cheng Liu, Tung-I Chen, Hung-Ting Su, Tsung-Han Wu, Winston H. Hsu
We design a new Masked Gradient Update (MGU) module to generate auxiliary data along the boundary of in-distribution data points.
1 code implementation • 18 Aug 2021 • Chung-Yi Lin, Hung-Ting Su, Shen-Lung Tung, Winston H. Hsu
Furthermore, we propose a new model for multivariate spatial-temporal prediction, mainly consisting of two extending graph attention networks (GAT).
no code implementations • 10 Aug 2021 • Hung-Ting Su, Po-Wei Shen, Bing-Chen Tsai, Wen-Feng Cheng, Ke-Jyun Wang, Winston H. Hsu
By coping with the trope understanding task and enabling the deep cognition skills of machines, data mining applications and algorithms could be taken to the next level.
1 code implementation • ICCV 2021 • Tsung-Han Wu, Yueh-Cheng Liu, Yu-Kai Huang, Hsin-Ying Lee, Hung-Ting Su, Ping-Chia Huang, Winston H. Hsu
Despite the success of deep learning on supervised point cloud semantic segmentation, obtaining large-scale point-by-point manual annotations is still a significant challenge.
no code implementations • CVPR 2021 • Yu-Kai Huang, Yueh-Cheng Liu, Tsung-Han Wu, Hung-Ting Su, Yu-Cheng Chang, Tsung-Lin Tsou, Yu-An Wang, Winston H. Hsu
Dense depth estimation plays a key role in multiple applications such as robotics, 3D reconstruction, and augmented reality.
1 code implementation • WACV 2021 • Shuo-Diao Yang, Hung-Ting Su, Winston H. Hsu, Wen-Chin Chen
Instead of counting a pre-defined class, our model is able to count instances based on input reference images and reduces the huge cost of data collection, training and parameter tuning for each new object class.
no code implementations • 10 Apr 2021 • Yueh-Cheng Liu, Yu-Kai Huang, Hung-Yueh Chiang, Hung-Ting Su, Zhe-Yu Liu, Chin-Tang Chen, Ching-Yu Tseng, Winston H. Hsu
Most 3D neural networks are trained from scratch owing to the lack of large-scale labeled 3D datasets.
1 code implementation • NAACL 2021 • Ke-Jyun Wang, Yun-Hsuan Liu, Hung-Ting Su, Jen-Wei Wang, Yu-Siang Wang, Winston H. Hsu, Wen-Chin Chen
To effectively apply robots in working environments and assist humans, it is essential to develop and evaluate how visual grounding (VG) can affect machine performance on occluded objects.
no code implementations • 3 Mar 2021 • Yu-Kai Huang, Yueh-Cheng Liu, Tsung-Han Wu, Hung-Ting Su, Yu-Cheng Chang, Tsung-Lin Tsou, Yu-An Wang, Winston H. Hsu
Dense depth estimation plays a key role in multiple applications such as robotics, 3D reconstruction, and augmented reality.
1 code implementation • 24 Feb 2021 • Tung-I Chen, Yueh-Cheng Liu, Hung-Ting Su, Yu-Cheng Chang, Yu-Hsiang Lin, Jia-Fong Yeh, Wen-Chin Chen, Winston H. Hsu
While recent progress has significantly boosted few-shot classification (FSC) performance, few-shot object detection (FSOD) remains challenging for modern learning systems.
Ranked #9 on Few-Shot Object Detection on MS-COCO (10-shot)
1 code implementation • 19 Jan 2021 • Chen-Hsi Chang, Hung-Ting Su, Jui-heng Hsu, Yu-Siang Wang, Yu-Cheng Chang, Zhe Yu Liu, Ya-Liang Chang, Wen-Feng Cheng, Ke-Jyun Wang, Winston H. Hsu
Experimental result demonstrates that modern models including BERT contextual embedding, movie tag prediction systems, and relational networks, perform at most 37% of human performance (23. 97/64. 87) in terms of F1 score.
1 code implementation • 5 Jan 2021 • Hung-Ting Su, Chen-Hsi Chang, Po-Wei Shen, Yu-Siang Wang, Ya-Liang Chang, Yu-Cheng Chang, Pu-Jen Cheng, Winston H. Hsu
Furthermore, using our generated QA pairs only on the Video QA task, we can surpass some supervised baselines.
no code implementations • 21 Oct 2020 • Kuang-Yu Jeng, Yueh-Cheng Liu, Zhe Yu Liu, Jen-Wei Wang, Ya-Liang Chang, Hung-Ting Su, Winston H. Hsu
We proposed an end-to-end grasp detection network, Grasp Detection Network (GDN), cooperated with a novel coarse-to-fine (C2F) grasp representation design to detect diverse and accurate 6-DoF grasps based on point clouds.
no code implementations • 24 Apr 2020 • Yu-Kai Huang, Yueh-Cheng Liu, Tsung-Han Wu, Hung-Ting Su, Winston H. Hsu
The performance of image based stereo estimation suffers from lighting variations, repetitive patterns and homogeneous appearance.
no code implementations • 5 Jul 2019 • Yu-Siang Wang, Hung-Ting Su, Chen-Hsi Chang, Zhe-Yu Liu, Winston H. Hsu
We introduce a novel task, Video Question Generation (Video QG).