1 code implementation • 5 Feb 2024 • Yehui Tang, Fangcheng Liu, Yunsheng Ni, Yuchuan Tian, Zheyuan Bai, Yi-Qi Hu, Sichao Liu, Shangling Jui, Kai Han, Yunhe Wang
Several design formulas are empirically shown to be especially effective for tiny language models, including tokenizer compression, architecture tweaking, parameter inheritance and multiple-round training.
no code implementations • 23 Dec 2023 • Leo Maxime Brunswic, Yinchuan Li, Yushun Xu, Shangling Jui, Lizhuang Ma
GFlowNets are a novel flow-based framework for learning a stochastic policy that generates objects via a sequence of actions, with probability proportional to a given positive reward.
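To make the flow idea concrete, below is a minimal sketch of the trajectory-balance objective commonly used to train GFlowNets (Malkin et al., 2022); it illustrates the general framework only, not the method proposed in this paper, and all names are illustrative.

```python
import torch

def trajectory_balance_loss(log_Z, log_pf, log_pb, reward):
    # log_Z: learned scalar estimate of the log partition function
    # log_pf, log_pb: per-step forward/backward policy log-probs along one
    #   sampled trajectory (1-D tensors); reward: positive terminal reward R(x)
    # Driving this loss to zero makes P(x) proportional to R(x):
    #   (log Z + sum log P_F - log R(x) - sum log P_B)^2
    return (log_Z + log_pf.sum() - torch.log(reward) - log_pb.sum()) ** 2
```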
no code implementations • 23 Dec 2023 • ZiCheng Zhang, HaoNing Wu, Zhongpeng Ji, Chunyi Li, Erli Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, Fengyu Sun, Shangling Jui, Weisi Lin, Guangtao Zhai
Recent advancements in Multi-modality Large Language Models (MLLMs) have demonstrated remarkable capabilities in complex high-level vision tasks.
1 code implementation • 9 Dec 2023 • Zijian Chen, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhongpeng Ji, Fengyu Sun, Shangling Jui, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang
In this paper, we take the first step to benchmark and assess the visual naturalness of AI-generated images.
no code implementations • 1 Sep 2023 • Shiqi Yang, Yaxing Wang, Joost Van de Weijer, Luis Herranz, Shangling Jui, Jian Yang
We capture this intrinsic structure by defining local affinity of the target data, and encourage label consistency among data with high local affinity.
1 code implementation • 15 Aug 2023 • BoYu Chen, Hanxuan Chen, Jiao He, Fengyu Sun, Shangling Jui
We present a simple yet novel parameterized form of linear mapping that achieves remarkable network compression performance: a pseudo SVD called Ternary SVD (TSVD).
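The rough idea can be sketched as follows: factor a weight matrix as in a truncated SVD, but constrain the two factor matrices to ternary values {-1, 0, +1} so that applying them needs no multiplications, then refit the diagonal scales. This is only a naive illustration under those assumptions, not the TSVD algorithm from the paper.

```python
import numpy as np

def ternarize(M, thresh=0.7):
    # Map entries to {-1, 0, +1}: keep the sign of entries whose magnitude
    # exceeds a fraction of the per-column mean magnitude.
    scale = thresh * np.abs(M).mean(axis=0, keepdims=True)
    return np.sign(M) * (np.abs(M) > scale)

def ternary_svd(W, rank):
    # Naive sketch: ternarize the top-`rank` singular vectors, then refit the
    # diagonal scales d by least squares, minimizing ||W - Ut diag(d) Vt||_F.
    U, _, Vt = np.linalg.svd(W, full_matrices=False)
    Ut, Vtt = ternarize(U[:, :rank]), ternarize(Vt[:rank])
    A = np.stack([np.outer(Ut[:, r], Vtt[r]).ravel() for r in range(rank)], axis=1)
    d, *_ = np.linalg.lstsq(A, W.ravel(), rcond=None)
    return Ut, d, Vtt  # W is approximated by Ut @ np.diag(d) @ Vtt
```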
1 code implementation • 5 Mar 2023 • Alexander Detkov, Mohammad Salameh, Muhammad Fetrat Qharabagh, Jialin Zhang, Wei Lui, Shangling Jui, Di Niu
Reparameterization aims to improve the generalization of deep neural networks by transforming convolutional layers into equivalent multi-branched structures during training.
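As a concrete instance of such an equivalence (a RepVGG-style merge, shown here for illustration rather than as this paper's transformation set), a 3x3 convolution plus a parallel 1x1 convolution collapses into a single 3x3 convolution at inference time:

```python
import torch
import torch.nn.functional as F

# A 3x3 conv plus a parallel 1x1 conv equals one 3x3 conv whose kernel is
# the sum after zero-padding the 1x1 kernel to 3x3 (centered).
x = torch.randn(1, 8, 16, 16)
w3 = torch.randn(8, 8, 3, 3)
w1 = torch.randn(8, 8, 1, 1)

branched = F.conv2d(x, w3, padding=1) + F.conv2d(x, w1)
merged = F.conv2d(x, w3 + F.pad(w1, [1, 1, 1, 1]), padding=1)
print(torch.allclose(branched, merged, atol=1e-5))  # True
```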
no code implementations • 21 Feb 2023 • Fred X. Han, Keith G. Mills, Fabian Chudak, Parsa Riahi, Mohammad Salameh, Jialin Zhang, Wei Lu, Shangling Jui, Di Niu
In this paper, we propose a general-purpose neural predictor for NAS that can transfer across search spaces, by representing any given candidate Convolutional Neural Network (CNN) with a Computation Graph (CG) that consists of primitive operators.
1 code implementation • 30 Nov 2022 • Keith G. Mills, Di Niu, Mohammad Salameh, Weichen Qiu, Fred X. Han, Puyuan Liu, Jialin Zhang, Wei Lu, Shangling Jui
Evaluating neural network performance is critical to deep neural network design but a costly procedure.
1 code implementation • 30 Nov 2022 • Keith G. Mills, Fred X. Han, Jialin Zhang, Fabian Chudak, Ali Safari Mamaghani, Mohammad Salameh, Wei Lu, Shangling Jui, Di Niu
In this paper, we propose GENNAPE, a Generalized Neural Architecture Performance Estimator, which is pretrained on open neural architecture benchmarks, and aims to generalize to completely unseen architectures through combined innovations in network representation, contrastive pretraining, and fuzzy clustering-based predictor ensemble.
1 code implementation • 4 Oct 2022 • Kai Wang, Chenshen Wu, Andy Bagdanov, Xialei Liu, Shiqi Yang, Shangling Jui, Joost Van de Weijer
Lifelong object re-identification incrementally learns from a stream of re-identification tasks.
1 code implementation • 7 Jun 2022 • Shiqi Yang, Yaxing Wang, Kai Wang, Shangling Jui, Joost Van de Weijer
In this paper, we investigate Source-free Open-partial Domain Adaptation (SF-OPDA), which addresses the situation where there exist both domain and category shifts between source and target domains.
1 code implementation • ICLR 2022 • Shengyao Lu, Bang Liu, Keith G. Mills, Shangling Jui, Di Niu
Systematicity, i.e., the ability to recombine known parts and rules to form new sequences while reasoning over relational data, is critical to machine intelligence.
1 code implementation • 9 May 2022 • Shiqi Yang, Yaxing Wang, Kai Wang, Shangling Jui, Joost Van de Weijer
Treating SFDA as an unsupervised clustering problem, and following the intuition that local neighbors in feature space should have more similar predictions than other features, we propose to optimize a prediction-consistency objective.
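A minimal sketch of such a neighborhood-consistency objective is given below; the top-k cosine neighbors, the dot-product agreement, and all names are illustrative assumptions rather than the paper's exact loss.

```python
import torch

def neighborhood_consistency_loss(features, probs, k=5, eps=1e-8):
    # features: (N, D) L2-normalized target features
    # probs:    (N, C) softmax predictions for the same samples
    sim = features @ features.t()            # cosine affinities (N, N)
    sim.fill_diagonal_(-1.0)                 # exclude self-matches
    nn_idx = sim.topk(k, dim=1).indices      # indices of k nearest neighbors
    agree = (probs.unsqueeze(1) * probs[nn_idx]).sum(-1)  # (N, k) dot products
    return -(agree + eps).log().mean()       # pull neighbor predictions together
```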
1 code implementation • 9 Nov 2021 • Kai Wang, Xialei Liu, Andy Bagdanov, Luis Herranz, Shangling Jui, Joost Van de Weijer
We propose an approach to IML, which we call Episodic Replay Distillation (ERD), that mixes classes from the current task with class exemplars from previous tasks when sampling episodes for meta-learning.
no code implementations • NeurIPS 2021 • Ke Sun, Yafei Wang, Yi Liu, Yingnan Zhao, Bo Pan, Shangling Jui, Bei Jiang, Linglong Kong
Anderson mixing has been heuristically applied to reinforcement learning (RL) algorithms for accelerating convergence and improving the sampling efficiency of deep RL.
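For context, Anderson mixing extrapolates a fixed-point iteration x <- g(x) (e.g., a Bellman backup) by combining the last few iterates; the generic sketch below shows the classic Type-II update and is not the paper's RL-specific algorithm.

```python
import numpy as np

def anderson_mix(x_hist, g_hist, reg=1e-10):
    # x_hist, g_hist: the last m iterates x_k and their images g(x_k), 1-D arrays.
    # Solve min_a ||sum_k a_k (g_k - x_k)||^2 subject to sum_k a_k = 1,
    # then return the extrapolated iterate sum_k a_k g_k.
    m = len(x_hist)
    R = np.stack([g - x for g, x in zip(g_hist, x_hist)], axis=1)  # residuals (n, m)
    G = R.T @ R + reg * np.eye(m)
    a = np.linalg.solve(G, np.ones(m))
    a /= a.sum()
    return sum(ak * gk for ak, gk in zip(a, g_hist))
```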
2 code implementations • NeurIPS 2021 • Shiqi Yang, Yaxing Wang, Joost Van de Weijer, Luis Herranz, Shangling Jui
In this paper, we address the challenging source-free domain adaptation (SFDA) problem, where the source pretrained model is adapted to the target domain in the absence of source data.
Ranked #3 on Source-Free Domain Adaptation on VisDA-2017
no code implementations • 29 Sep 2021 • Ke Sun, Yi Liu, Yingnan Zhao, Hengshuai Yao, Shangling Jui, Linglong Kong
In real scenarios, the state observations an agent receives may contain measurement errors or adversarial noise, misleading the agent into taking suboptimal actions or even collapsing during training.
Tasks: Distributional Reinforcement Learning, Reinforcement Learning +1
no code implementations • 29 Sep 2021 • Fred X. Han, Fabian Chudak, Keith G. Mills, Mohammad Salameh, Parsa Riahi, Jialin Zhang, Wei Lu, Shangling Jui, Di Niu
Understanding and modelling the performance of neural architectures is key to Neural Architecture Search (NAS).
no code implementations • ICLR 2022 • Yaxing Wang, Joost Van de Weijer, Lu Yu, Shangling Jui
Therefore, we investigate knowledge distillation to transfer knowledge from a high-quality unconditional generative model (e.g., StyleGAN) to conditional image-generation modules in a variety of systems.
no code implementations • 25 Sep 2021 • Keith G. Mills, Fred X. Han, Mohammad Salameh, Seyed Saeed Changiz Rezaei, Linglong Kong, Wei Lu, Shuo Lian, Shangling Jui, Di Niu
In this paper, we propose L²NAS, which learns to intelligently optimize and update architecture hyperparameters via an actor neural network based on the distribution of high-performing architectures in the search history.
1 code implementation • 25 Sep 2021 • Keith G. Mills, Fred X. Han, Jialin Zhang, Seyed Saeed Changiz Rezaei, Fabian Chudak, Wei Lu, Shuo Lian, Shangling Jui, Di Niu
Neural architecture search automates neural network design and has achieved state-of-the-art results in many deep learning applications.
1 code implementation • 17 Sep 2021 • Ke Sun, Yingnan Zhao, Shangling Jui, Linglong Kong
In real scenarios, the state observations an agent receives may contain measurement errors or adversarial noise, misleading the agent into taking suboptimal actions or even collapsing during training.
1 code implementation • ICCV 2021 • Shiqi Yang, Yaxing Wang, Joost Van de Weijer, Luis Herranz, Shangling Jui
In this paper, we propose a new domain adaptation paradigm called Generalized Source-free Domain Adaptation (G-SFDA), where the learned model needs to perform well on both the target and source domains, with only access to current unlabeled target data during adaptation.
Ranked #2 on Source-Free Domain Adaptation on VisDA-2017
7 code implementations • CVPR 2021 • Yixing Xu, Yunhe Wang, Kai Han, Yehui Tang, Shangling Jui, Chunjing Xu, Chang Xu
An effective and efficient architecture performance evaluation scheme is essential for the success of Neural Architecture Search (NAS).
no code implementations • 19 May 2021 • Seyed Saeed Changiz Rezaei, Fred X. Han, Di Niu, Mohammad Salameh, Keith G. Mills, Shuo Lian, Wei Lu, Shangling Jui
Despite the empirical success of neural architecture search (NAS) in deep learning applications, the optimality, reproducibility and cost of NAS schemes remain hard to assess.
1 code implementation • 28 Apr 2021 • Yaxing Wang, Abel Gonzalez-Garcia, Chenshen Wu, Luis Herranz, Fahad Shahbaz Khan, Shangling Jui, Joost Van de Weijer
Therefore, we propose a novel knowledge transfer method for generative models based on mining the knowledge that is most beneficial to a specific target domain, either from a single or multiple pretrained GANs.
no code implementations • 1 Jan 2021 • Seyed Saeed Changiz Rezaei, Fred X. Han, Di Niu, Mohammad Salameh, Keith G. Mills, Shangling Jui
Despite the empirical success of neural architecture search (NAS) algorithms in deep learning applications, the optimality, reproducibility and cost of NAS schemes remain hard to assess.
2 code implementations • 23 Oct 2020 • Shiqi Yang, Yaxing Wang, Joost Van de Weijer, Luis Herranz, Shangling Jui
When adapting to the target domain, the additional classifier, initialized from the source classifier, is expected to find misclassified features.
Tasks: Source-Free Domain Adaptation, Unsupervised Domain Adaptation
no code implementations • 1 Sep 2020 • Tong Mo, Yakun Yu, Mohammad Salameh, Di Niu, Shangling Jui
Deep neural networks have recently become a popular solution to keyword spotting systems, which enable the control of smart devices via voice.
Ranked #1 on Keyword Spotting on Google Speech Commands (Google Speech Commands V1 6 metric)
1 code implementation • 20 Apr 2020 • Xialei Liu, Chenshen Wu, Mikel Menta, Luis Herranz, Bogdan Raducanu, Andrew D. Bagdanov, Shangling Jui, Joost Van de Weijer
To prevent forgetting, we combine generative feature replay in the classifier with feature distillation in the feature extractor.
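A minimal sketch of how these two terms could combine into one loss; the sampled-feature replay, the MSE distillation, and all names are illustrative assumptions, not the paper's implementation.

```python
import torch.nn.functional as F

def replay_distill_loss(logits_replay, labels_replay, feat_new, feat_old, lam=1.0):
    # logits_replay: classifier outputs on features sampled from a generator
    #   trained on previous tasks (generative feature replay)
    # feat_new, feat_old: current vs. frozen previous extractor features on
    #   current-task images (feature distillation)
    ce = F.cross_entropy(logits_replay, labels_replay)
    fd = F.mse_loss(feat_new, feat_old.detach())
    return ce + lam * fd
```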
2 code implementations • CVPR 2020 • Lu Yu, Bartłomiej Twardowski, Xialei Liu, Luis Herranz, Kai Wang, Yongmei Cheng, Shangling Jui, Joost Van de Weijer
The vast majority of methods have studied this scenario for classification networks, where for each new task the classification layer of the network must be augmented with additional weights to make room for the newly added classes.
4 code implementations • 30 Sep 2019 • Yixing Xu, Yunhe Wang, Kai Han, Yehui Tang, Shangling Jui, Chunjing Xu, Chang Xu
An effective and efficient architecture performance evaluation scheme is essential for the success of Neural Architecture Search (NAS).
no code implementations • 25 Sep 2019 • Chao GAO, Martin Mueller, Ryan Hayward, Hengshuai Yao, Shangling Jui
A three-head network architecture has recently been proposed that can learn a third, action-value head on the same fixed dataset used for a two-head network.
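For illustration, a three-head network simply adds an action-value head alongside the usual policy and state-value heads of a two-head net; the sketch below uses hypothetical layer sizes.

```python
import torch.nn as nn

class ThreeHeadNet(nn.Module):
    # Shared trunk with policy, state-value, and action-value heads.
    def __init__(self, in_dim, n_actions, hidden=128):
        super().__init__()
        self.trunk = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU())
        self.policy = nn.Linear(hidden, n_actions)  # move probabilities (logits)
        self.value = nn.Linear(hidden, 1)           # state value v(s)
        self.q = nn.Linear(hidden, n_actions)       # action values q(s, a)

    def forward(self, x):
        h = self.trunk(x)
        return self.policy(h), self.value(h), self.q(h)
```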
no code implementations • 26 Mar 2019 • Ramchalam Kinattinkara Ramakrishnan, Shangling Jui, Vahid Partovi Nia
We perform an exhaustive search over deep neural network architectures and obtain a Pareto front of Color Peak Signal-to-Noise Ratio (CPSNR) as the performance criterion versus the number of parameters as the model complexity, which beats the state of the art.
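CPSNR is PSNR computed jointly over the three color channels; a small sketch, assuming 8-bit images:

```python
import numpy as np

def cpsnr(img_a, img_b, peak=255.0):
    # Color PSNR: mean squared error over all pixels and all three channels.
    mse = np.mean((img_a.astype(np.float64) - img_b.astype(np.float64)) ** 2)
    return 10.0 * np.log10(peak ** 2 / mse)
```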
no code implementations • 20 Mar 2019 • Nazmus Sakib, Hengshuai Yao, Hong Zhang, Shangling Jui
In this paper, we use reinforcement learning for safe driving in adversarial settings.