Search Results for author: Zhiyu Xue

Found 5 papers, 2 papers with code

Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness

no code implementations • 6 Jun 2024 • Guangliang Liu, Milad Afshari, Xitong Zhang, Zhiyu Xue, Avrajit Ghosh, Bidhan Bashyal, Rongrong Wang, Kristen Johnson

Our proposed framework can push the fine-tuned model to approach the bias lower bound during downstream fine-tuning, indicating that the ineffectiveness of debiasing can be alleviated by overcoming the forgetting issue through regularizing successfully debiased attention heads based on the PLMs' bias levels from stages of pretraining and debiasing.

Language Modelling

Paper
Add Code

On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept

no code implementations • 4 Jun 2024 • Guangliang Liu, Haitao Mao, Bochuan Cao, Zhiyu Xue, Kristen Johnson, Jiliang Tang, Rongrong Wang

When these instructions lack specific details about the issues in the response, this is referred to as leveraging the intrinsic self-correction capability.

Paper
Add Code

Initialization Matters for Adversarial Transfer Learning

1 code implementation • 10 Dec 2023 • Andong Hua, Jindong Gu, Zhiyu Xue, Nicholas Carlini, Eric Wong, Yao Qin

Based on this, we propose Robust Linear Initialization (RoLI) for adversarial finetuning, which initializes the linear head with the weights obtained by adversarial linear probing to maximally inherit the robustness from pretraining.

Adversarial Robustness Image Classification +1

Paper
Code

PAC-tuning:Fine-tuning Pretrained Language Models with PAC-driven Perturbed Gradient Descent

no code implementations • 26 Oct 2023 • Guangliang Liu, Zhiyu Xue, Xitong Zhang, Kristen Marie Johnson, Rongrong Wang

Fine-tuning pretrained language models (PLMs) for downstream tasks is a large-scale optimization problem, in which the choice of the training algorithm critically determines how well the trained model can generalize to unseen test data, especially in the context of few-shot learning.

Data Augmentation Few-Shot Learning

Paper
Add Code

Region Comparison Network for Interpretable Few-shot Image Classification

1 code implementation • 8 Sep 2020 • Zhiyu Xue, Lixin Duan, Wen Li, Lin Chen, Jiebo Luo

For that, in this work, we propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works as in a neural network as well as to find out specific regions that are related to each other in images coming from the query and support sets.

Ranked #33 on Few-Shot Image Classification on CIFAR-FS 5-way (5-shot)

Classification Few-Shot Image Classification +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.