no code implementations • 29 May 2024 • Yichen Wen, Zhiquan Tan, Kaipeng Zheng, Chuanlong Xie, Weiran Huang
In this work, we fill this gap by establishing theoretical performance guarantees, which reveal how a model's performance is bounded by the training losses of previous tasks in the contrastive continual learning framework.
no code implementations • 1 Feb 2024 • Zhiquan Tan, Chenghai Li, Weiran Huang
This paper investigates the information encoded in the embeddings of large language models (LLMs).
1 code implementation • 30 Jan 2024 • Lai Wei, Zhiquan Tan, Chenghai Li, Jindong Wang, Weiran Huang
Large language models (LLMs) have revolutionized the field of natural language processing, extending their strong capabilities into multi-modal domains.
no code implementations • 11 Nov 2023 • Zhiquan Tan, Weiran Huang
Recently, an intriguing phenomenon called grokking has attracted much attention, where generalization emerges long after the model has overfitted the training data.
no code implementations • 26 Oct 2023 • Zhiquan Tan, Kaipeng Zheng, Weiran Huang
Semi-supervised learning has made remarkable strides by effectively utilizing a limited amount of labeled data while capitalizing on the abundant information present in unlabeled data.
2 code implementations • 29 Sep 2023 • Zhiquan Tan, Jingqin Yang, Weiran Huang, Yang Yuan, Yifan Zhang
In this paper, we conduct a comprehensive analysis of two dual-branch (Siamese architecture) self-supervised learning approaches, namely Barlow Twins and spectral contrastive learning, through the lens of matrix mutual information.
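As context for the analysis above, a minimal NumPy sketch of the Barlow Twins objective, one of the two dual-branch approaches the paper studies, is shown below. The function name and the default trade-off weight `lam` are illustrative, not taken from the paper.

```python
import numpy as np

def barlow_twins_loss(z1, z2, lam=0.005):
    """Barlow Twins objective (sketch): push the cross-correlation matrix of
    the two views' standardized embeddings toward the identity. Assumes each
    embedding dimension has nonzero variance across the batch."""
    n, d = z1.shape
    # standardize each embedding dimension across the batch
    z1 = (z1 - z1.mean(0)) / z1.std(0)
    z2 = (z2 - z2.mean(0)) / z2.std(0)
    c = z1.T @ z2 / n                         # d x d cross-correlation matrix
    on_diag = ((np.diag(c) - 1) ** 2).sum()   # invariance term
    off_diag = (c ** 2).sum() - (np.diag(c) ** 2).sum()  # redundancy term
    return float(on_diag + lam * off_diag)
```

When both branches see the same embeddings, the invariance (on-diagonal) term vanishes, since each standardized dimension correlates perfectly with itself.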
3 code implementations • 27 May 2023 • Yifan Zhang, Zhiquan Tan, Jingqin Yang, Weiran Huang, Yang Yuan
Inspired by this framework, we introduce Matrix-SSL, a novel approach that leverages matrix information theory to interpret the maximum entropy encoding loss as matrix uniformity loss.
Ranked #1 on Contrastive Learning on imagenet-1k
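To make the "matrix uniformity" idea concrete, here is one plausible reading as a hedged sketch: measure how far the trace-normalized feature covariance is from the maximally uniform spectrum I/d (the maximum-entropy case). The function name and normalization are assumptions, not the paper's exact formulation.

```python
import numpy as np

def matrix_uniformity_loss(z):
    """Illustrative matrix-uniformity loss: squared Frobenius distance between
    the trace-normalized feature covariance and I/d, which has a uniform
    eigenvalue spectrum (maximum entropy)."""
    n, d = z.shape
    z = z - z.mean(0)                  # center the batch
    cov = z.T @ z / n                  # d x d covariance
    cov = cov / np.trace(cov)          # trace-normalize to unit-trace PSD matrix
    return float(np.linalg.norm(cov - np.eye(d) / d, "fro") ** 2)
```

Isotropic features score near zero, while collapsed (low-rank) features score high, matching the intuition that a maximum-entropy encoding spreads variance evenly across dimensions.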
1 code implementation • 17 May 2023 • Yifan Zhang, Jingqin Yang, Zhiquan Tan, Yang Yuan
Semi-supervised learning has achieved notable success by leveraging very few labeled data and exploiting the wealth of information derived from unlabeled data.
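One common mechanism for "exploiting the wealth of information in unlabeled data" is confidence-thresholded pseudo-labeling; the sketch below illustrates that general baseline, not this paper's specific method, and the threshold value is an assumption.

```python
import numpy as np

def select_pseudo_labels(probs, threshold=0.95):
    """Keep only unlabeled examples whose top predicted class probability
    exceeds `threshold`; return their indices and pseudo-labels."""
    conf = probs.max(axis=1)        # per-example confidence
    labels = probs.argmax(axis=1)   # hard pseudo-label
    mask = conf >= threshold
    return np.nonzero(mask)[0], labels[mask]
```

The selected pairs are then typically added to the labeled set (or used in a consistency loss) for further training.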
1 code implementation • 26 Apr 2023 • Zhiquan Tan, ZiHao Wang, Yifan Zhang
Label hierarchy is an important source of external knowledge that can enhance classification performance.
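As a minimal illustration of using a label hierarchy as external knowledge (the mapping and function are hypothetical, not the paper's method), one can marginalize fine-class probabilities up to their coarse parents so that predictions stay consistent across hierarchy levels:

```python
import numpy as np

def coarse_probs(fine_probs, fine_to_coarse, n_coarse):
    """Sum fine-class probabilities into their coarse parents.
    `fine_to_coarse[f]` gives the coarse label of fine class f."""
    out = np.zeros((fine_probs.shape[0], n_coarse))
    for f, c in enumerate(fine_to_coarse):
        out[:, c] += fine_probs[:, f]
    return out
```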
1 code implementation • 27 Mar 2023 • Zhiquan Tan, Yifan Zhang, Jingqin Yang, Yang Yuan
Contrastive learning is a powerful self-supervised learning method, but our theoretical understanding of how and why it works remains limited.
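For reference, the canonical contrastive objective in question is InfoNCE, sketched below in NumPy: each sample's two views form a positive pair and the rest of the batch serves as negatives. The temperature default is illustrative.

```python
import numpy as np

def info_nce(z1, z2, tau=0.1):
    """InfoNCE loss (sketch): cross-entropy over cosine similarities,
    with the matching view (the diagonal) as the target class."""
    z1 = z1 / np.linalg.norm(z1, axis=1, keepdims=True)
    z2 = z2 / np.linalg.norm(z2, axis=1, keepdims=True)
    logits = z1 @ z2.T / tau                              # n x n similarities
    logits = logits - logits.max(axis=1, keepdims=True)   # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))
```

Correctly paired views yield a much lower loss than mismatched ones, which is exactly the alignment behavior theoretical analyses of contrastive learning aim to explain.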