1 code implementation • 4 Mar 2024 • YiFan Song, Da Yin, Xiang Yue, Jie Huang, Sujian Li, Bill Yuchen Lin
This iterative cycle of exploration and training fosters continued improvement in the agents.
1 code implementation • 25 Feb 2024 • Fanjin Zhang, Kun Cao, Yukuo Cen, Jifan Yu, Da Yin, Jie Tang
Tracing the source of research papers is a fundamental yet challenging task for researchers.
1 code implementation • 9 Nov 2023 • Da Yin, Faeze Brahman, Abhilasha Ravichander, Khyathi Chandu, Kai-Wei Chang, Yejin Choi, Bill Yuchen Lin
To foster generalizable agent learning, we collect large-scale, unified, and high-quality training annotations derived from diverse ground-truth reasoning rationales across various complex interactive tasks.
no code implementations • 16 Oct 2023 • Christina Chance, Da Yin, Dakuo Wang, Kai-Wei Chang
Applying counterfactual data augmentation to the FairytaleQA dataset, we evaluate model robustness to swapped gender character information, and then mitigate learned biases by introducing counterfactual gender stereotypes during training.
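A minimal sketch of what gender-swap counterfactual augmentation can look like, assuming a simple bidirectional word map; the word list and the `swap_gender` helper are illustrative, not the authors' implementation (a real pipeline would also need POS disambiguation, e.g. for possessive vs. object "her").

```python
import re

# Illustrative subset of gendered word pairs (bidirectional map).
# Note: "her" is mapped only to "his" here; disambiguating the
# possessive/object reading is left out of this sketch.
GENDER_PAIRS = {
    "he": "she", "she": "he",
    "his": "her", "her": "his",
    "king": "queen", "queen": "king",
    "prince": "princess", "princess": "prince",
    "boy": "girl", "girl": "boy",
}

def swap_gender(text: str) -> str:
    """Return a counterfactual copy of `text` with gendered words
    swapped, preserving capitalization of the first letter."""
    def repl(match: re.Match) -> str:
        word = match.group(0)
        swapped = GENDER_PAIRS[word.lower()]
        return swapped.capitalize() if word[0].isupper() else swapped

    pattern = r"\b(" + "|".join(GENDER_PAIRS) + r")\b"
    return re.sub(pattern, repl, text, flags=re.IGNORECASE)

print(swap_gender("The prince asked his father."))
# -> The princess asked her father.
```

The swapped copies are added to the training set alongside the originals, so the model sees both attributions of each stereotyped role.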
1 code implementation • 30 May 2023 • Xiao Liu, Da Yin, Chen Zhang, Yansong Feng, Dongyan Zhao
Causal reasoning, the ability to identify cause-and-effect relationships, is crucial in human thinking.
1 code implementation • 23 May 2023 • Da Yin, Xiao Liu, Fan Yin, Ming Zhong, Hritik Bansal, Jiawei Han, Kai-Wei Chang
Instruction tuning has emerged as a way to enhance the capabilities of large language models (LLMs) to comprehend instructions and generate appropriate responses.
1 code implementation • 27 Mar 2023 • Di Wu, Da Yin, Kai-Wei Chang
Despite the significant advancements in keyphrase extraction and keyphrase generation methods, the predominant approach for evaluation mainly relies on exact matching with human references.
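The exact-matching evaluation this paper argues against can be sketched as below: a predicted keyphrase only counts if it string-matches a human reference after light normalization. The `exact_match_f1` helper name is illustrative, not from the paper.

```python
def exact_match_f1(predicted, references):
    """Precision/recall/F1 of predicted keyphrases under exact matching."""
    norm = lambda s: " ".join(s.lower().split())
    pred = {norm(p) for p in predicted}
    ref = {norm(r) for r in references}
    tp = len(pred & ref)  # true positives: exact string matches only
    precision = tp / len(pred) if pred else 0.0
    recall = tp / len(ref) if ref else 0.0
    f1 = 2 * precision * recall / (precision + recall) if tp else 0.0
    return precision, recall, f1

p, r, f1 = exact_match_f1(
    ["neural networks", "Language Model", "semantics"],
    ["neural networks", "language model", "syntax"],
)
# "semantics" vs. "syntax" illustrates the rigidity: a near-miss in
# meaning scores exactly the same as a completely wrong phrase -- zero.
```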
no code implementations • CVPR 2023 • Da Yin, Feng Gao, Govind Thattai, Michael Johnston, Kai-Wei Chang
A key goal for the advancement of AI is to develop technologies that serve the needs not just of one group but of all communities regardless of their geographical region.
1 code implementation • 27 Oct 2022 • Hritik Bansal, Da Yin, Masoud Monajatipoor, Kai-Wei Chang
To this end, we introduce an Ethical NaTural Language Interventions in Text-to-Image GENeration (ENTIGEN) benchmark dataset to evaluate the change in image generations conditional on ethical interventions across three social axes -- gender, skin color, and culture.
Cultural Vocal Bursts Intensity Prediction • Text-to-Image Generation
2 code implementations • 13 Oct 2022 • Ming Zhong, Yang Liu, Da Yin, Yuning Mao, Yizhu Jiao, Pengfei Liu, Chenguang Zhu, Heng Ji, Jiawei Han
We re-frame NLG evaluation as a Boolean Question Answering (QA) task, and by guiding the model with different questions, we can use one evaluator to evaluate from multiple dimensions.
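The Boolean QA framing can be sketched as follows: one evaluator, many dimensions, each selected purely by the question asked. The dimension questions are paraphrased and `answer_yes_probability` is a stub standing in for the trained yes/no evaluator model — both are assumptions of this sketch.

```python
# One yes/no question per evaluation dimension (illustrative wording).
DIMENSION_QUESTIONS = {
    "coherence": "Is this a coherent summary of the document?",
    "consistency": "Is this summary consistent with the document?",
    "fluency": "Is this a fluent paragraph?",
}

def answer_yes_probability(question: str, context: str, output: str) -> float:
    # Stub: a real implementation would query a trained Boolean QA
    # model for p("yes" | question, context, output).
    return 0.5

def evaluate(document: str, summary: str) -> dict:
    """Score one summary along several dimensions with a single
    evaluator, just by varying the question."""
    return {
        dim: answer_yes_probability(q, document, summary)
        for dim, q in DIMENSION_QUESTIONS.items()
    }
```

Adding a new evaluation dimension then only requires writing a new question, not training a new metric.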
1 code implementation • 24 May 2022 • Da Yin, Hritik Bansal, Masoud Monajatipoor, Liunian Harold Li, Kai-Wei Chang
In this paper, we introduce a benchmark dataset, Geo-Diverse Commonsense Multilingual Language Models Analysis (GeoMLAMA), for probing the diversity of the relational knowledge in multilingual PLMs.
1 code implementation • ACL 2022 • Xiao Liu, Da Yin, Yansong Feng, Dongyan Zhao
We probe PLMs and models with visual signals, including vision-language pretrained models and image synthesis models, on this benchmark, and find that image synthesis models are more capable of learning accurate and consistent spatial knowledge than other models.
no code implementations • 17 Feb 2022 • Da Yin, Li Dong, Hao Cheng, Xiaodong Liu, Kai-Wei Chang, Furu Wei, Jianfeng Gao
With the increase in model capacity brought by pre-trained language models, there are growing needs for more knowledgeable natural language processing (NLP) models with advanced functionalities, including providing and making flexible use of encyclopedic and commonsense knowledge.
1 code implementation • 8 Nov 2021 • Qinkai Zheng, Xu Zou, Yuxiao Dong, Yukuo Cen, Da Yin, Jiarong Xu, Yang Yang, Jie Tang
To bridge this gap, we present the Graph Robustness Benchmark (GRB) with the goal of providing a scalable, unified, modular, and reproducible evaluation for the adversarial robustness of GML models.
1 code implementation • EMNLP 2021 • Da Yin, Liunian Harold Li, Ziniu Hu, Nanyun Peng, Kai-Wei Chang
Commonsense is defined as the knowledge that is shared by everyone.
Ranked #1 on Visual Commonsense Reasoning on GD-VCR
Cultural Vocal Bursts Intensity Prediction • Visual Commonsense Reasoning
4 code implementations • NeurIPS 2021 • Ming Ding, Zhuoyi Yang, Wenyi Hong, Wendi Zheng, Chang Zhou, Da Yin, Junyang Lin, Xu Zou, Zhou Shao, Hongxia Yang, Jie Tang
Text-to-Image generation in the general domain has long been an open problem, which requires both a powerful generative model and cross-modal understanding.
Ranked #56 on Text-to-Image Generation on MS COCO (using extra training data)
1 code implementation • NAACL 2021 • Xiao Liu, Da Yin, Yansong Feng, Yuting Wu, Dongyan Zhao
Causal inference is the process of capturing cause-effect relationships among variables.
1 code implementation • NAACL 2021 • Ming Zhong, Da Yin, Tao Yu, Ahmad Zaidi, Mutethia Mutuma, Rahul Jha, Ahmed Hassan Awadallah, Asli Celikyilmaz, Yang Liu, Xipeng Qiu, Dragomir Radev
As increasing numbers of meetings are recorded and transcribed, meeting summaries have become essential to remind those who may or may not have attended the meetings about the key decisions made and the tasks to be completed.
1 code implementation • 19 Mar 2021 • Xu Zou, Da Yin, Qingyang Zhong, Ming Ding, Hongxia Yang, Zhilin Yang, Jie Tang
To tackle this challenge, we propose an innovative method, inverse prompting, to better control text generation.
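The core idea of inverse prompting — prefer generations from which the original prompt is most recoverable — can be sketched as a reranking rule. Here `prompt_log_likelihood` is a stand-in for a language model scoring log p(prompt | candidate); the toy word-overlap score below is an assumption of this sketch, not the paper's method.

```python
def prompt_log_likelihood(prompt: str, candidate: str) -> float:
    # Stub scorer: fraction of prompt words echoed in the candidate.
    # A real implementation would use an LM to score the prompt
    # conditioned on the generated text.
    prompt_words = set(prompt.lower().split())
    cand_words = set(candidate.lower().split())
    return len(prompt_words & cand_words) / max(len(prompt_words), 1)

def inverse_prompt_rerank(prompt, candidates):
    """Return candidates sorted so the generation from which the
    prompt is most recoverable comes first."""
    return sorted(
        candidates,
        key=lambda c: prompt_log_likelihood(prompt, c),
        reverse=True,
    )
```

Reranking (or beam scoring) with this inverse objective keeps generations on topic without constraining the decoder directly.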
1 code implementation • 3 Mar 2021 • Xiao Liu, Da Yin, Jingnan Zheng, Xingjian Zhang, Peng Zhang, Hongxia Yang, Yuxiao Dong, Jie Tang
Academic knowledge services have substantially facilitated the development of the science enterprise by providing a wealth of efficient research tools.
no code implementations • ACL 2020 • Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh, Kai-Wei Chang
Pre-trained visually grounded language models such as ViLBERT, LXMERT, and UNITER have achieved significant performance improvements on vision-and-language tasks, but what they learn during pre-training remains unclear.
2 code implementations • ACL 2020 • Da Yin, Tao Meng, Kai-Wei Chang
We propose SentiBERT, a variant of BERT that effectively captures compositional sentiment semantics.
7 code implementations • 9 Aug 2019 • Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh, Kai-Wei Chang
We propose VisualBERT, a simple and flexible framework for modeling a broad range of vision-and-language tasks.
Ranked #1 on Visual Reasoning on NLVR
no code implementations • WS 2019 • Da Yin, Xiao Liu, Xiuyu Wu, Baobao Chang
In this paper, we propose a soft label approach to the target-level sentiment classification task, in which a history-based soft labeling model measures the probability of a context word being an opinion word.
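The soft-label view can be illustrated with a toy example: rather than a hard opinion/not-opinion tag, each context word receives a probability of being an opinion word, e.g. by normalizing per-word scores with a softmax. The scores below are made-up numbers; the paper derives them from a history-based model.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    z = sum(exps)
    return [e / z for e in exps]

words = ["the", "food", "was", "wonderful"]
scores = [0.1, 0.3, 0.2, 2.5]  # toy per-word opinion scores
soft_labels = dict(zip(words, softmax(scores)))
# "wonderful" receives the largest share of the probability mass,
# but no word is forced to a hard 0/1 label.
```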