Search Results for author: Ruoxi Cheng

Found 2 papers, 0 papers with code

Identity Inference from CLIP Models using Only Textual Data

no code implementations • 23 May 2024 • Songze Li, Ruoxi Cheng, Xiaojun Jia

To address these challenges, we propose a textual unimodal detector (TUNI) in CLIP models, a novel method for ID inference that 1) queries the target model with only text data; and 2) does not require training shadow models.

Paper
Add Code

RLRF:Reinforcement Learning from Reflection through Debates as Feedback for Bias Mitigation in LLMs

no code implementations • 15 Apr 2024 • Ruoxi Cheng, Haoxuan Ma, Shuirong Cao, Tianyu Shi

Biases and stereotypes in Large Language Models (LLMs) can have negative implications for user experience and societal outcomes.

Bias Detection Logical Reasoning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.