Search Results for author: Ruoxi Cheng

Found 2 papers, 0 papers with code

Identity Inference from CLIP Models using Only Textual Data

no code implementations23 May 2024 Songze Li, Ruoxi Cheng, Xiaojun Jia

To address these challenges, we propose a textual unimodal detector (TUNI) in CLIP models, a novel method for ID inference that 1) queries the target model with only text data; and 2) does not require training shadow models.

RLRF:Reinforcement Learning from Reflection through Debates as Feedback for Bias Mitigation in LLMs

no code implementations15 Apr 2024 Ruoxi Cheng, Haoxuan Ma, Shuirong Cao, Tianyu Shi

Biases and stereotypes in Large Language Models (LLMs) can have negative implications for user experience and societal outcomes.

Bias Detection Logical Reasoning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.