Search Results for author: Xikang Yang

Found 1 papers, 1 papers with code

Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM

1 code implementation9 May 2024 Xikang Yang, Xuehai Tang, Songlin Hu, Jizhong Han

CoA is a semantic-driven contextual multi-turn attack method that adaptively adjusts the attack policy through contextual feedback and semantic relevance during multi-turn of dialogue with a large model, resulting in the model producing unreasonable or harmful content.

Cannot find the paper you are looking for? You can Submit a new open access paper.