Publications

You can also find my articles on my Google Scholar profile.

Are you still on track!? Catching LLM Task Drift with Activations

Sahar Abdelnabi, Aideen Fay, Giovanni Cherubin, Ahmed Salem, Mario Fritz, Andrew Paverd. arXiv'24

[Paper] [Code]

Hypothesizing Missing Causal Variables with LLMs

Ivaxi Sheth, Sahar Abdelnabi, Mario Fritz. arXiv'24

[Paper]

Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?

Egor Zverev, Sahar Abdelnabi, Soroush Tabesh, Mario Fritz, Christoph H. Lampert. arXiv'24

[Paper]

Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiation

Sahar Abdelnabi, Amr Gomaa, Sarath Sivaprasad, Lea Schönherr, Mario Fritz. arXiv'23

[Paper] [Code]

Not what you’ve signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

Kai Greshake*, Sahar Abdelnabi*, Shailesh Mishra, Christoph Endres, Thorsten Holz, Mario Fritz. arXiv'23

[Paper] [Code]

Fact-Saboteurs: A Taxonomy of Evidence Manipulation Attacks against Fact-Verification Systems

Sahar Abdelnabi and Mario Fritz. USENIX Security'23

[Paper] [Code]

Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources

Sahar Abdelnabi, Rakibul Hasan, and Mario Fritz. CVPR'22

[Paper] [Video] [Code] [Page]

Artificial Fingerprinting for Generative Models: Rooting Deepfake Attribution in Training Data

Ning Yu*, Vladislav Skripniuk*, Sahar Abdelnabi, and Mario Fritz. ICCV'21 (Oral)

[Paper] [Video] [Code]

What’s in the box?!: Deflecting Adversarial Attacks by Randomly Deploying Adversarially-Disjoint Models

Sahar Abdelnabi and Mario Fritz. Moving Target Defense Workshop, in conjunction with CCS'21

[Paper] [Code]

VisualPhishNet: Zero-Day Phishing Website Detection by Visual Similarity

Sahar Abdelnabi, Katharina Krombholz, and Mario Fritz. CCS'20

[Paper] [Video] [Code] [Page]