Are you still on track!? Catching LLM Task Drift with Activations
Sahar Abdelnabi, Aideen Fay, Giovanni Cherubin, Ahmed Salem, Mario Fritz, Andrew Paverd. Arxiv'24
[Paper] [Code]You can also find my articles on my Google Scholar profile.
Sahar Abdelnabi, Aideen Fay, Giovanni Cherubin, Ahmed Salem, Mario Fritz, Andrew Paverd. Arxiv'24
[Paper] [Code]Ivaxi Sheth, Sahar Abdelnabi, Mario Fritz. Arxiv'24
[Paper]Egor Zverev, Sahar Abdelnabi, Soroush Tabesh, Mario Fritz, Christoph H. Lampert. Arxiv'24
[Paper]Sahar Abdelnabi, Amr Gomaa, Sarath Sivaprasad, Lea Schönherr, Mario Fritz. Arxiv'23
[Paper] [Code]Kai Greshake*, Sahar Abdelnabi*, Shailesh Mishra, Christoph Endres, Thorsten Holz, Mario Fritz. Arxiv'23
[Paper] [Code]Sahar Abdelnabi and Mario Fritz. USENIX Security'23
[Paper] [Code]Sahar Abdelnabi, Rakibul Hasan, and Mario Fritz. CVPR'22
[Paper] [Video] [Code] [Page]Ning Yu*, Vladislav Skripniuk*, Sahar Abdelnabi, and Mario Fritz. ICCV'21 (Oral)
[Paper] [Video] [Code]Sahar Abdelnabi and Mario Fritz. Moving Target Defense Workshop, in conjunction with CCS'21
[Paper] [Code]Sahar Abdelnabi and Mario Fritz. S&P'21
[Paper] [Video] [Short Video] [Code]