Poisoning Web-Scale Training Datasets is Practical [Florian Tamer]
2023 Invited Talk
in
Workshop: Workshop on Decentralized and Collaborative Learning
in
Workshop: Workshop on Decentralized and Collaborative Learning
Abstract
Deep learning models are often trained on distributed, webscale datasets crawled from the internet. We introduce two new dataset poisoning attacks that intentionally introduce malicious examples to degrade a model's performance. Our attacks are immediately practical and could, today, poison 10 popular datasets. We will discuss how the attacks work; why (we think) these haven't been exploited yet; and why defending against them comes with non-negligible costs.
Video
Chat is not available.
Successful Page Load