MLSys Poisoning Web-Scale Training Datasets is Practical [Florian Tamer]

Invited Talk in Workshop: Workshop on Decentralized and Collaborative Learning

2023 Invited Talk

Abstract:

Deep learning models are often trained on distributed, web-scale datasets crawled from the internet. We introduce two new dataset poisoning attacks that intentionally introduce malicious examples to degrade a model's performance. Our attacks are immediately practical and could, today, poison 10 popular datasets. We will discuss how the attacks work, why (we think) they haven't been exploited yet, and why defending against them comes with non-negligible costs.
