Can we just acknowledge that most of the time our data is a dumpster fire, and no duplicate detection strategy is gonna save us from ourselves? https://www.reddit.com/user/BgA_stan