Search code examples
hashmd5checksumhash-collision

Are there any known MD5 collisions for two plain-text ascii strings (not binary data)


There's many MD5 collision examples out there on the web, but as far as I can find, they all involve binary data inputs.

Are there any two known plain-text ASCII strings that give the same MD5 hash?

Basically I'm building a de-duplication system that stores plain text files such as JSON + XML, but it skips over any files that are detected to contain binary data. And I need a way to test how the system copes with two plain text (non-binary) files/strings that give the same MD5 hash.


Solution

  • I share a sample that was posted to Twitter!

    md5("TEXTCOLLBYfGiJUETHQ4hAcKSMd5zYpgqf1YRDhkmxHkhPWptrkoyz28wnI9V0aHeAuaKnak") = md5("TEXTCOLLBYfGiJUETHQ4hEcKSMd5zYpgqf1YRDhkmxHkhPWptrkoyz28wnI9V0aHeAuaKnak")
    https://twitter.com/realhashbreaker/status/1770161965006008570