Search code examples
algorithmhashcollisionprobabilitycrc

Probability of collision when using a 32-bit hash


I have a 10-character string key field in a database. I've used CRC32 to hash this field, but I'm worrying about duplicates. Could somebody show me the probability of collision in this situation?

P.S.: My string field is unique in the database. If the number of string fields is 1 million, what is the probability of a collision?


Solution

  • Duplicate of Expected collisions for perfect 32bit crc

    The answer referenced this article: http://arstechnica.com/civis/viewtopic.php?f=20&t=149670

    Found the image below from: http://preshing.com/20110504/hash-collision-probabilities

    enter image description here