Search code examples
c#.nethashcodegethashcode

General advice and guidelines on how to properly override object.GetHashCode()


According to MSDN, a hash function must have the following properties:

  1. If two objects compare as equal, the GetHashCode method for each object must return the same value. However, if two objects do not compare as equal, the GetHashCode methods for the two object do not have to return different values.

  2. The GetHashCode method for an object must consistently return the same hash code as long as there is no modification to the object state that determines the return value of the object's Equals method. Note that this is true only for the current execution of an application, and that a different hash code can be returned if the application is run again.

  3. For the best performance, a hash function must generate a random distribution for all input.


I keep finding myself in the following scenario: I have created a class, implemented IEquatable<T> and overridden object.Equals(object). MSDN states that:

Types that override Equals must also override GetHashCode ; otherwise, Hashtable might not work correctly.

And then it usually stops up a bit for me. Because, how do you properly override object.GetHashCode()? Never really know where to start, and it seems to be a lot of pitfalls.

Here at StackOverflow, there are quite a few questions related to GetHashCode overriding, but most of them seems to be on quite particular cases and specific issues. So, therefore I would like to get a good compilation here. An overview with general advice and guidelines. What to do, what not to do, common pitfalls, where to start, etc.

I would like it to be especially directed at C#, but I would think it will work kind of the same way for other .NET languages as well(?).


I think maybe the best way is to create one answer per topic with a quick and short answer first (close to one-liner if at all possible), then maybe some more information and end with related questions, discussions, blog posts, etc., if there are any. I can then create one post as the accepted answer (to get it on top) with just a "table of contents". Try to keep it short and concise. And don't just link to other questions and blog posts. Try to take the essence of them and then rather link to source (especially since the source could disappear. Also, please try to edit and improve answers instead of created lots of very similar ones.

I am not a very good technical writer, but I will at least try to format answers so they look alike, create the table of contents, etc. I will also try to search up some of the related questions here at SO that answers parts of these and maybe pull out the essence of the ones I can manage. But since I am not very stable on this topic, I will try to stay away for the most part :p


Solution

  • Table of contents


    Things that I would like to be covered, but haven't been yet:

    • How to create the integer (How to "convert" an object into an int wasn't very obvious to me anyways).
    • What fields to base the hash code upon.
      • If it should only be on immutable fields, what if there are only mutable ones?
    • How to generate a good random distribution. (MSDN Property #3)
      • Part to this, seems to choose a good magic prime number (have seen 17, 23 and 397 been used), but how do you choose it, and what is it for exactly?
    • How to make sure the hash code stays the same all through the object lifetime. (MSDN Property #2)
      • Especially when the equality is based upon mutable fields. (MSDN Property #1)
    • How to deal with fields that are complex types (not among the built-in C# types).
      • Complex objects and structs, arrays, collections, lists, dictionaries, generic types, etc.
      • For example, even though the list or dictionary might be readonly, that doesn't mean the contents of it are.
    • How to deal with inherited classes.
      • Should you somehow incorporate base.GetHashCode() into your hash code?
    • Could you technically just be lazy and return 0? Would heavily break MSDN guideline number #3, but would at least make sure #1 and #2 were always true :P
    • Common pitfalls and gotchas.