Search code examples
pythontypecheckingpymssql

Skip keys without Type checking in Python (pymssql)


I need to access all the non-integer keys for a dict that looks like:

result = { 
           0 : "value 1",
           1 : "value 2",
           "key 1" : "value 1",
           "key 2" : "value 2", 
         }

I am currently doing this by:

headers = [header for header in tmp_dict.keys() if not isinstance(header, int)]

My question:

  1. Is there a way to do this without type checking?
  2. This tmp_dict is coming out of a query using pymssql with the as_dict=True attribute, and for some reason it returns all the column names with data as expected, but also includes the same data indexed by integers. How can I get my query result as a dictionary with only the column values and data?

Thanks for your help!

PS - Despite my issues being resolved by potentially answering 2, I'm curious how this can be done without type checking. Mainly for the people who say "never do type checking, ever."


Solution

  • With regard to your question about type checking, the duck-type approach would be to see whether it can be converted to or used as an int.

    def can_be_int(obj):
        try:
            int(obj)
        except (TypeError, ValueError):
            return False
        return True
    
    headers = [header for header in tmp_dict.keys() if not can_be_int(header)]
    

    Note that floats can be converted to ints by truncating them, so this isn't necessarily exactly equivalent.

    A slight variation on the above would be to use coerce(0, obj) in place of int(obj). This will allow any kind of object that can be converted to a common type with an integer. You could also do something like 0 + obj and 1 * obj which will check for something that can be used in a mathematical expression with integers.

    You could also check to see whether its string representation is all digits:

    headers = [header for header in tmp_dict.keys() if not str(header).isdigit()]
    

    This is probably closer to a solution that doesn't use type-checking, although it will be slower, and it's of course entirely possible that a column name would be a string that is only digits! (Which would fail with many of these approaches, to be honest.)

    Sometimes explicit type-checking really is the best choice, which is why the language has tools for letting you check types. In this situation I think you're fine, especially since the result dictionary is documented to have only integers and strings as keys. And you're doing it the right way by using isinstance() rather than explicitly checking type() == int.