Search code examples
pythonunicodeutf-8encodedecode

Convert from hex string to unicode


How can i convert the 'dead' string to an unicode string u'\xde\xad'?

Doing this:

from binascii import unhexlify
out = ''.join(x for x in [unhexlify('de'), unhexlify('ad')])

creates a <type 'str'> string '\xde\xad'

Trying to use the Unicode.join() like this:

from binascii import unhexlify
out = ''.join(x for x in [u'', unhexlify('de'), unhexlify('ad')])

results in an error:

UnicodeDecodeError: 'ascii' codec can't decode byte 0xde in position 0: ordinal not in range(128)

Solution

  • Unicode is designed to be compatible with Latin-1, you can use that and simply decode the bytestring:

    In [2]: unhexlify('dead').decode('latin1')
    Out[2]: u'\xde\xad'