Search code examples
pythonregexdjangodjango-forms

Python regex for integer?


I'm learning regex and I would like to use a regular expression in Python to define only integers - whole numbers but not decimals.

I could make one that only allows numbers by using \d, but it also allows decimal numbers, which I don't want:

price = TextField(_('Price'), [
    validators.Regexp('\d', message=_('This is not an integer number, please see the example and try again')),
    validators.Optional()]) 

How can I change the code to only allow integers?


Solution

  • Regexp work on the character base, and \d means a single digit 0...9 and not a decimal number.

    A regular expression that matches only integers with a sign could be for example

    ^[-+]?[0-9]+$
    

    meaning

    1. ^ - start of string
    2. [-+]? - an optional (this is what ? means) minus or plus sign
    3. [0-9]+ - one or more digits (the plus means "one or more" and [0-9] is another way to say \d)
    4. $ - end of string

    Note: having the sign considered part of the number is ok only if you need to parse just the number. For more general parsers handling expressions it's better to leave the sign out of the number: source streams like 3-2 could otherwise end up being parsed as a sequence of two integers instead of an integer, an operator and another integer. My experience is that negative numbers are better handled by constant folding of the unary negation operator at an higher level.