Search code examples
rlogical-operatorsboolean-operationsor-operatorand-operator

Difference between Boolean operators && and & and between || and | in R


According to the R language definition, the difference between & and && (correspondingly | and ||) is that the former is vectorized while the latter is not.

According to the help text, I read the difference akin to the difference between an "And" and "AndAlso" (correspondingly "Or" and "OrElse")... Meaning: That not all evaluations if they don't have to be (i.e. A or B or C is always true if A is true, so stop evaluating if A is true)

Could someone shed light here? Also, is there an AndAlso and OrElse in R?


Solution

  • The shorter ones are vectorized, meaning they can return a vector, like this:

    ((-2:2) >= 0) & ((-2:2) <= 0)
    # [1] FALSE FALSE  TRUE FALSE FALSE
    

    The longer form is not, and so (as of 4.3.0) must be given inputs of length 1. (Hooray! Less checking necessary, see below.)

    Until R 4.3.0, giving && inputs of length > 1 did not throw an error, but instead evaluated left to right examining only the first element of each vector, so the above gave:

    ((-2:2) >= 0) && ((-2:2) <= 0)
    # [1] FALSE
    

    As the help page says, this makes the longer form "appropriate for programming control-flow and [is] typically preferred in if clauses."

    So you want to use the long forms only when you are certain the vectors are length one, and as of 4.3.0, R enforces this.

    If you're using a previous version, you should be absolutely certain your vectors are only length 1, such as in cases where they are functions that return only length 1 booleans. You want to use the short forms if the vectors are length possibly >1. So if you're not absolutely sure, you should either check first, or use the short form and then use all and any to reduce it to length one for use in control flow statements, like if.

    The functions all and any are often used on the result of a vectorized comparison to see if all or any of the comparisons are true, respectively. The results from these functions are sure to be length 1 so they are appropriate for use in if clauses, while the results from the vectorized comparison are not. (Though those results would be appropriate for use in ifelse.)

    One final difference: the && and || only evaluate as many terms as they need to (which is often called short-circuiting). For example, here's a comparison using an undefined value a; if it didn't short-circuit, as & and | don't, it would give an error.

    a
    # Error: object 'a' not found
    TRUE || a
    # [1] TRUE
    FALSE && a
    # [1] FALSE
    TRUE | a
    # Error: object 'a' not found
    FALSE & a
    # Error: object 'a' not found
    

    Finally, see section 8.2.17 in The R Inferno, titled "and and andand".