Search code examples
c++boost-spiritboost-spirit-x3

Boost.Spirit X3 Alternative Operator


I have the following code:

#include <boost/spirit/home/x3.hpp>
#include <boost/spirit/home/x3/support/ast/variant.hpp>

struct printer {
    template <typename int_type>
    void operator()(std::vector<int_type> &vec) {
        std::cout << "vec(" << sizeof(int_type) << "): { ";
        for( auto const &elem : vec ){
            std::cout << elem << ", ";
        }
        std::cout << "}\n";
    }
};

template <typename Iterator>
void parse_int_list(Iterator first, Iterator last) {
    namespace x3 = boost::spirit::x3;
    x3::variant<vector<uint32_t>, vector<uint64_t>> vecs;
    x3::parse( first, last,
            (x3::uint32 % '|') | (x3::uint64 % '|'), vecs );
    boost::apply_visitor(printer{}, vecs);
}

I expected this to first try parsing input into a 32 bit uint vector, then if that failed into a 64 bit uint vector. This works great if the first integer in the list matches a type that is large enough for anything else in the list. I.e.,

string ints32 = "1|2|3";
parse_int_list(being(ints32), end(ints32))
// prints vec(4): { 1, 2, 3, }

string ints64 = "10000000000|20000000000|30000000000";
parse_int_list(being(ints64), end(ints64))
// prints vec(8): { 10000000000, 20000000000, 30000000000, }

However it does not work when the first number is a 32 bit and a later number is a 64 bit.

string ints_mixed = "1|20000000000|30000000000";
parse_int_list(being(ints_mixed), end(ints_mixed))
// prints vec(4): { 1, }

The return value of x3::parse indicates a parse failure. But according to my read of the documentation it should try the second alternative if it can't parse the with the first.

Any pointers on how I'm reading this incorrectly, and how the alternative parser actually works?

Edit: After seeing the responses, I realized that x3::parse was actually returning a parse success. I was checking that it had parsed the entire stream, first == last, to determine success, as demonstrated in the documentation. However, this hides the fact that due to the greedy nature of klean star and not anchoring to the end of stream, it was successfully able to parse a portion of the input. Thanks all.


Solution

  • The issue here is that "3" is a valid input for the (x3::uint32 % '|') parser, so the first branch of the alternative passes, consuming only the 3.

    The cleanest way for you to fix this would be to have a list of alternatives instead of an alternative of lists.

    i.e.:

    (x3::uint32 | x3::uint64) % '|'
    

    However, that would mean you would have to parse in a different structure.

    vector<x3::variant<uint32_t,uint64_t>> vecs;
    

    Edit:

    Alternatively, if you do not intend to use this parser as a sub-parser, you can force a end-of-input in each branch.

    (x3::uint32 % '|' >> x3::eoi) | (x3::uint64 % '|' >> x3::eoi)
    

    This would force the first branch to fail if it does not reach the end of the stream, dropping into the alternative.