Search code examples
c++stringsplitstdtokenize

C++ Tokenize String


I'm looking for a simple way to tokenize std::string input without using non default libraries such as Boost, etc.

For example, if the user enters forty_five, I would like to separate 'forty' and 'five' using the '_' as the delimiter.


Solution

  • To convert a string to a vector of tokens (thread safe):

    std::vector<std::string> inline StringSplit(const std::string &source, const char *delimiter = " ", bool keepEmpty = false)
    {
        std::vector<std::string> results;
    
        size_t prev = 0;
        size_t next = 0;
    
        while ((next = source.find_first_of(delimiter, prev)) != std::string::npos)
        {
            if (keepEmpty || (next - prev != 0))
            {
                results.push_back(source.substr(prev, next - prev));
            }
            prev = next + 1;
        }
    
        if (prev < source.size())
        {
            results.push_back(source.substr(prev));
        }
    
        return results;
    }