Search code examples
pythoniteratorinfinite

How to create an infinite iterator to generate an incrementing alphabet pattern?


I've created a function that generates a list of alphabets incrementing continuously. A, B, C ..., Z. After Z, it goes to AA, AB, AC ...AZ. This pattern repeats. This is similar to MS Excel's column names. At the moment, this function generates a finite list of alphabets.

_column_name_generator() = ['A', 'B', ..., 'AA', 'AB', ..., 'BA', 'BB', ..., 'CV']

I can then iterate over it in conjunction with some finite list, e.g. 0-10. See my code below. What I'd like is to create a generator that will give me an infinitely long list of incrementing alphabets.

import string


def _column_name_generator():
    column_names = []
    for x in range(0, 100):
        if x < 26:
            column_names.append(string.ascii_uppercase[x % 26])
        else:
            column_names.append(column_names[x/26 - 1] + string.ascii_uppercase[x % 26])
    return column_names

container = []
for column_name, num in zip(_column_name_generator(), range(0, 10)):
    container.append(column_name + str(num))

print _column_name_generator()
print container

container = ['A0', 'B1', 'C2', 'D3', 'E4', 'F5', 'G6', 'H7', 'I8', 'J9']

Solution

  • Yield column_names's last element every time, and use itertools.count instead of range to provide infinite increase:

    import itertools
    
    def _column_name_generator():
        column_names = []
        for x in itertools.count():
            if x < 26:
                column_names.append(string.ascii_uppercase[x % 26])
            else:
                column_names.append(column_names[x/26 - 1] + string.ascii_uppercase[x % 26])
            yield column_names[-1]
    

    A better solution, altering the original code but dismissing the need in a memory consuming list of column_names, would be

    import itertools, string
    
    def _column_name_generator():
        for i in itertools.count(1):
            for p in itertools.product(string.ascii_uppercase, repeat=i):
                yield ''.join(p)
    

    it basically iterates over the product of length i the uppercase ascii letters (every sequence possible) when i is gradually increasing, starting from 1 (A, B, C).