Search code examples
google-cloud-platformautoml

google cloud plateform Auto ML


Hi guys i'm trying to import a table in "AutoML Tables" , just for a test I created a table with random numbers so don't mind it. Everytime I test to try to import a table it seems not to work It doesn't seems anything to be wrong in my CSV. Only alphanumeric character, no empty cell, just numeric data for the test 3 columns, 300 rows ... But no it isn't working and its telling me :

Error Messages: Invalid column names:

My columns name are : rooms, or , price . I don't know what could be wrong with those names...

thank you for your help

enter image description here

I tried with and without the quote mark

"id","rooms","OR","price","space","toilets"
0,5,8,200,200,1
1,5,8,200,200,1
2,5,8,200,200,1
3,5,8,200,200,1
4,5,8,200,200,1
5,5,8,200,200,1
6,5,8,200,200,1
7,5,8,200,200,1
8,5,8,200,200,1
9,5,8,200,200,1
10,5,8,200,200,1
11,5,8,200,200,1
12,7,9,300,400,2
13,7,9,300,400,2
14,7,9,300,400,2
15,7,9,300,400,2

Solution

  • I recreated the issue when the CSV was created with an index column.

    import pandas as pd
    
    columns = ["id","rooms","OR","price","space","toilets"]
    data = [
        [0,5,8,200,200,1],
        [1,5,8,200,200,1],
        [2,5,8,200,200,1],
        [3,5,8,200,200,1],
        [4,5,8,200,200,1],
        [5,5,8,200,200,1],
        [6,5,8,200,200,1],
        [7,5,8,200,200,1],
        [8,5,8,200,200,1],
        [9,5,8,200,200,1],
        [10,5,8,200,200,1],
        [11,5,8,200,200,1],
        [12,7,9,300,400,2],
        [13,7,9,300,400,2],
        [14,7,9,300,400,2],
        [15,7,9,300,400,2]
    ]
    
    df = pd.DataFrame(data=data, columns=columns)
    # resampled the data to avoid AutoMLTables error: 
    # Too few rows: 16. Minimum number is: 1000
    df = df.sample(1000, replace=True)
    df.to_csv('/your/data/path/here', index=True)
    

    But if I set index to False and recreated the file the import succeeds.

    It's also possible that an earlier version of AutoMLTables didn't like that you used a reserved keyword, id, as a column name but this is unlikely.