python pandas pandas-groupby pandas-datareader

Group a dataset into a list of lists based on a repeated column value in pandas

I have a dataframe imported with pandas from excel with a format like this:

import pandas as pd

df = pd.read_excel('excel_file.xlsx')

data = pd.DataFrame(df, columns=['A', 'B', 'C', 'D', 'E'])


A    B   C    D   E
12  bob  32  abc 123
12  jan  34  fbc  23
14  jan  32  ac  133
12  cat  32  abc 123

I would like to group the rows by column B, so that the output looks like this:

list[0] = [[12  bob  32  abc 123]]
list[1] = [[12  jan  34  fbc  23][14  jan  32  ac  133]]
list[2] = [[12  cat  32  abc 123]]

I've tried using the duplicated function, with no success.

Thank you very much!!

Solution

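Group by column B and convert each group to a list of rows. Passing sort=False keeps the groups in the order their keys first appear, instead of sorting them alphabetically: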

# each group d is a sub-DataFrame; the group key is not needed here
lst = [d.values.tolist() for _, d in df.groupby('B', sort=False)]

# check
for rows in lst:
    print(rows)

Output:

[[12, 'bob', 32, 'abc', 123]]
[[12, 'jan', 34, 'fbc', 23], [14, 'jan', 32, 'ac', 133]]
[[12, 'cat', 32, 'abc', 123]]
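
For anyone who wants to try this without the Excel file, here is a minimal self-contained sketch. It assumes the four rows from the question are rebuilt by hand as a DataFrame; the `rows_by_b` dictionary is a hypothetical extra, not part of the original answer, showing how to keep each group reachable by its value in column B.

```python
import pandas as pd

# Sample data copied from the question (stands in for the Excel file).
df = pd.DataFrame(
    {
        "A": [12, 12, 14, 12],
        "B": ["bob", "jan", "jan", "cat"],
        "C": [32, 34, 32, 32],
        "D": ["abc", "fbc", "ac", "abc"],
        "E": [123, 23, 133, 123],
    }
)

# sort=False keeps the groups in order of first appearance in column B.
lst = [group.values.tolist() for _, group in df.groupby("B", sort=False)]

# Hypothetical variant: key each group's rows by its value in column B.
rows_by_b = {
    key: group.values.tolist() for key, group in df.groupby("B", sort=False)
}

for rows in lst:
    print(rows)
```

With the default sort=True the groups would come back in alphabetical key order (bob, cat, jan), so sort=False matters whenever the original row order should decide the list positions.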
