Grouping a CSV file based on a column in pandas

I have a file in which each row holds specific data.

The expected output is:

Condition 1: wherever source is 'a', its column headers need to be renamed with the prefix 'seed_'.
Condition 2: the bundle id can be used for grouping.

Is this doable in pandas?

Solution

You can use boolean indexing to split the DataFrame into the rows where source is 'a' and all other rows, then merge the two parts on 'bundle id':

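# True for the seed rows (source == 'a')
m = df['source'] == 'a'
# Pair each seed row with the other rows that share its bundle id
out = df[m].drop(columns='source').merge(df[~m], on='bundle id', suffixes=('_seed', '_comp'))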

Output:

>>> out
  name_seed  bundle id  price_seed  name_comp  price_comp source
0    iphone        123         999  iphone 12         950      b
1    iphone        123         999  iphone 13         975      c
2     apple        345         100    Apple 1          99      c
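
The merge above marks the seed columns with a suffix, while the question asked for a 'seed_' prefix. A minimal sketch of one way to get that, assuming the input looks like the frame below (reconstructed from the output above, since the original data is not shown). The names key, seed and out_prefixed are illustrative only:

```python
import pandas as pd

# Hypothetical input, reconstructed from the merged output shown above
df = pd.DataFrame({
    "name": ["iphone", "iphone 12", "iphone 13", "apple", "Apple 1"],
    "bundle id": [123, 123, 123, 345, 345],
    "price": [999, 950, 975, 100, 99],
    "source": ["a", "b", "c", "a", "c"],
})

key = "bundle id"
m = df["source"] == "a"  # True for the seed rows

# Seed rows: drop 'source', then prefix every column except the join key
seed = df[m].drop(columns="source")
seed = seed.rename(columns={c: f"seed_{c}" for c in seed.columns if c != key})

# The remaining rows keep their original column names, so no suffixes are needed
out_prefixed = seed.merge(df[~m], on=key)
print(out_prefixed)
```

Renaming before the merge means the column names no longer collide, so the suffixes argument can be left out. The default inner join also drops any bundle id that has no row with source 'a'.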
