Search code examples
pythonpandascountunique

count the number of each name and drop names that are repeated less than 2 times


in this dataset

data = pd.DataFrame({'name':["a","c","d","b","a","b","c","a","c","d","b","n",
                         "m""b","b","c","a","c","d","b","a","b","b","b","c",
                         "a","c","d","b","a","b","b","b","c","a","c","d","b","a","b","b","b","c"]})

I want to count the number of each name and drop names that are repeated less than 2 times.


Solution

  • One approach is using filters:

    data.groupby('name').filter(lambda x : len(x)>1)