cluster-analysis k-means spss statistics

Save Cluster Variables / Variable PSPP

I am using PSPP (not SPSS, since I can't get that running on my Ubuntu machine) to cluster my set of ~100k records with k-means. What I really need is more detailed output than just how many records are in each cluster: I need the cluster assignment saved for each row, i.e.

row 1 => cluster 1

row 2 => cluster 4

row 3 => cluster 1

etc...

Essentially I need an extra field that stores the resulting cluster membership of each record. My current syntax is:

QUICK CLUSTER  cat1 cat2 cat3 cat4 cat5 cat6 cat7 cat8 cat9 cat10 cat11 cat12
/CRITERIA=CLUSTERS(12) MXITER(100000000).

SPSS and PSPP share a lot of the same syntax, so if there is an option in SPSS it might work here too.

Solution

SPSS Statistics should run on Ubuntu. Either way, the Statistics QUICK CLUSTER command has a subcommand

/SAVE CLUSTER

that should do what you want. You can optionally specify a variable name in parentheses after CLUSTER.
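As a rough sketch, the subcommand could be added to the syntax from the question like this. The variable name cluster_id is only an illustrative choice, the smaller MXITER value is used purely for the example, and whether SAVE is accepted depends on your PSPP version, so treat this as something to try rather than a guaranteed fix:

```
* Sketch only: cluster_id is a hypothetical variable name.
* Assumes your PSPP build accepts the SAVE subcommand as SPSS Statistics does.
QUICK CLUSTER cat1 cat2 cat3 cat4 cat5 cat6 cat7 cat8 cat9 cat10 cat11 cat12
  /CRITERIA=CLUSTERS(12) MXITER(100)
  /SAVE CLUSTER(cluster_id).

* Check the saved memberships: one value per row, counted per cluster.
FREQUENCIES VARIABLES=cluster_id.
```

If it runs, each record in the active dataset should get a new column holding its cluster number, which is the row-to-cluster mapping the question asks for.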
