Search code examples
machine-learningdata-visualizationeuclidean-distancedimension-reduction

Why tsne method use Euclidean distance to compute the similarities in high dimensional data?


I have tried other distance metrics like chebychev distance or Manhatten distance and so on, which are all implemented in tsne in Matlab. Some of them achieve the same good performance as Euclidean distance metric. So I have some questions about why tsne always use the Euclidean distance to calculate the distance. Is there any advantages for this distance metric compared with other distance metrics? I hope someone could help me solve my problem. Thanks in advance!


Solution

  • TSNE always uses the Euclidean distance function to measure distances because it is the default parameter set inside the method definition. If you wish to change the distance function being used for your particular problem, the 'metric' parameter is what you need to change inside your method call.

    Here is a link that lists the different distance functions you can use as a parameter instead of Euclidean.

    Hope this helps!