Sift - Mahalanobis Distance Dialog

From Software Product Documentation
Jump to navigation Jump to search
Language:  English  • français • italiano • português • español 

Mahalanobis Distance is a common measure used to determine outliers in a data sample. The Mahalanobis distance can be conceptualized as the distance from a point to a centroid of a data set, taking into account correlations in the data set. The Mahalanobis distance method can be used on PCA results. This is done by measuring the distance of each point to the centroid in the transformed PCA space.

SPE (Squared Prediction Error) measures the distance between the model prediction and the true model measurement i.e. it is the distance between the true point and the k-dimension transformed point. If the distance between these points exceeds a certain threshold it is determined an outlier, since it "Doesn't fit" the predictive model.

The Mahalanobis Distance and SPE are found on the toolbar and under 'Outlier Detecting Using PCA' in the Analysis menu.

Dialog

  • Grouping to Search: What kind of grouping is used to determine the centroid, Combined Groups, Groups, Workspaces
  • Auto-exclude results: If checked and outliers found will automatically be removed
  • Number of Passes: How many times should the test be run, removing an outlier may alter the centroid, exposing more outliers
  • Find All Outliers: If checked Number of Passes will be ignored, and the test will be run until no outliers are found
  • Determine Number of PCs Using Variance Explained: If checked PCs Variance Explained will be displayed instead of Number of PCs
  • Number of PCs: How many principal components should be considered for the test
  • PCs Variance Explained: Instead of selecting the number of PCs directly, select the amount of variance explained
  • Outlier alpha value: The threshold used to determine an outlier

SPE

Since Squared Prediction Error compares a single predictive point to its original value, many of the parameters in the dialog do not apply, the only parameters of note for SPE are:

  • Auto-exclude results
  • Number of PCs / PCs Variance Explained
  • Outlier alpha value

Results

The Mahalanobis Distance and SPE results appear upon completion of the test.

Retrieved from ""