
Scatter matrix

A scatter matrix, also known as a scatterplot matrix or SPLOM, is a graphical tool used in exploratory data analysis (EDA) to visualize the pairwise relationships between multiple variables in a multivariate dataset. It consists of a square grid of small scatter plots, where the rows and columns correspond to the variables, and each off-diagonal cell displays a scatter plot of one variable against another, revealing patterns such as correlations, clusters, or outliers. The diagonal cells typically show univariate representations of each variable, such as histograms, density plots, or box plots, to summarize marginal distributions. Introduced as part of multivariate visualization techniques in the late 1970s and early 1980s, the scatter matrix helps assess linear and nonlinear dependencies across variables simultaneously, aiding in data understanding before more advanced modeling. Common in statistical software like R (via the pairs() function), Python (via seaborn.pairplot() or pandas.plotting.scatter_matrix()), and tools like JMP, it is particularly useful for datasets with 3 to 10 continuous variables, though larger sets may require enhancements like smoothing or color coding for clarity. Limitations include overplotting in dense data and challenges in interpreting higher dimensions; it is often complemented by other plots, such as heatmaps of correlation matrices.
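
As a minimal illustration of the plotting functions named above, the following Python sketch draws a scatterplot matrix for the Iris measurements; the choice of dataset, transparency, and diagonal display are illustrative, not requirements.

```python
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.datasets import load_iris

# Load a small multivariate dataset (4 continuous variables).
iris = load_iris(as_frame=True)
df = iris.data

# Off-diagonal cells: pairwise scatter plots; diagonal: histograms.
pd.plotting.scatter_matrix(df, alpha=0.6, figsize=(8, 8), diagonal="hist")
plt.suptitle("Scatterplot matrix of the Iris measurements")
plt.show()
```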

Fundamentals

Definition

In multivariate statistics, the scatter matrix is a positive definite matrix that describes the dispersion and dependence structure of a p-dimensional random vector, serving as a generalization of the variance and covariance matrices to capture second-order characteristics of the distribution. Unlike the standard covariance matrix, which relies on second moments and is sensitive to outliers, the scatter matrix is often defined in the context of robust statistics to provide consistent estimates under contamination or elliptical distributions. The concept arose in robust multivariate analysis, with early contributions including the M-estimators of Maronna in 1976, which minimize a robust loss functional to estimate the scatter matrix under elliptical models. High-breakdown estimators, such as the Minimum Covariance Determinant (MCD) introduced by Rousseeuw in 1984, achieve up to 50% breakdown points for outlier detection. These developments addressed the limitations of classical covariance estimation, emphasizing properties like affine equivariance to ensure consistency. A key distinguishing feature is affine equivariance: for any nonsingular p × p matrix A and vector b, the scatter functional satisfies S(A X + b) = A S(X) A^T, where X is the random vector. This property ensures that the matrix transforms appropriately under linear changes of variables, preserving the geometric shape and orientation of the data cloud. In elliptical distributions, the scatter matrix Σ parameterizes the shape and orientation, with density contours forming ellipsoids centered at the location μ, and relates to the covariance matrix by Cov(X) = c Σ for some scalar c > 0 (e.g., c = 1 for the multivariate normal distribution).
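
A quick numerical check of the equivariance identity, using the sample covariance as the scatter functional; the random data and the transform A below are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))            # n = 500 observations in p = 3 dimensions
A = np.array([[2.0, 0.5, 0.0],
              [0.0, 1.0, -1.0],
              [1.0, 0.0, 3.0]])          # nonsingular 3x3 transform
b = np.array([1.0, -2.0, 0.5])

S = np.cov(X, rowvar=False)              # S(X)
S_t = np.cov(X @ A.T + b, rowvar=False)  # S(AX + b)

# Affine equivariance: S(AX + b) = A S(X) A^T (up to floating-point error).
assert np.allclose(S_t, A @ S @ A.T)
```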

Mathematical Formulation

The scatter matrix is defined as a functional S(F) of the distribution F of the random vector X, such that S(F) is positive definite and satisfies affine equivariance. For a sample of n observations represented by an n × p data matrix X, robust estimators of the scatter matrix replace the classical covariance formula to achieve robustness. The classical sample covariance matrix, a specific scatter matrix, is given by \mathbf{S} = \frac{1}{n-1} \sum_{k=1}^n (\mathbf{x}_k - \bar{\mathbf{x}})(\mathbf{x}_k - \bar{\mathbf{x}})^T, where \bar{\mathbf{x}} is the sample mean vector. This matrix is symmetric and positive semi-definite, with diagonal elements as variances and off-diagonals as covariances. In general, scatter matrices are defined via M-functionals: S = E[ w(r) (X - T(X))(X - T(X))^T ], where T is a location functional, r is the robust distance given by r^2 = (X - T)^T S^{-1} (X - T), and w is a weight function tuned for robustness. For example, Tyler's M-estimator solves a fixed-point equation to yield a consistent, affine equivariant estimate under elliptical assumptions. The population scatter matrix Σ has elements Σ_{ij} reflecting the expected joint variability, generalizing Cov(X_i, X_j) = E[(X_i - μ_i)(X_j - μ_j)]. Under regularity conditions, robust sample scatter matrices converge to Σ, providing a measure of the data's second-moment structure while mitigating the influence of outliers.
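
Tyler's fixed-point iteration is compact enough to sketch directly. The version below assumes the location is known (here the coordinatewise median, a simplifying choice) and uses the tr(S) = p normalization, one common scale convention.

```python
import numpy as np

def tyler_scatter(X, loc, max_iter=100, tol=1e-8):
    """Tyler's M-estimator of scatter via fixed-point iteration.
    Assumes the location `loc` is known; returns S normalized to tr(S) = p."""
    n, p = X.shape
    D = X - loc
    S = np.eye(p)
    for _ in range(max_iter):
        Sinv = np.linalg.inv(S)
        # Squared distances r_i^2 = d_i^T S^{-1} d_i for each observation.
        r2 = np.einsum("ij,jk,ik->i", D, Sinv, D)
        # Fixed-point update: S <- (p/n) * sum_i d_i d_i^T / r_i^2.
        S_new = (p / n) * (D / r2[:, None]).T @ D
        S_new *= p / np.trace(S_new)  # enforce the scale convention tr(S) = p
        if np.linalg.norm(S_new - S, "fro") < tol:
            return S_new
        S = S_new
    return S

# Heavy-tailed example: t(2) data, where second moments do not exist.
rng = np.random.default_rng(1)
X = rng.standard_t(df=2, size=(1000, 3))
S_hat = tyler_scatter(X, loc=np.median(X, axis=0))
```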

Construction and Visualization

Building the Matrix

In practice, the scatter matrix is typically estimated from a sample of n observations of a p-dimensional random vector \mathbf{X}. The classical choice is the sample covariance matrix, given by \hat{\Sigma} = \frac{1}{n-1} \sum_{i=1}^n (\mathbf{x}_i - \bar{\mathbf{x}})(\mathbf{x}_i - \bar{\mathbf{x}})^T, where \bar{\mathbf{x}} is the sample mean; this estimator assumes finite second moments and is sensitive to outliers. For robust estimation, the M-estimators proposed by Maronna (1976) solve a minimization problem to obtain a consistent estimate under elliptical distributions. Specifically, such an estimator of scatter \hat{S} minimizes \det(\hat{S}) subject to \frac{1}{n} \sum_{i=1}^n \rho\left( \frac{(\mathbf{x}_i - \hat{\boldsymbol{\mu}})^T \hat{S}^{-1} (\mathbf{x}_i - \hat{\boldsymbol{\mu}})}{p} \right) = b, where \rho is a bounded loss function (e.g., Huber's), \hat{\boldsymbol{\mu}} is a location estimate, and b is a tuning constant for consistency at the normal distribution. This yields a positive definite estimate that is affine equivariant under linear transformations. High-breakdown alternatives, such as the Minimum Covariance Determinant (MCD) estimator introduced by Rousseeuw (1984), select the subset of h observations (e.g., h = \lfloor (n+p+1)/2 \rfloor for 50% breakdown) whose sample covariance matrix has the smallest determinant, providing robustness against up to nearly half the data being outliers. The raw MCD scatter is then corrected for consistency. Computationally, these estimators involve combinatorial or iterative optimization, with fast algorithms like concentration steps reducing complexity from O(n^2) to near-linear time for moderate dimensions. Prior to estimation, data centering at a robust location (e.g., a median-based or MCD location estimate) is often applied, and for standardized scale, the matrix can be normalized such that \det(\hat{S}) = 1 or \operatorname{tr}(\hat{S}) = p.
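
An illustrative sketch of MCD estimation using scikit-learn's MinCovDet, which implements the FAST-MCD concentration-step algorithm and applies the consistency correction; the contamination scenario below is fabricated purely for illustration.

```python
import numpy as np
from sklearn.covariance import EmpiricalCovariance, MinCovDet

rng = np.random.default_rng(2)
# Clean bivariate data plus 10% gross outliers (illustrative contamination).
X_clean = rng.multivariate_normal([0, 0], [[1.0, 0.8], [0.8, 1.0]], size=450)
X_out = rng.uniform(-10, 10, size=(50, 2))
X = np.vstack([X_clean, X_out])

mcd = MinCovDet(support_fraction=0.75, random_state=0).fit(X)
emp = EmpiricalCovariance().fit(X)

print("robust MCD scatter:\n", mcd.covariance_)    # close to the true structure
print("classical covariance:\n", emp.covariance_)  # inflated by the outliers
```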

Handling Multiple Variables

In high-dimensional settings where p approaches or exceeds n, scatter matrix estimators face singularity (rank at most n-1) and inflated variance, violating assumptions of many classical methods. Robust high-dimensional approaches include shrinkage estimators, such as Ledoit-Wolf, which regularize toward a target matrix (e.g., a scaled identity) via \hat{\Sigma}_{shrink} = (1 - \alpha) \hat{\Sigma} + \alpha \frac{\operatorname{tr}(\hat{\Sigma})}{p} I_p, with \alpha chosen to minimize expected mean squared error. For visualization in low dimensions (p \leq 3), the scatter matrix defines confidence regions as ellipsoids: the set \{ \mathbf{x} : (\mathbf{x} - \hat{\boldsymbol{\mu}})^T \hat{S}^{-1} (\mathbf{x} - \hat{\boldsymbol{\mu}}) \leq \chi^2_{p,1-\alpha} \}, where \chi^2_{p,1-\alpha} is the chi-squared quantile with p degrees of freedom, plotted over data scatterplots to assess fit and outliers. In 2D, these appear as tilted ellipses oriented by the eigenvectors of \hat{S}, with semi-axis lengths scaled by the eigenvalues. For higher dimensions, analysis via the eigendecomposition \hat{S} = V \Lambda V^T visualizes principal components: scree plots of the eigenvalues \lambda_j reveal the variance structure, while biplots project data onto the top eigenvectors, overlaying the implied elliptical contours. Robust scatter estimation enables reliable visualization even with contamination, highlighting dependencies without distortion. Corrgrams or heatmaps of the correlation matrix derived from \hat{S} (via \operatorname{corr} = D^{-1/2} \hat{S} D^{-1/2}, D = \operatorname{diag}(\hat{S})) further aid interpretation.
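
The shrinkage formula and the confidence-ellipse construction can be combined in a short sketch; the 95% level and the two-dimensional example are arbitrary choices for illustration.

```python
import numpy as np
from scipy.stats import chi2
from sklearn.covariance import LedoitWolf

rng = np.random.default_rng(3)
X = rng.multivariate_normal([0, 0], [[2.0, 1.2], [1.2, 1.0]], size=200)

# Shrinkage estimate: (1 - alpha) * S_hat + alpha * (tr(S_hat)/p) * I.
lw = LedoitWolf().fit(X)
S, mu = lw.covariance_, lw.location_
print("shrinkage intensity alpha =", lw.shrinkage_)

# 95% confidence ellipse from the eigendecomposition of S:
# orientation from eigenvectors, semi-axes sqrt(lambda_j * chi2 quantile).
evals, evecs = np.linalg.eigh(S)
q = chi2.ppf(0.95, df=2)
semi_axes = np.sqrt(evals * q)
angle = np.degrees(np.arctan2(evecs[1, -1], evecs[0, -1]))
print("semi-axes:", semi_axes, "tilt (degrees):", angle)
```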

Interpretation and Analysis

Off-Diagonal Elements

The off-diagonal elements of a scatter matrix quantify the pairwise dependencies or associations between distinct variables in the multivariate random vector, generalizing the second-moment structure captured by the traditional covariance matrix. A positive off-diagonal entry indicates that the corresponding variables tend to vary in the same direction, while a negative value suggests they vary in opposite directions; the magnitude reflects the strength of this linear association. In robust scatter matrices, such as those based on M-estimators or the Minimum Covariance Determinant, these elements are designed to downweight the influence of outliers, yielding more stable estimates of dependence compared to classical covariances. Under elliptical distributions, the off-diagonal elements influence the orientation and tilting of the constant-density ellipsoids that characterize the data distribution. For deeper analysis, the scatter matrix can be standardized to a correlation-like form by dividing each off-diagonal entry by the square root of the product of the corresponding diagonal elements, allowing direct comparison of dependence strengths across pairs while preserving the matrix's positive definiteness and affine equivariance.
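
The standardization described above is a one-liner in practice; a small sketch follows, where the matrix S is fabricated for illustration.

```python
import numpy as np

def scatter_to_correlation(S):
    """Standardize a scatter matrix to correlation form:
    R = D^{-1/2} S D^{-1/2}, with D = diag(S)."""
    d = np.sqrt(np.diag(S))
    return S / np.outer(d, d)

S = np.array([[ 4.0, 2.4, -1.0],
              [ 2.4, 9.0,  0.6],
              [-1.0, 0.6,  1.0]])
R = scatter_to_correlation(S)
print(R)  # unit diagonal; off-diagonals are comparable dependence strengths
```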

Diagonal Elements

The diagonal elements of the scatter matrix represent the marginal dispersions or generalized variances for each individual variable, providing a measure of univariate scatter that contributes to the overall multivariate structure. In the classical covariance case, these equal the variances of the centered variables; in general scatter functionals, they ensure consistency with the matrix's positive definiteness and equivariance properties, often without requiring finite second moments. The trace of the scatter matrix, the sum of its diagonal elements, serves as a scalar summary of total dispersion across all dimensions, analogous to the sum of variances in the univariate case. Larger diagonal values highlight variables with greater intrinsic variability, which can guide variable selection or scaling in high-dimensional settings. The determinant of the scatter matrix, a product of its eigenvalues, quantifies the generalized volume of the data cloud, with smaller values indicating more concentrated data; this is particularly useful in robust contexts to assess multivariate dispersion in a manner resistant to outliers. A primary method for analyzing the full scatter matrix is spectral decomposition: S = V \Lambda V^T, where \Lambda is the diagonal matrix of positive eigenvalues \lambda_i (ordered decreasingly), representing the amount of scatter along the orthogonal principal directions given by the eigenvectors in V. The eigenvalues quantify relative dispersion contributions, enabling dimensionality reduction by retaining components with large \lambda_i, as in robust principal component analysis; the eigenvectors define the orientation of maximum variance axes, revealing the data's underlying shape and correlations. This decomposition maintains affine equivariance and is central to interpreting scatter in elliptical models or independent component analysis.
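
A short sketch computing these scalar summaries and the spectral decomposition; the matrix S is the same illustrative example as above.

```python
import numpy as np

S = np.array([[ 4.0, 2.4, -1.0],
              [ 2.4, 9.0,  0.6],
              [-1.0, 0.6,  1.0]])

# Scalar summaries of multivariate scatter.
total_dispersion = np.trace(S)         # sum of diagonal elements (and eigenvalues)
generalized_volume = np.linalg.det(S)  # product of eigenvalues

# Spectral decomposition S = V Lambda V^T (eigh returns ascending order).
evals, V = np.linalg.eigh(S)
evals, V = evals[::-1], V[:, ::-1]     # reorder: largest scatter direction first

print("trace:", total_dispersion, "det:", generalized_volume)
print("eigenvalues (decreasing):", evals)
print("share of scatter per principal direction:", evals / evals.sum())
```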

Applications

Exploratory Data Analysis

In exploratory data analysis (EDA), robust scatter matrices provide a reliable measure of multivariate dispersion and dependence structure, resistant to outliers that can distort classical estimates. This allows analysts to detect issues such as gross errors or contamination early, informing decisions on data cleaning or robust preprocessing before modeling. High-breakdown estimators like the Minimum Covariance Determinant (MCD) achieve up to 50% breakdown points, enabling consistent estimation under elliptical models while identifying influential observations. Robust scatter matrices integrate into EDA workflows as diagnostic tools for assessing multivariate structure in contaminated datasets, often signaling the need for outlier removal or weighted analyses when eigenvalues indicate anomalous spread. By providing affine-equivariant estimates, they support variable selection and guide the application of robust techniques. Early developments in robust scatter estimation, such as the M-estimators introduced by Maronna in 1976, emphasized minimizing dispersion functionals for consistent estimation under elliptical assumptions, enhancing EDA's ability to reveal true data characteristics without sensitivity to anomalies. A representative real-world application is Fisher's Iris dataset, where robust within-class scatter matrices estimate dispersion for sepal and petal measurements across species, revealing clustered structures resistant to potential outliers and supporting classification via linear discriminant analysis.
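
A minimal sketch of this diagnostic workflow on the Iris data, flagging potential outliers by robust Mahalanobis distances; the 97.5% chi-squared cutoff is a common but arbitrary convention.

```python
import numpy as np
from scipy.stats import chi2
from sklearn.covariance import MinCovDet
from sklearn.datasets import load_iris

X = load_iris().data                 # 150 x 4 sepal/petal measurements
p = X.shape[1]

# Robust scatter and location via MCD; robust squared Mahalanobis distances.
mcd = MinCovDet(random_state=0).fit(X)
d2 = mcd.mahalanobis(X)

# Flag observations beyond the 97.5% chi-squared cutoff as potential outliers.
cutoff = chi2.ppf(0.975, df=p)
print("flagged observations:", np.where(d2 > cutoff)[0])
```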

Multivariate Correlation Assessment

Scatter matrices facilitate the assessment of multivariate dependencies by generalizing the covariance matrix to capture second-order characteristics, including in distributions lacking finite moments or with heavy tails. This reveals patterns such as elliptical shapes in standardized data or deviations indicating non-normality, where the matrix's eigenvectors highlight principal directions of variation. Such structural insights are valuable in exploratory settings to identify latent dependencies without assuming linearity or normality. Beyond classical Pearson correlations derived from covariance, robust scatter matrices detect dependencies robust to outliers, as their positive definiteness ensures well-defined orientations even under contamination. For instance, in elliptical distributions, the scatter matrix parameterizes the shape, relating to covariance by a scalar factor, aiding identification of scaled associations. This supports modeling choices like robust PCA for handling nonlinear or asymmetric dependencies. Extensions often involve symmetrized scatter matrices, such as S_s(\mathbf{X}) = S(\mathbf{X}_1 - \mathbf{X}_2) with \mathbf{X}_1, \mathbf{X}_2 independent copies of \mathbf{X}, to exploit independence properties and enhance assessment in non-elliptical cases. These can incorporate higher-order moments for finer characterization of dependence. The scatter matrix serves as a foundation for formal tests of multivariate association, providing estimates for procedures like the robust Mantel test on distance matrices derived from variables, common in population genetics or spatial analysis. Observed eigenvalue patterns may prompt such evaluations to quantify the overall significance of dependence.
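
A minimal sketch of a symmetrized scatter estimate, using the classical covariance as the base functional for simplicity; in practice a robust functional would be substituted.

```python
import numpy as np

def symmetrized_scatter(X):
    """Symmetrized scatter: apply a scatter functional (here the classical
    covariance) to all pairwise differences x_i - x_j; differencing removes
    the need for a location estimate."""
    i, j = np.triu_indices(X.shape[0], k=1)
    return np.cov(X[i] - X[j], rowvar=False)

rng = np.random.default_rng(4)
X = rng.multivariate_normal([5, -3], [[2.0, 0.9], [0.9, 1.0]], size=300)
print(symmetrized_scatter(X))  # proportional to the covariance of X
```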

Advantages and Limitations

Key Benefits

Scatter matrices offer a flexible framework for capturing the second-order structure of multivariate data, generalizing the univariate variance to describe dispersion and dependencies in a way that is invariant to location shifts and accommodates various distributional assumptions beyond normality. Their affine equivariance property ensures that transformations of the data, such as rotations or scalings, result in correspondingly transformed scatter matrices, preserving the geometric interpretation of the data cloud's shape and orientation. A primary advantage is their role in robust estimation, where alternatives to the sample covariance matrix, such as M-estimators or the Minimum Covariance Determinant (MCD), achieve high breakdown points—up to nearly 50% contamination—allowing reliable inference even in the presence of outliers or gross errors that would distort classical methods. This robustness makes scatter matrices essential in procedures like principal component analysis (PCA) and linear discriminant analysis (LDA), where they enable stable dimensionality reduction and class separation under elliptical models. In advanced applications, such as independent component analysis (ICA), scatter matrices facilitate blind source separation by exploiting higher-order statistics; for instance, methods like FOBI use multiple scatter functionals to jointly diagonalize matrices and recover independent components, providing consistency under assumptions of non-Gaussianity.
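
A sketch of the FOBI idea under the stated assumptions (sources with distinct kurtoses); the mixing matrix and source distributions below are arbitrary illustrative choices.

```python
import numpy as np

def fobi(X):
    """FOBI: estimate independent components by jointly diagonalizing the
    covariance matrix and a fourth-moment scatter matrix."""
    n, p = X.shape
    Xc = X - X.mean(axis=0)
    # Whiten with the first scatter functional (the covariance matrix).
    evals, V = np.linalg.eigh(np.cov(Xc, rowvar=False))
    W_white = V @ np.diag(evals ** -0.5) @ V.T
    Z = Xc @ W_white
    # Second scatter functional: fourth-moment matrix E[||z||^2 z z^T].
    r2 = np.sum(Z ** 2, axis=1)
    C4 = (Z * r2[:, None]).T @ Z / n
    # Its eigenvectors rotate the whitened data onto independent components.
    _, U = np.linalg.eigh(C4)
    return Z @ U

# Example: unmix two independent non-Gaussian signals (illustrative data).
rng = np.random.default_rng(5)
S = np.column_stack([rng.uniform(-1, 1, 2000), rng.laplace(size=2000)])
A = np.array([[1.0, 2.0], [0.5, -1.0]])  # arbitrary mixing matrix
recovered = fobi(S @ A.T)                # ~ sources up to order/sign/scale
```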

Common Drawbacks

While powerful, scatter matrices based on second moments, like the sample covariance matrix, assume finite variances and are highly sensitive to outliers, potentially leading to unstable or ill-conditioned estimates if second moments do not exist or the data are contaminated. Robust alternatives mitigate this but often at the cost of reduced efficiency; for example, the MCD estimator exhibits low asymptotic efficiency (around 20-30% at the normal distribution for moderate dimensions), requiring reweighting or hybridization to approach the efficiency of classical methods. Computational demands pose another limitation, particularly for high-breakdown estimators in high-dimensional settings. Exact computation of the MCD involves enumerating subsets of data points, which scales combinatorially with sample size and dimension, rendering it infeasible for large datasets without approximations like FAST-MCD, which may not always find the global optimum. Additionally, maintaining affine equivariance and consistency under elliptical assumptions can introduce iterative optimization challenges, increasing computational cost in practice. Scatter matrices also rely on model assumptions, such as elliptical symmetry for certain properties, like proportionality to the covariance matrix via a scalar multiple, which may not hold for non-elliptical distributions, limiting their applicability in diverse data scenarios.
