Suppose that we have a set of points on a line. The amount of dispersion can be measured by standard deviation in this case.
My question is, is there something similar for higher dimensions? For example, if there are 100 points ($i, j: 0 le i le 9, 0 le j le 9$), then we want to say that the "area" formed by these points is roughly 100.
I have two ideas, but both are not very good.
The area or volume of the convex hull. However, this value can be affected too much by outliers.
The product of diversion of x-coordinates and that of y-coordinates. However, imagine points on a diagonal line: we want small value for this case but the product will be big.
Best Answer
In situations like this, people often use the variance-covariance matrix. Along the main diagonal, the variance for each dimension is listed. Each $i, j$th off diagonal element (where $ine j$) lists the covariance of variables $i$ and $j$. In this way, every aspect of the dispersion is listed separately.
On the other hand, if you need a single number for simple comparisons, the determinant is sometimes used.