Subject: | |
From: | |
Reply To: | Classification, clustering, and phylogeny estimation |
Date: | Mon, 7 Jun 2010 07:27:40 -0400 |
Content-Type: | multipart/alternative |
Parts/Attachments: |
|
|
It is a function of the standardization used in correspondence analysis. See
papers and books by Greenacre. For example Journal of applied stat. 1993.
20:251-269.
----------------------
F. James Rohlf, Distinguished Professor
Dept. Ecology and Evolution, Stony Brook University, NY 11794-5245
From: Classification, clustering, and phylogeny estimation
[mailto:[log in to unmask]] On Behalf Of Eric Wajnberg
Sent: Monday, June 07, 2010 3:58 AM
To: [log in to unmask]
Subject: correspondence analysis with strange triangular structure
Dear All,
I am doing a simple correspondence analysis on a contingency table having
more than 5000 lines (that are representing genes) and 4 columns. I am then
doing a clustering analysis on the coordinates of all genes on the factorial
axes. The results of the correspondence analysis are quite surprising to me.
The coordinates of all rows on the 3 axes are strictly within a simplex that
is a perfect tetrahedron. Of course, the extremities of the tetrahedron are
corresponding to the coordinates of the column in the factorial space. How
could such a structure be obtained?
I first thought that this was due to a specific structure in my original
dataset, so I've decided to do the analysis again on a randomly drawn
contingency table of the same size (each cell was drawn from a uniform
distribution between 0 and 100). I collected again such a triangle
structure.
It thus seems that a simple correspondence analysis is always producing such
a triangular structure on a space with a small number of dimensions, but I
never heard about this before. I suspect that people are usually not getting
this because either they have more dimensions, or less rows in the original
table, leading to points that look to be spread on factorial plans in a more
homogeneous way.
----------------------------------------------
CLASS-L list.
Instructions: http://www.classification-society.org/csna/lists.html#class-l
|
|
|