CLASS-L Archives

September 2006

CLASS-L@LISTS.SUNYSB.EDU

Options: Use Monospaced Font
Show HTML Part by Default
Condense Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Sender:
"Classification, clustering, and phylogeny estimation" <[log in to unmask]>
Subject:
From:
Date:
Sun, 17 Sep 2006 10:25:37 EDT
Content-Type:
multipart/alternative; boundary="-----------------------------1158503137"
MIME-Version:
1.0
Reply-To:
"Classification, clustering, and phylogeny estimation" <[log in to unmask]>
Parts/Attachments:
text/plain (958 bytes) , text/html (1427 bytes)
I agree with Professor Rohlf. The issue for good clusters twofold: 1) using
an appropriate distance messaure (ie. Euclidean, Mahalanobis, "city-block") 
and 2) a clustering method: Wards, single-linkage, average linkage etc.

Many times it's try and error to find the right combination of the above. In 
finance
for stock retruns, squared Euclidean distance with Wards produces nice result 
for cluster.

They key is to see and look at the clusters if they make sense. One could 
cluster data
use the cluster as seeds to a method as discriminant analysis and see how 
many miss-classifications occur.

Of course there are more robust clustering methods that detect outliers 
better and don't "lock-in" a case into a cluster. See the work by Rosseau and such 
cluster methods as Fanny.

Neil Gottlieb
ProfileDepot, Inc
http://www.profiledepot.com

----------------------------------------------
CLASS-L list.
Instructions: http://www.classification-society.org/csna/lists.html#class-l


ATOM RSS1 RSS2