CLASS-L Archives

June 2004

CLASS-L@LISTS.SUNYSB.EDU

Options: Use Monospaced Font
Show Text Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
Reply To:
Classification, clustering, and phylogeny estimation
Date:
Wed, 16 Jun 2004 12:25:58 -0400
Content-Type:
text/plain
Parts/Attachments:
text/plain (81 lines)
Luca-
      In spite of their appeal in the consumer world where many seem to
believe that large amounts of information is tantamount to insurance of
some type, the sorry fact is that with massive amounts of data, all too
frequently what you get is massive redundancy.  David Scott's suggestion to
do mode clustering with large databases remains one of the most sensible
suggestions I've ever heard.
Regards,
Tom Ball
McKinsey & Co
55 East 52nd Street
New York, NY  10022




                        Luca Meyer
                        <lucameyer@TIS         To:      [log in to unmask]
                        CALI.IT>               cc:      (bcc: Thomas Ball/NYO/NorthAmerica/MCKINSEY)
                        Sent by:               Subject: TwoStep clustering method comparison
                        "Classificatio
                        n, clustering,
                        and phylogeny
                        estimation"
                        <CLASS-L@lists
                        .sunysb.edu>
                        06/16/2004
                        11:24 AM
                        Please respond
                        to
                        "Classificatio
                        n, clustering,
                        and phylogeny
                        estimation"






Hello,

I am searching for working/published papers on twostep clustering method
comparison as well as references about this and other methods for
clustering large datasets. I am already aware of the following material:

Chiu, T., Fang, D., Chen, J., Wang, Y., and Jeris, C. (2001). A Robust
and Scalable Clustering Algorithm for Mixed Type Attributes in Large
Database Environment. Proceedings of the seventh ACM SIGKDD
international conference on knowledge discovery and data mining, 263.

Zhang, T., Ramakrishnon, R., and Livny M. (1996). BIRCH: An Efficient
Data Clustering Method for Very Large Datebases. Proceedings of the ACM
SIGMOD Conference on Management of Data, p. 103-114, Montreal, Canada.

Gore, P. A. Jr. (2000). Cluster analysis. In H. E. A. Tinsley & S. D.
Brown (Eds.), Handbook of applied multivariate statistics and
mathematical modeling (pp. 297-321). San Diego, CA: Academic Press.

Thank you in advance,

Mr. Luca Meyer
Consumer research advisor: http://www.lucameyer.com/en/
Italian Online Research Mailing List:
http://it.groups.yahoo.com/group/ior
Tel: +390122854456 - Fax: +390122854837 - Mobile: + 393355217628

- One world, one human race -



+---------------------------------------------------------+
This message may contain confidential and/or privileged
information.  If you are not the addressee or authorized to
receive this for the addressee, you must not use, copy,
disclose or take any action based on this message or any
information herein.  If you have received this message in
error, please advise the sender immediately by reply e-mail
and delete this message.  Thank you for your cooperation.
+---------------------------------------------------------+

ATOM RSS1 RSS2