CLASS-L Archives

April 2007

CLASS-L@LISTS.SUNYSB.EDU

Options: Use Monospaced Font
Show HTML Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
Hillel Coren <[log in to unmask]>
Reply To:
Classification, clustering, and phylogeny estimation
Date:
Mon, 30 Apr 2007 10:06:36 -0400
Content-Type:
text/plain
Parts/Attachments:
text/plain (40 lines)
I'm working on the following problem and was hoping that someone in
the forum could lend a hand. I've got the following data:

publication | article | sales conversion
----------------------------------------------------
forbes | top 10 businesses | 0.283
newsweek | best 10 pet stores | 0.347
… | … | …

The goal is to use the publication and article to select future
articles which we predict will have a high sales conversion rate. I've
transformed the data into the following ARFF file.

@resource articles

@attribute publication {forbes, newsweek, ...}
@attribute top {yes, no}
@attribute 10 {yes, no}
@attribute businesses {yes, no}
@attribute best {yes, no}
@attribute pet {yes, no}
@attribute stores {yes, no}
…
@attribute sales_conversion NUMERIC

@data
{0 Forbes, 1 yes, 2 yes, 3 yes, 7 0.283}
{0 newsweek, 4 yes, 2 yes, 5 yes, 6 yes, 7 0.283}
…

What do you feel would be the best way to approach this problem (I'm
using WEKA)?

Thank you for your help,
Hillel

----------------------------------------------
CLASS-L list.
Instructions: http://www.classification-society.org/csna/lists.html#class-l

ATOM RSS1 RSS2