Dataming Of Subsets Question

Jul 26, 2007

Have not found an active general data mining forum yet so asking here.

I'm new to data mining band have been given the task of setting up a data mining system. The problem is that our setup seems pretty non standard and I'm not sure how to use data mining on it or what my expectations should be. The situation is:

We have a large and growing set of strings which we get requests for (> 100,000). The requests have many, mostly nominal (non-numeric), variables associated with them. We can only handle a subset (probably less than 10,000) or the strings at any one time. We want to use data mining to analyze historic requests so we can figure out which strings we are going to handle under a given set of variables. So given that our variable currently have values X1, ..., Xn, what subset should be use given than a large database of historic string requests.

Anyone know what techniques would work well for this kind of problem? This is a quick and dirty kind of project, no special purpose hardware or expensive software on this one. I've been looking at using rapid miner but not sure that it's a great tool in this case.

thanks in advance,
max

View 4 Replies


ADVERTISEMENT

Sueqntially Numbered Subsets

Mar 25, 2008

I have a table that is created based on a heirarch of set criteria and need to number the subsets within. The best way to illustrate what I want to accomplish is through an example.

Rank | Item_1 | Item_2 | Class
---- | ------ | ------ | ------
1 | 1A | 1B | ExactMatch
2 | 1A | 1C | Sub
3 | 1A | 1D | Sub
1 | 2B | 2F | ExactMatch
2 | 2B | 2G | Sub

Is there a way to accomplish this? Any help or suggestions are appreciated.

I apologize for the formatting, but I think you should be able to get the point.

thanks,

View 3 Replies View Related

Constrains On Table Subsets

Oct 12, 2007

Hello!
I'm looking for some ideas about how
to ensure constraints on multiple rows.
Of course one can use stored procedures and/or triggers while inserting.
Other ideas?

The reason for this question is that it could be hard to ensure,
because if lock level is not high enough, you can get duplicates during concurrency,
if lock level is too high, it can result in deadlocks.

Thanks

View 5 Replies View Related

Greater/Less Than Query From Within Subsets Of Results

Sep 27, 2005

I think I'm trying to do a simple query on maximum date.

I've got 100 tools that have been used over the past three years.
Some of the tools are used almost every day. Other tools haven't been used for a month, while other tools haven't been used for a year or more.

Ultimately I'm trying to just find the list of tools whose latest date of use was a year ago.

I have a list of tools and a list of times each tool was used.

I think I'm going to have to do a search that for each tool what was the times it was used. That I can do.

What I'm not sure of is how to then pull only the latest date for each tool.

Once I get that I can then do a query off that result to pull the "oldest latest" date of use.

View 1 Replies View Related

Returning Subsets Of The Same Column In Separate Columns...

Mar 23, 2006

I have the following 2 Sql queries. They both are rowcounts of the same column but based on different criteria. What I want to do is return the two results side by side in separate columns:
-- Subscriptions since Sept. 24th
SELECT count(*)
FROM SiteMemberTable103 s(nolock)
JOIN clientmembertable25 c(nolock) ON s.memberid = c.memberid
WHERE site_firstjoindate is not null
and c.clientunsubscribe = 0
and c.validemailaddr = 1
and s.unsubscribe = 0

-- Subscriptions in February
SELECT count(*)
FROM SiteMemberTable103 s(nolock)
JOIN clientmembertable25 c(nolock) ON s.memberid = c.memberid
WHERE site_firstjoindate BETWEEN '2006-02-01 00:00:00.000' AND '2006-03-01 00:00:00.000'
AND c.clientunsubscribe = 0
AND c.validemailaddr = 1
AND s.unsubscribe = 0

It seems like a UNION ALL should work but it just returns the results in one column. I tried changing the count by specifying a different column for each but that doesn't work either. I also tried writing it as one query and using alias to differentiate the two tables but that just gives me syntax errors. I suspect there is a more elegant way to do this but I'm at a loss. Any help would be greatly appreciated!

Caeanis

View 1 Replies View Related







Copyrights 2005-15 www.BigResource.com, All rights reserved