Predicting A Continuous Value (Run Minutes)

Oct 23, 2007

Hello,

I have a question about whether Data Mining can be applied to a specific problem I have to solve.

Situation: I'm going to simplify the problem by explaining it in terms of a "pizza manufacturer". Suppose I wanted to predict the run minutes + downtime minutes (I use these to get an hourly rate: Pizzas/(run hrs + delay hrs) = Pizzas per hour) by looking at a set of input properties.
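To make the rate formula above concrete, here is a minimal sketch in Python. The function name and the sample numbers are made up for illustration; only the formula itself comes from the description above.

```python
def pizzas_per_hour(pizzas: float, run_minutes: float, delay_minutes: float) -> float:
    """Hourly rate = pizzas / (run hours + delay hours)."""
    total_hours = (run_minutes + delay_minutes) / 60.0
    if total_hours <= 0:
        raise ValueError("run + delay time must be positive")
    return pizzas / total_hours


# Illustrative numbers only (not real production data):
# 1200 pizzas over 100 run minutes plus 20 delay minutes.
rate = pizzas_per_hour(pizzas=1200, run_minutes=100, delay_minutes=20)
print(rate)
```

Because the rate is derived from both measured values, a model could predict run minutes and delay minutes separately, or predict their sum directly; which works better depends on the data.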

My properties could be something like the following:
# of Toppings
# of Special Pricing Stickers
Cardboard Box Indicator
Case Indicator (0 represents auto-casing, 1 represents putting in case by hand)
Machine Type (0 or 1: 0 represents an older, slower machine; 1 is newer)
Quantity of Run
(there could be up to 15 other properties that may or may not impact our rate)

Measured Values:
Run Minutes
Delay (down) minutes

Steps I've Done So Far:
I've created a couple of different data mining models for this, as I was unsure which one(s) to use. I checked the lift chart while feeding the original data set back in, and the scatter plot showed fairly inaccurate predictions.

I've tried a linear regression in Excel, but my R-squared value was always around 0.30. I decided to try SQL Server Data Mining to see whether it could predict more accurately than a linear formula.
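For anyone who wants to reproduce the R-squared check outside Excel, here is a rough sketch in Python. It fits an ordinary least-squares line with a single predictor, which is a simplification: the real problem has several inputs. The function names and the sample data are hypothetical.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


def r_squared(x, y, slope, intercept):
    """R^2 = 1 - SS_res / SS_tot for the fitted line."""
    y_bar = mean(y)
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - y_bar) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


# Made-up sample: number of toppings vs. run minutes.
toppings = [1, 2, 3, 4, 5, 6]
run_minutes = [50, 70, 55, 90, 60, 95]

slope, intercept = fit_line(toppings, run_minutes)
print(r_squared(toppings, run_minutes, slope, intercept))
```

An R-squared near 0.30 means the line explains roughly 30% of the variance in the target, so most of the variation comes from something the inputs do not capture linearly.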

I've played with a couple of different algorithms in Data Mining, and none of them appeared to predict especially well. I even checked the lift chart using the same table I used to train the model.

What algorithm(s) might work the best?
Can I reasonably expect a prediction within a fairly strict tolerance (I'm guessing the answer to this is: "yes, if your source data represents a consistent pattern")?
How can I best use Data Mining to give an answer like "historically, your run rate has been between these 2 values with a probability of X"? I'm thinking I can use PredictProbability and stdev to some extent.
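To illustrate the kind of answer the last question is after, here is a minimal sketch in Python that turns a mean and standard deviation into a "between these 2 values with probability X" range. It assumes the historical rates for comparable runs are roughly normally distributed, which may not hold for real data. The function name and the sample rates are hypothetical, and this is not how SQL Server Data Mining computes its own statistics.

```python
from statistics import NormalDist, mean, stdev


def rate_interval(rates, probability=0.90):
    """Central interval expected to hold `probability` of rates.

    Assumes the rates are approximately normal (an assumption,
    not something the data guarantees).
    """
    if not 0 < probability < 1:
        raise ValueError("probability must be between 0 and 1")
    dist = NormalDist(mu=mean(rates), sigma=stdev(rates))
    tail = (1 - probability) / 2
    return dist.inv_cdf(tail), dist.inv_cdf(1 - tail)


# Made-up historical pizzas-per-hour values for similar runs.
historical_rates = [580, 600, 620, 590, 610]

low, high = rate_interval(historical_rates, probability=0.90)
print(f"Historically, the run rate has been between {low:.0f} and {high:.0f} "
      f"with a probability of about 0.90")
```

If the rates are clearly skewed, empirical percentiles of the historical values would be a safer basis for the range than a mean plus or minus a multiple of the standard deviation.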

Any suggestions would be greatly appreciated.

If anyone needs further clarification, please let me know.

Thank you.

Regards,

Dan
