Not long ago I encountered this situation: I had two databases on "MS SQL Server". One of the databases held a nomenclature with a very wide primary key.
I had to move that nomenclature into the other database and replace the wide PK with a single identity column.
I decided to use a function for that transformation. BUT that function had to record somewhere which combination of the PK columns corresponds to which identity value. BUT functions CAN'T WRITE under MSSQL.
So I took up the challenge and mined every source of information I could find. The result was a function, "Exec4Fun", that breaks the rule.
I suppose that with this function it's possible to avoid the restriction for triggers, which prevents writing in the triggering table? (not tested yet)
If someone needs such tools, just write back your e-mail and I'll send some code.
I have a report with several matrix objects. The data contained in each matrix is simple. One matrix has one column of labels and one column of data. The other has 3 columns of data. Both matrices only show about 25 rows of data. A variable amount of data is displayed above the matrices, so that sometimes they render across a page boundary. Whenever this happens the matrix doesn't render rows down to the bottom of the page. A considerable amount of empty space is left on the page, and the complete matrix is displayed on the next page. I'm wondering how to get the matrix to render in this empty space.
This problem only occurs when I view the report on screen, in the Visual Studio "Preview" window, or from the Report Server web site. TIFF, PDF and printed output don't contain the extra space.
I've tried adding my matrix to a List and a Rectangle to see if this would fix the problem, but it didn't help.
I've checked the dimensions and margins of the page and I don't think I have any sizing problems; everything should fit on the page.
Please forgive my ignorance; I am 2 months new to SQL Server 2000.
Is there a way to create a rule specific to a user such that he can only SELECT from a table where a field equals a specific value?
In other words user Joe types in: SELECT * FROM SALES_BY_LOCATION. Because Joe's territory is Los Angeles, I want a rule that only retrieves sales data from Los Angeles.
In my table I'm collecting information about our customers. It has the fields zip and phone. I want to implement a rule that zip should be at least 5 characters and phone should be 13 characters including the '-'s (333-333-3333). How can I implement these two rules on my table?
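A minimal sketch of the two checks, written in Python rather than as table constraints, just to pin the rules down. The function names are made up for illustration, and the phone pattern is taken from the 333-333-3333 example (which is 12 characters long, so the stated length may need adjusting).

```python
import re

# Pattern taken from the example in the post: 333-333-3333.
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")


def is_valid_zip(zip_code: str) -> bool:
    """Rule 1: zip must be at least 5 characters (ignoring outer spaces)."""
    return len(zip_code.strip()) >= 5


def is_valid_phone(phone: str) -> bool:
    """Rule 2: phone must look like NNN-NNN-NNNN, dashes included."""
    return PHONE_PATTERN.fullmatch(phone) is not None
```

The same two conditions could presumably be expressed as constraints on the table itself; the sketch only shows what would have to be checked.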
When I use the MS association rules, I don't know how they work in the background. I am studying the FP-Growth algorithm, but I have some questions. I don't know what a transaction database means. Who can give me an example? Thanks. I know we can store the data in a relational database, but in basket analysis, how is a transaction stored in a relational database?
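One common way to picture it, as a hedged sketch: the relational table holds one row per (transaction, item) pair, and a "transaction" in the basket-analysis sense is the set of items sharing the same transaction id. The rows and the helper name below are invented for illustration.

```python
from collections import defaultdict

# Hypothetical rows, as they might come from a relational table
# with one row per (transaction_id, item) pair.
rows = [
    (1, "bread"), (1, "milk"),
    (2, "bread"), (2, "butter"), (2, "milk"),
    (3, "beer"),
]


def to_baskets(rows):
    """Group (transaction_id, item) rows into one item set per transaction."""
    baskets = defaultdict(set)
    for transaction_id, item in rows:
        baskets[transaction_id].add(item)
    return dict(baskets)


baskets = to_baskets(rows)
```

Each value in `baskets` is one transaction of the transaction database that algorithms such as FP-Growth work on.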
In my package I have a column called "optinout" coming from the source file, which has the values "start" and "stop". In my destination table I have a column called "contact permission code", to which I need to apply a business rule based on the value in "optinout". The rule is stated as: "Contact Permission Code" is OptOut if the "OptInOut" value is "STOP", else OptIn.
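The rule itself is a single conditional. A minimal sketch in Python, assuming the comparison should ignore case and surrounding spaces (the post shows both "stop" and "STOP"); the function name is made up.

```python
def contact_permission_code(opt_in_out: str) -> str:
    """Return "OptOut" when the source value is STOP, otherwise "OptIn".

    Assumption: the comparison ignores case and surrounding whitespace.
    """
    return "OptOut" if opt_in_out.strip().upper() == "STOP" else "OptIn"
```

In the package this would be the same expression applied per row to produce the destination column.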
Hi there, I have been trying for a long time to run Microsoft Association Rules on my database.
I have solved the memory leak problem now, but I still can't understand the output rules.
The database contains all the Italian students who graduated last year. Here in Italy they have to fill in a summary about their university experience, e.g. they rate their experience with teachers (a score from 1 to 5), they say whether or not they want to continue at university, and so on.
Most of the rules say: Int_Stud=1-2, RapDoc>4
Int_Stud is the column where I store the student's intention to continue university. 1 means they want to go on, 2 means they do not want to continue studying. So this rule makes no sense to me, because it covers all the students: those who want to continue university and those who do not.
I think the problem is that Visual Studio 2005 and Analysis Services have no understanding of what Int_Stud means; they have no idea that Int_Stud can take just 2 values and that these are opposites. Is there a solution to this problem? Can I discretize this column?
I know my English is not perfect, but I hope I am understandable.
I used the MS clustering method on my sample data, and I would like to apply the rules it built to score/predict on another data set. But I am not able to find out where the rules are. I am wondering whether there is SQL code available that can be applied directly to the new data set. Cheers
I note that there exist three web viewers for data mining algorithms, namely, DMNaiveBayesViewer, DMDecisionTreeViewer and DMClusterViewer. How come there are no viewers for association rules (itemsets, rules, dependency network)? Can you suggest any alternative way of showing such valuable information in a web application?
I understand Mr. MacLennan's explanation provided at http://forums.microsoft.com/MSDN/ShowPost.aspx?PostID=282651&SiteID=1 and appreciate the time he took to explain how importance works. However, like the user with username "sang", I also ran the data in BI 2005 and got the same results listed by the aforementioned user. I did this using the following data:
donut muffin
y y
y y
y y
y y
y y
y y
y y
y y
y y
y y
y y
y y
y y
y y
y y
n y
n y
n y
n y
n y
etc.
The rule muffin -> donut has an importance of -0.105302438, which is not the same as Mr. MacLennan's results. I tried switching the roles of a and b in a -> b and using different bases for the logarithms. I don't get the result of -0.105302438 with any of these. I also tried to calculate importance with a small data set I have and can't reproduce the results using Mr. MacLennan's explanation with that data set either. Any thoughts on the discrepancy?
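For reference, a minimal sketch of the plain calculation, assuming the importance of a rule A -> B is log10(P(B|A) / P(B|not A)) with no smoothing or other adjustment. The base-10 logarithm and the counts in the usage note are assumptions for illustration; they are not the donut/muffin data, which is truncated above, and this sketch is not claimed to reproduce what the server does internally.

```python
import math


def rule_importance(n_a_and_b: int, n_a: int, n_b: int, n_total: int) -> float:
    """Unsmoothed importance of A -> B as log10(P(B|A) / P(B|not A)).

    n_a_and_b: cases containing both A and B
    n_a:       cases containing A
    n_b:       cases containing B
    n_total:   all cases
    Raises on zero denominators or a zero ratio (no smoothing is applied).
    """
    p_b_given_a = n_a_and_b / n_a
    p_b_given_not_a = (n_b - n_a_and_b) / (n_total - n_a)
    return math.log10(p_b_given_a / p_b_given_not_a)
```

With hypothetical counts of 100 cases, 40 containing A, 45 containing B and 30 containing both, P(B|A) is 0.75 and P(B|not A) is 0.25, so `rule_importance(30, 40, 45, 100)` is log10(3). If the server's figure differs from this kind of hand calculation, one possible explanation is that it adjusts the counts before taking the ratio.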
I am doing the Market basket analysis for a retailer using association rule. The whole data set is huge which contains grocery, clothes and books etc. If I want to check out the relationship between several different clothes brands, (e.g. LEVI'S and adidas), should I just remove all the grocery and books transactions, use the subset which only contains clothes transactions to re-run the association rules? Is this gonna work?
I got a question about the data preparation of market basket analysis.
There are always some transactions with only one single SKU product. It seems that these kinds of transactions have nothing to do with association. Shall I just exclude them, or what?
I want to score my data using only the association rules I filtered in the Mining Model Viewer.
Is this possible?
I noticed that the mining model prediction query uses the generated model (all rules).
Is there any way to influence the model at generation time?
Thanks a lot.
Hi,
I have pinned down my problem, but I don't understand why MS SQL Server doesn't recognize the Association parameters I put in. It uses the Minimum_Probability I entered, but it ignores the Minimum_Importance I wrote and uses the default value.
I wanted to paste a screenshot here but it isn't possible. By the way, I'm using the evaluation version, which should be the same as the Enterprise edition.
I need to create a set of cases for a project that uses the Microsoft Association Rules algorithm to recommend products to customers. My question is: must the set of cases include all customer transactions for training, or is some percentage of the total transactions sufficient? If I do not use all customer transactions, could the algorithm leave some products out of its itemsets or rules and so be unable to make recommendations about them?
I need to implement logic similar to a rule engine. Below is an example. How can I execute all the queries in the order given by the second table? What is the best way to implement this?
DECLARE @tblRules AS TABLE (RuleNo INT, RuleDesc NVARCHAR(500), RuleQuery NVARCHAR(MAX), QueryExecutionInterval NVARCHAR(50))
DECLARE @tblRuleResults AS TABLE (RuleResultID INT, RuleNo INT, ExecuteTime DATETIME, NextExecutionTime DATETIME, Result NVARCHAR(10))
INSERT INTO @tblRules VALUES ('1', 'Fail - 2 times within 1 Hour', 'XXX', 'Every 15 Minutes')
INSERT INTO @tblRules VALUES ('2', 'Fail- 2 times within 2 Hour', 'YYY', 'Every 30 Minutes')
The problem is that I can't seem to reduce the minimum probability below .42 to view more rules.
I've considered that it might be because these are the only rules discovered; however, I know quite a bit about the data and I would expect many more associations.
I'm new to analysis services and hopefully this is a quick & easy question. I have a couple of quite large (163,000 tuple) tables with columns essentially representing a bit vector. I would like to mine for association rules but the number of '1' values are very, very sparse and they are the only objects of interest. How can I get more control over the algorithm---that is, how can I stipulate that the state of the column must be '1' to be considered? Any help or direction to the proper documentation would be great.
I'm building a mining model with MS Association Rules. After processing this model, the result includes some rules (example):
E = Existing, C = Existing -> B = Existing
F = Existing -> E = Existing
C = Existing, B = Existing -> E = Existing
F = Existing -> B = Existing
B = Existing, A = Existing -> C = Existing
F = Existing, B = Existing -> E = Existing
F = Existing, E = Existing -> B = Existing
D = Existing -> A = Existing
C = Existing -> A = Existing
E = Existing, A = Existing -> B = Existing
I want to build a query that has two or more items on the left side of the rule. Example: E = Existing, C = Existing -> B = Existing. I want to build a query to predict that when a customer buys 'E' and 'C', he is likely to buy 'B'.
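To make the intended behaviour concrete, here is a hedged sketch in Python (not a prediction query against the model): a rule fires when every item on its left side is in the customer's basket. The rules are copied from the listing above; the function name and the choice to skip items already in the basket are my own assumptions.

```python
# Rules copied from the listing above as (left-hand items, predicted item).
RULES = [
    ({"E", "C"}, "B"),
    ({"F"}, "E"),
    ({"C", "B"}, "E"),
    ({"F"}, "B"),
    ({"B", "A"}, "C"),
    ({"F", "B"}, "E"),
    ({"F", "E"}, "B"),
    ({"D"}, "A"),
    ({"C"}, "A"),
    ({"E", "A"}, "B"),
]


def recommend(basket, rules=RULES):
    """Return predicted items of every rule whose left side is in the basket.

    Items already in the basket are skipped and each item is listed once,
    in the order the rules appear.
    """
    basket = set(basket)
    recommended = []
    for left_side, predicted in rules:
        if left_side <= basket and predicted not in basket and predicted not in recommended:
            recommended.append(predicted)
    return recommended
```

For a basket of 'E' and 'C', two rules fire: "E, C -> B" and "C -> A", so `recommend({"E", "C"})` gives B and A.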
I read the paper on sequence clustering. It seems that the major application of the algorithm is for web sites. I was wondering whether I can apply this algorithm to the purchase sequences in credit card data.
If so, please also tell me the difference between sequence clustering and association rules when applied to credit card data. Although I realize that sequence clustering is a fully probabilistic model with the capability of prediction, association rules also give the probabilities of purchasing other products.
I would really appreciate it if you could help me out. I am trying to create a taxonomy to be taken into account by the association rules algorithm. For example, if my data is a group of purchases from a supermarket, I could have one client who bought milk, cookies and shampoo, and another who bought cheese, cookies and soap.
I would like to specify that milk and cheese belong to the category "dairy" and shampoo and soap belong to "personal hygiene". If there are interesting rules regarding the categories, I would like them to be taken into account. Additionally, I would not like to have rules like "milk -> dairy". If one specific object appears in a rule, its corresponding category should not. In this scenario I could have milk and "personal hygiene" in the same rule, but not shampoo and "personal hygiene".
I have seen this done by other mining tools but I've been having trouble finding a way to make this possible in Analysis Services.
I'm pretty new to SQL Server 2005 and I'm wondering how to organize my business rules efficiently.
I'm planning to create a package with several Data Flow tasks in my Control Flow, with each business rule encapsulated in its own Data Flow task. I guess this is possible in SSIS, but I can't find out how my data can be sent from one data flow to another. I definitely don't want my dataset to be parsed completely for each business rule.
Does anyone have an idea how to implement this in SSIS?
I don't know if this is the correct forum or if it would perhaps be the server setup & upgrade, however, I'll ask here first. I am going through the SQL 2005 data mining tutorials and have encountered an error that states:
Query (1,6) The '[System].[Microsoft].[AnalysisServices].[System].[DataMining].[AssociationRules].[GetStatistics]' function does not exist.
After looking around for a bit it appeared that this function is generally used in a stored procedure run from the server. Is this correct? What do I need to do to either correctly import the function into the project or to make sure that it is installed on the server?
We are running the Ent Ed. now, however, we just upgraded to this after I had already started on this project. Could that be a problem?
What would be the right design approach for the following problem?
I have a single table called SelectionFactors, which has the following columns and sample data:
ProjectID  Factor        FactorValue
1000       Countries     USA
1000       Countries     Canada
1000       Countries     France
1000       Languages     English
1000       Languages     French
1000       Company Type  Consulting
1000       Company Type  Software
2000       Countries     India
2000       Countries     China
2000       Countries     USA
2000       Languages     English
2000       Languages     Chinese (Simplified)
2000       Languages     Chinese (Traditional)
2000       Languages     Spanish
2000       Company Type  Retail
2000       Company Type  Dairy Products
The problem is to allow a descriptive analysis of the data to find patterns in the users' selections. For instance, if Languages->English is selected, what are the counts of projects for the other Factor->FactorValue combinations? Countries->USA = 2, Countries->Canada = 1, Company Type->Consulting = 1 and so on.
Since all the data is in this single table, are both the case and nested tables the same? What are the keys and inputs? I only need a descriptive analysis (no prediction) and ALL possible combinations MUST be part of the results; how should the model be designed?
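To pin down the counts I am after, here is a small sketch in plain Python (not a mining model) over the sample rows above. The function name is made up, and it assumes each (ProjectID, Factor, FactorValue) row should be counted once per project.

```python
from collections import Counter

# Sample rows from the SelectionFactors table above.
ROWS = [
    (1000, "Countries", "USA"), (1000, "Countries", "Canada"),
    (1000, "Countries", "France"), (1000, "Languages", "English"),
    (1000, "Languages", "French"), (1000, "Company Type", "Consulting"),
    (1000, "Company Type", "Software"), (2000, "Countries", "India"),
    (2000, "Countries", "China"), (2000, "Countries", "USA"),
    (2000, "Languages", "English"),
    (2000, "Languages", "Chinese (Simplified)"),
    (2000, "Languages", "Chinese (Traditional)"),
    (2000, "Languages", "Spanish"), (2000, "Company Type", "Retail"),
    (2000, "Company Type", "Dairy Products"),
]


def co_selection_counts(rows, factor, value):
    """Count projects per other (Factor, FactorValue) among the projects
    that selected the given factor/value pair."""
    selected = (factor, value)
    projects = {p for p, f, v in rows if (f, v) == selected}
    counts = Counter()
    # set(rows) removes duplicate rows so each project counts once per pair.
    for p, f, v in set(rows):
        if p in projects and (f, v) != selected:
            counts[(f, v)] += 1
    return counts


counts = co_selection_counts(ROWS, "Languages", "English")
```

Here `counts[("Countries", "USA")]` is 2, while `counts[("Countries", "Canada")]` and `counts[("Company Type", "Consulting")]` are both 1, matching the figures in the problem statement.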
Similar to the BizTalk Business Rules Engine, does SSIS provide any support for independent business rule management? I would guess that through the use of embedded CLR we can invoke the business rules stored in a business rules engine (either the BizTalk Rule Engine or the ILOG Rule Engine). Is there a construct in SSIS similar to the Call Rules construct in BizTalk orchestrations?
If this is not the preferred mechanism, are there any other mechanisms offered by SSIS that facilitate the use of business rule engine components within SSIS and make it easier to manage the business rules outside of the deployment?
I am a beginner in data mining (Analysis Manager, SQL Server 2000). I have created some mining models with Microsoft_Decision_Trees or Clustering and that works OK, but now I want to create a model that answers "If customer 1 has product A in their basket, what products should I recommend?".
I have read that "Microsoft analysis manager sp1" supports association rules. I installed pack 4, but I don't see anything new. How do I create a model with association rules?
MS uses the Apriori algorithm in Association Rules, while other DM software has gone to the Novel Algorithm. Can you tell us why MS decided to stay with Apriori? Did you overcome the limitations that it's accused of having? Thanks!