Does anyone have any references around optimizing the initial processing time for a data mining model? Books, blogs, etc. I have the Wiley “Data Mining with
SQL Server 2008” book, and while I’ve learned a lot from it, it doesn’t seem to cover much around trouble shooting things like processing time. I also have a few other books that have a chapter or two on data mining, but again just basic
“here’s what it is and how to set it up”, nothing that quite covers trouble shooting or optimization. I’ve also checked out the various Data Mining blogs/sites.
I’ve got a Microsoft Time Series model I’m basing on a Cube. Very simple, trying to forecast sales. I have one dimension which is the list of products
(about 1600), a second which is the time dimension, finally the measure is daily sales figures for each, about 3.2 million rows in total. On a brand new server, with two quad core processors and 16 gig of ram it took 40 hours to process. Seems rather high?