Large program....
Aug 10, 2013 19:18:57 GMT -5
Post by ezmoney on Aug 10, 2013 19:18:57 GMT -5
I am trying to do some Market analsys.
There are nearly 8000 stocks and each 100 days of 6 values.
I want to do a correlation coefficient of all stocks.
I guess I could break it down and divide the stocks by symbol.
This would generate 8000 files for the 100 days.
Thus I would pull in the stock1 data and stock2 data do the correlation then read stock3 data and go again to the limit of 8000.
Once there the stock1 would be replaced with stock 2 and start
from 1 to 8000 again not doing anything that had already been done like stock1 and stock2 and stock2 and stock1.
That would generate the same and just a waist if time.
Nor corelating anything with itself.
I was wondering if Sqlight would help hold the data in one file.
Make easy retrival and maybe update with minimum programming.
The thing about using that is when it closes the data is gone.
Thus I could generate the data file and then use it for different
analsys.
I could do it as a random file also.
Any suggestions?
I don't care if it runs all week.
I only have to do half and never correlate anything with itself.
The amount of data is very high and the out put is one line for
each correlation.
I'll have to dump the output to file and then read it.
Basically as I see it is to load the data and start the algorithm
to do each stock with each other stock.
Once some value is placed in the array then it would never
do those two again.
There are nearly 8000 stocks and each 100 days of 6 values.
I want to do a correlation coefficient of all stocks.
I guess I could break it down and divide the stocks by symbol.
This would generate 8000 files for the 100 days.
Thus I would pull in the stock1 data and stock2 data do the correlation then read stock3 data and go again to the limit of 8000.
Once there the stock1 would be replaced with stock 2 and start
from 1 to 8000 again not doing anything that had already been done like stock1 and stock2 and stock2 and stock1.
That would generate the same and just a waist if time.
Nor corelating anything with itself.
I was wondering if Sqlight would help hold the data in one file.
Make easy retrival and maybe update with minimum programming.
The thing about using that is when it closes the data is gone.
Thus I could generate the data file and then use it for different
analsys.
I could do it as a random file also.
Any suggestions?
I don't care if it runs all week.
I only have to do half and never correlate anything with itself.
The amount of data is very high and the out put is one line for
each correlation.
I'll have to dump the output to file and then read it.
Basically as I see it is to load the data and start the algorithm
to do each stock with each other stock.
Once some value is placed in the array then it would never
do those two again.