I have to agree, the results were too far off to be accurate. How is that for TIM cure time? I also like the way the amount of burn-in time changed. I guess they had some kind of "feeling" for when it was right.
What we need is to have tests done on one test bed with C2D and Quad core CPU. Then single die CPU testing would give us a better all around picture.