Very interesting I wander if you could provide a download of code and data (30 million rows) used to produce the benchmarks on page 16. Also I rough Idea of hardware, software costs for rhe haddop site, number of simultaneous users and systemwide CPU utilization durring the benchmarks. I don't have haddop but I do have an off lease dell T7400($600 circa 2008), dual XEONS, two RAID 0 SSD arrays and 64 gb of ram. I would like to set up SPDE and compare my timings with your benchmarks. Also it would be nice in the future if you could provide inexpensive power worksattion comparisons, when the data is less than 1TB. My experience is that a old cheap power SAS workstation are up to an oder of magniture faster then servers at 90 cpu utilzation. Servers are often tuned to run at 90% or more(average workday load), otherwise the company is wasting money.
... View more