06-07-2013 10:34 AM
Hi everyone, I have a new computer on the way and I am hoping for some insight on installing SAS to it.
The main purpose of the computer is for ETL and Enterprise Miner / Text Miner work. Other software that will be installed includes:
As you can see from the list of software, the computer will have a variety of use cases, but the most important one right now is ETL and Data/Text Mining.
The Computer specs are:
Windows 7 Ultimate 64bit
Dual Processors: Two Intel Xeon E5-2680 2.70 20MB 1600 8C
NVIDIA Quadro K4000 3GB DL-DVI(I)+DP+DP 1st
128GB DDR3-1600 (8x8GB+8x8GB) 2CPU Registered RAM
512GB SATA 1st Solid State Drive
600GB 15k RPM SAS 2nd Hard Drive
1TB 10K RPM SATA 3rd Hard Drive
The computer has 3 disks. I’m planning on doing the following:
SSD - OS and software
600GB 15k – Scratch space (….is this where SASWORK goes?)
1TB 10K – General storage / backup
I have 7 questions:
06-12-2013 02:11 PM
Looked at the setup you are proposing. I am a little puzzled by your hardware decisions. I find this to be a pretty "big" machine to still use a client version of Windows or even Windows at all. I see you want to use Visual Studio so this is probably the reason why you opted for Windows. If you are flexible on the operating system choice, I would recommend installing Linux instead of Windows.
For your disk system, have you considered using a RAID 5 made of maybe 4 or 5 of thos 1 TB 10 KRPM disks? You'll get data protection and increase IOPS. You have to keep in mind that SAS usually requires about 50-75 MB/s of I/O per CPU core to ensure all CPU cores are "busy". As for your other disks, having 2 SSDs (one for OS/Apps and one for work) would be the best. If you don't want to purchase 2 SSDs, I would put the OS and Apps on a HD and SASWORK on the SSD. Yes, your SSD's lifespan will be much shorter but the performance gain you'll observe are definitely worth it. Also, as you seam to understand, SAS does mostly sequential I/O so this help "save" your SSD.
Read-Ahead and Write-Behind is useful. But multi-pathing only applies if you are using a SAN which does not seams to be your case.
For the procs, I'm not sure what the answer is. I rarely see people using the desktop version of SAS anymore so I can hardly tell, best thing would be to check with your sales rep.
I hope this helps a little.
06-12-2013 02:31 PM
Hi Yoan. Thanks for the feedback.
The Windows OS is both a result of IT (they don't support Linux) because a server based approach has little return on investment for us at this point in time.
I am considering RAID after doing more reading and felt that the OS and Apps would be on an SSD and the scratch disk (pureley temporary space) would be RAID 0 using at least two 1TB 10,000 RPM drives. (Although I could use an SSD instead. Will the SSD be much faster than the RAID0 of two 10K RPM drives?).
I was also thinking of doing another two 10K RPM drives on RAID1 for file storage, but perhaps this introduces a bottleneck when my project files are located there? Perhaps this is why you suggested RAID5 of a number of drives?
06-12-2013 02:42 PM
I understand the situation with your IT dept. Keep in mind, Linux is not only for server applications, it can be used as a desktop operating system too and a very great one actually.
OS and Apps don't require that much I/O so this is not where I would invest the most. A good 512 GB SSD will outperform any 2 drive RAID 0 setup in any application. You need at least 4 HD in RAID 0 to match a good 512 GB SSD. A good rule of thumb is: SSD for speed, HD for volume.
Going RAID 1 for storage is very good for data security. But you will get poor write performance and very low capacity compared to a comparable RAID 5 setup. The only issue with RAID 5 is that you will need AT LEAST 3 disks. Also, RAID 5 does not take random IO as well as RAID 1 or RAID 0 but with SAS you'll get mostly sequentiel IO so you don't have to worry about this.
Here is something to think about (please keep in mind that this is oversimplified): you have 16 cores, so you should aim for 800 MB/s of IO throughput. This is a lot but your SSD should give you close to 500 MB/s and it is safe to expect 170-200 MB/s from a 3 disk RAID 5 with a good controller. So if you plan your workload correctly, you should be able to keep this CPU busy!