Is there any impact of increasing the datapaths in Linux environment from 8 to 16...?
Is there would be any benefit if we increase the Indexpath from 1 to 2...
Any suggestions...?
What kind of file system do you use?
If your data is on a SAN, no of paths usually does not affect performance that much. If you have local file systems assigning additional paths to exclusive physical paths could improve performance.
Index path behave different from data path, it's more of spill over paths. SPDS does not spread indexes between those paths. So I believe that having more index paths is not increasing performance.
Hi Linush,
Data is not on SAN, but yes the new file systems are on SAN, and by increasing the datapaths should increase the performance.
Parrallelism would be achieved in that case. Once all these file systems have been increased, Is there any other way for faster retrieval(access) of data from these datapaths.
Yes, i do agree with your index paths. There it will not improve any performance.
If you think that it should increase performance, why do you ask?
To take advantage of 16 partitions fully, each spds process needs to launch as many, or even more threads. How many CPU/cores do you have? And aren't there any concurrent jobs/users? They need to share those cores.
To point at some test case (it's not facts, just a test case), where it showed that after having more than 12 data partitions, it didn't gain any performance (and that with a 12 core server).http://support.sas.com/resources/papers/partitiontables.pdf
There are some papers on the subject of optimizing SPDS, not much about SAN specifically, but most concepts could apply anyway.
http://support.sas.com/resources/papers/tnote/tnote_spds.html
You'll probably want to work close with the server/SAN administrators for help on monitoring, the server, connection(s) and disks.
Thanks for your wonderful comments...
We have 16 Processors\CPU\cores as of now...But planning to move to 24 cores\processors\CPU in next month... Pls advice in that case.
Yes, we do have some concurrent users too that are hitting the server on a regular basis.
Again, I think that no of data paths is not so important when having data on a SAN. SAN administrators tend to scramble the physical disks into small chunks (LUNs), which they can assign to different usages. This means that your file system is just a virtual one, and that you are probably sharing physical disks with other servers/applications, or, with other file systems on your own server.
Things to focus on should be:
Registration is now open for SAS Innovate 2025 , our biggest and most exciting global event of the year! Join us in Orlando, FL, May 6-9.
Sign up by Dec. 31 to get the 2024 rate of just $495.
Register now!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.