<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795723#M39066</link>
    <description>Hello Paige, it is 182,000 observations with 5359 variables. Even the proc print step, it just keep running without stop.</description>
    <pubDate>Fri, 11 Feb 2022 17:53:01 GMT</pubDate>
    <dc:creator>kellychan84</dc:creator>
    <dc:date>2022-02-11T17:53:01Z</dc:date>
    <item>
      <title>keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795702#M39060</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am using the following codes to do PCA analysis with large dataset.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=""&gt;data beta_diversity;
  length treatment $20;
  Infile "/home/u39233094/sasuser.v94/Thesis/CSV file/Cecal beta-diversity for SAS PCA delete.csv" dlm="," firstobs=2;
  input treatment$ ASV1-ASV5359;
run;
Proc print data=beta_diversity;
run;
ods graphics on;
proc princomp data=beta_diversity         /* use N= option to specify number of PCs */
              STD               /* optional: stdize PC scores to unit variance */
              out=PCAout         /* only needed to demonstate corr(PC, orig vars) */
              plots=(scree profile pattern score);
var _numeric_;  /* or use _NUMERIC_ */
ods output Eigenvectors=EV;  /* to create loadings plot, output this table */
run;
proc sgplot data=PCAout aspect=1;
   scatter x=prin1 y=prin2 / group=treatment;
   xaxis grid label="PC1 (%)";
   yaxis grid label="PC2 (%)";
run;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;My data matrix is like this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="kellychan84_0-1644598005835.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/68396i5BA1445FCF5B7FDF/image-size/medium?v=v2&amp;amp;px=400" role="button" title="kellychan84_0-1644598005835.png" alt="kellychan84_0-1644598005835.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The SAS studio kept running and did not show results. I don't know what is wrong with the procedures. Could anyone give me some hints on this. Thank you very much in advance!!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 16:49:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795702#M39060</guid>
      <dc:creator>kellychan84</dc:creator>
      <dc:date>2022-02-11T16:49:56Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795703#M39061</link>
      <description>&lt;P&gt;First identify which step is resulting in SAS Studio not responding&amp;nbsp;&lt;BR /&gt;Which Data Step / PROC is running and not responding&lt;BR /&gt;Have you tried running it step by step (run the data step, then the PROC PRINT)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 17:16:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795703#M39061</guid>
      <dc:creator>AMSAS</dc:creator>
      <dc:date>2022-02-11T17:16:55Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795705#M39062</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I don't think a PROC PRINT is a good idea on a dataset with thousands of columns and possibly millions of rows.&lt;/P&gt;
&lt;P&gt;Writing this output to the Results window in HTML format takes ages.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Browse your data in a data pane (from the library pane) right after the data-step.&lt;/P&gt;
&lt;P&gt;Or use the obs= data set option in PROC PRINT.&lt;BR /&gt;Like :&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;Proc print data=beta_diversity(obs=10); run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Cheers,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 17:25:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795705#M39062</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2022-02-11T17:25:50Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795706#M39063</link>
      <description>&lt;P&gt;How large is "large"? As in how many observations are in your data?&lt;/P&gt;
&lt;P&gt;How long did you allow the program to run?&lt;/P&gt;
&lt;P&gt;If your data is large you might well want to remove that Proc print. Just creating the output table for a large data set can take a lot of time and resources. Plus if it is that "large" what do you get out of looking at printed output?&lt;/P&gt;
&lt;P&gt;Which piece "kept running"? Run one procedure or data step at a time.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You may need to go to this link &lt;A href="https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.4/statug/statug_princomp_details05.htm" target="_blank"&gt;https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.4/statug/statug_princomp_details05.htm&lt;/A&gt; and check on the computation resources.&lt;/P&gt;
&lt;P&gt;The above will you show you how much memory may be needed. If the princomp procedure needs more memory than you have in your system then it may be spending a lot of time writing data used for computations to disk and then rereading it.&lt;/P&gt;
&lt;P&gt;Also there is a formula showing the relationship of variables and records to time. Not that time goes up to calculate the correlation matrix, very roughly, as the square of the number of variables and for eigenvalues the cube of the number of variables. So with 359 variables I am not surprised that there might be some time to run.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 17:28:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795706#M39063</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2022-02-11T17:28:48Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795713#M39064</link>
      <description>&lt;P&gt;How many observations? How many variables?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you just want to plot PRIN1 and PRIN2, then you can run PROC PRINCOMP with the option N=2, which may take a lot less time depending on how the algorithm is coded in SAS.&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 17:46:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795713#M39064</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2022-02-11T17:46:56Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795720#M39065</link>
      <description>Thank you for your reply. I will check the computaion resources later. I calculate there are 182,000 observations of my data. Beause SAS studio is easy to quit and each time I let it run until the paltform quits. Even the print step takes like forever. Will it be better to use SAS PC version to run this kind of large data set?</description>
      <pubDate>Fri, 11 Feb 2022 17:49:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795720#M39065</guid>
      <dc:creator>kellychan84</dc:creator>
      <dc:date>2022-02-11T17:49:58Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795723#M39066</link>
      <description>Hello Paige, it is 182,000 observations with 5359 variables. Even the proc print step, it just keep running without stop.</description>
      <pubDate>Fri, 11 Feb 2022 17:53:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795723#M39066</guid>
      <dc:creator>kellychan84</dc:creator>
      <dc:date>2022-02-11T17:53:01Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795725#M39067</link>
      <description>&lt;P&gt;As others have said, it is pointless (and very time consuming) to do a PROC PRINT on this data. Just remove PROC PRINT from your code and run it again.&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 17:58:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795725#M39067</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2022-02-11T17:58:49Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795727#M39068</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/339127"&gt;@kellychan84&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;Hello Paige, it is 182,000 observations with 5359 variables. Even the proc print step, it just keep running without stop.&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Do NOT run PROC PRINT on such a large dataset. Are you actually going to read all 57 gazillion pages that would produce?&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 18:00:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795727#M39068</guid>
      <dc:creator>Tom</dc:creator>
      <dc:date>2022-02-11T18:00:37Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795730#M39069</link>
      <description>Hello Koen, Evan in 10 observations, it did not work. I try 10 mins. I don't know why? If I input only 10 variables of the data set, it comes out quickly.</description>
      <pubDate>Fri, 11 Feb 2022 18:01:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795730#M39069</guid>
      <dc:creator>kellychan84</dc:creator>
      <dc:date>2022-02-11T18:01:55Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795731#M39070</link>
      <description>I only run the first input step. it shows 5000 column, but may data set has 5359 column. That means something wrong here and can not proceed?</description>
      <pubDate>Fri, 11 Feb 2022 18:03:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795731#M39070</guid>
      <dc:creator>kellychan84</dc:creator>
      <dc:date>2022-02-11T18:03:45Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795733#M39071</link>
      <description>At first, I did not run print, I add it later then to check are there something at the first input step. now I remove it. Thank you!</description>
      <pubDate>Fri, 11 Feb 2022 18:05:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795733#M39071</guid>
      <dc:creator>kellychan84</dc:creator>
      <dc:date>2022-02-11T18:05:14Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795737#M39072</link>
      <description>&lt;P&gt;Are you sure you told SAS to read the complete lines of data?&lt;/P&gt;
&lt;P&gt;Check the notes in SAS log, it will show the minimum and maximum line lengths read from the file.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Note that the default line length INFILE will use is currently 32,767 bytes.&amp;nbsp; If your lines are longer than that then you need to add the LRECL= option to your data step.&amp;nbsp; There is not any real downsize to setting it longer (other than your data step might require a little more memory while it is running).&lt;/P&gt;
&lt;P&gt;While you are at it add other common sense options like DSD (so that missing values and values with commas are treated properly) and TRUNCOVER (so INPUT does not move to a new line if there are too few values on the line.)&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data beta_diversity;
  length treatment $20;
  infile "/home/u39233094/sasuser.v94/Thesis/CSV file/Cecal beta-diversity for SAS PCA delete.csv" 
    dlm="," dsd truncover lrel=1000000 firstobs=2
  ;
  input treatment ASV1-ASV5359;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 18:16:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795737#M39072</guid>
      <dc:creator>Tom</dc:creator>
      <dc:date>2022-02-11T18:16:03Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795739#M39073</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am sure SAS allows more than 1 million variables (columns).&lt;/P&gt;
&lt;P&gt;There's no reason your dataset is limited to 5000 instead of &lt;SPAN&gt;5359.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Maybe you need to specify :&lt;/SPAN&gt;&lt;/P&gt;
&lt;DIV class="xisDoc-syntax"&gt;
&lt;DIV class="xisDoc-syntaxDescription"&gt;
&lt;DIV class="xisDoc-otherArgGroup"&gt;
&lt;DIV id="p1iciced84ejncn1lydukm56d8zg" class="xisDoc-argDescriptionPair"&gt;
&lt;H4 class="xisDoc-argument"&gt;LRECL=&lt;EM class="xisDoc-userSuppliedValue"&gt;logical-record-length&lt;/EM&gt;&lt;/H4&gt;
&lt;P&gt;on infile statement.&amp;nbsp;&lt;/P&gt;
&lt;DIV class="xisDoc-syntax"&gt;
&lt;DIV class="xisDoc-syntaxDescription"&gt;
&lt;DIV class="xisDoc-otherArgGroup"&gt;
&lt;DIV id="p1iciced84ejncn1lydukm56d8zg" class="xisDoc-argDescriptionPair"&gt;
&lt;DIV class="xisDoc-argumentDescription"&gt;
&lt;SECTION class="xisDoc-tableWrap"&gt;
&lt;TABLE class="xisDoc-summary"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD class="xisDoc-summaryText"&gt;LRECL= specifies the physical line length of the file. LINESIZE= tells the INPUT statement how much of the line to read.&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;/SECTION&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Cheers,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;</description>
      <pubDate>Fri, 11 Feb 2022 18:19:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795739#M39073</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2022-02-11T18:19:46Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795742#M39074</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/339127"&gt;@kellychan84&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;I only run the first input step. it shows 5000 column, but may data set has 5359 column. That means something wrong here and can not proceed?&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;That does not mean anything is wrong.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 18:23:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795742#M39074</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2022-02-11T18:23:47Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795747#M39075</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/339127"&gt;@kellychan84&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;Hello Koen, Evan in 10 observations, it did not work. I try 10 mins. I don't know why? If I input only 10 variables of the data set, it comes out quickly.&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;As I said earlier, try running the PROC PRINCOMP with the option N=2 (or N=5 or whatever), you don't need all 5359 dimensions.&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 18:43:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795747#M39075</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2022-02-11T18:43:38Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795755#M39076</link>
      <description>I change the code according to your suggestions. it is still running.</description>
      <pubDate>Fri, 11 Feb 2022 19:02:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795755#M39076</guid>
      <dc:creator>kellychan84</dc:creator>
      <dc:date>2022-02-11T19:02:35Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795757#M39077</link>
      <description>&lt;P&gt;That should be quicker, but I still don't know how long it will take to compute N=2 dimensions for 180,000 records and 5359 variables.&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 19:06:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795757#M39077</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2022-02-11T19:06:24Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795763#M39078</link>
      <description>Hello Tom, it seems it is not working, already run for 20 mins.</description>
      <pubDate>Fri, 11 Feb 2022 19:21:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795763#M39078</guid>
      <dc:creator>kellychan84</dc:creator>
      <dc:date>2022-02-11T19:21:18Z</dc:date>
    </item>
    <item>
      <title>Re: keeping runing process when doing PCA analysis with large dataset of microbial ASV data</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795764#M39079</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/339127"&gt;@kellychan84&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;&lt;STRONG&gt;. Even the print step takes like forever. Will it be better to use SAS PC version to run this kind of large data set?&lt;/STRONG&gt;&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Why are you bothering to print it at all? 180,000 lines with everything fitting on one line would be something like 2,250 pages. When add in 400 variables that means each line is likely only showing 12 or so values, so you with 5000+ (on rereading your code) variables each observation of the data set would take about 447 lines to display, times the 2250 single lines that would be on the order of a document with maybe a million or more pages. You do not read that sort of stuff. Get rid of the proc print. If the only purpose of the proc print was to verify that the data was read use the OBS= data set option to display maybe 10 observations&lt;/P&gt;
&lt;PRE&gt;Proc print data=beta_diversity (obs=10) ;
run;&lt;/PRE&gt;
&lt;P&gt;But I think the 5000 variables in Princomp still takes a lot of time.&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 19:22:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/keeping-runing-process-when-doing-PCA-analysis-with-large/m-p/795764#M39079</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2022-02-11T19:22:47Z</dc:date>
    </item>
  </channel>
</rss>

