About xtc283x

xtc283x · ‎03-24-2024

That input code was copied and pasted from the full programming code and is an approach developed over years of coding. It works for me. Thanks for your comments.

xtc283x · ‎03-24-2024

Ballardw- Thank you for your response. The strip() function was the answer. Best regards,xtc283x

xtc283x · ‎03-24-2024

Using 'proc sort nodupkey' on a single text field containing names is not removing duplicates. This is happening even after compressing the field to remove blanks, punctuation, diacritical marks, etc. In other words, printing and visually examining the text field does not reveal any obvious differences in the duplicates such as minor spelling differences, case sensitivity, etc.. Confirmation of this was made by passing the resulting text field into Excel and then reading that Excel file back into SAS. This extra step produces a text field from which all duplicate names can be stripped using 'proc sort nodupkey'. 2952 data test;infile 'c:\data\analyses\data\directors.txt' lrecl=1500 firstobs=2 dlm='09'x dsd 2952! missover; 2953 length DIRECTOR $50.; 2954 input director; 2955 run; NOTE: The infile 'c:\data\analyses\data\directors.txt' is: Filename=c:\data\analyses\data\directors.txt, RECFM=V,LRECL=1500,File Size (bytes)=751665, Last Modified=24Mar2024:12:53:50, Create Time=24Mar2024:12:53:50 NOTE: 46774 records were read from the infile 'c:\data\analyses\data\directors.txt'. The minimum record length was 1. The maximum record length was 40. NOTE: The data set WORK.TEST has 46774 observations and 1 variables. NOTE: DATA statement used (Total process time): real time 0.04 seconds cpu time 0.00 seconds 2956 proc sort nodupkey; 2957 by director; 2958 run; NOTE: There were 46774 observations read from the data set WORK.TEST. NOTE: 1541 observations with duplicate key values were deleted. NOTE: The data set WORK.TEST has 45233 observations and 1 variables. NOTE: PROCEDURE SORT used (Total process time): real time 0.01 seconds cpu time 0.00 seconds Given that, the problem must be in how the underlying information was stored, e.g., hex vs ASCII vs EBCDIC, issues which are not a spike for me. Obviously, I don't want to have to pass files back and forth between SAS and Excel. My question is, How do I fix this text field in SAS?

xtc283x · ‎11-24-2020

The absence of 'real world issues' in a SAS world may not be due to the demand for millions and/or billions of features, parameters or variables since Google engineers are running algorithms of that magnitude on a routine basis, and more due SAS' inability to deliver and perform at that level. Just saying...

xtc283x · ‎11-23-2020

Good to know that the PDV permits more than 32k variables but that still doesn't pin the precise upper bound. With all due respect, things have changed a lot in recent years wrt any 'norm' about the max number of features, e.g., algorithms with millions, even billions, of parameters are pretty routine in the ML community.

xtc283x · ‎11-23-2020

Historically, the SAS PDV only permitted about 32k variables. Has this changed in recent years? If so, what is the current maximum number of variables allowed by the PDV?

xtc283x · ‎09-17-2019

Thanks for your response. However, I don't see five variates, I see one which comprises 100% of the variance. If the example used the OUTSTAT= option, you would see what this means.

xtc283x · ‎09-17-2019

Not having used the MANOVA/canonical option available from Proc GLM in some time, I'm surprised that my version of SAS only returns a single canonical variate regardless of the number of dependent variables are in the model statement. Nothing in the documentation suggests that there is an option for requesting or specifying the number of canonical variates. Is anyone aware of a workaround to this? In other words, how do you get Proc GLM/MANOVA to return more than one canonical variate? Thank you.

xtc283x · ‎10-09-2018

Of course you are correct. My question concerns the availability of multivariate procedures that substitute the GMD for the variance, e.g., as in ANOVA, renamed as ANOGMD. Thank you.

xtc283x · ‎10-05-2018

Shlomo Yitzhaki's book The Gini Methodology: A Primer on a Statistical Methodology offers a compendium of suggestions regarding the use of Gini's Mean Difference (GMD) as a measure of variability whenever an analyst is not ready to impose, without questioning, the convenient, well-developed world of linear, symmetric and normally distributed information. As he notes, one of the advantages of the GMD is that classic variance estimation is a subset class, in other words, when information is linear, symmetric and normally distributed the GMD offers no new information. Yitzhaki proposes substituting GMD estimation into ANOVA, regression, and so on, basically any multivariate technique which has classic covariance structures at its core. My question is this: does SAS offer any statistical procedures leveraging the GMD...that is beyond use of, e.g., the Gini coefficient and concentration curves regarding income inequality?

xtc283x · ‎08-07-2018

From other responses to HDF5 queries I can see that both SAS/JMP and the SAS/R/IML interface will allow SAS access to HDF5 data. In addition respondents have noted that HDF has an export dump into ASCII or .csv files. My question concerns the HDF ODBC Connector which allows SAS to directly access HDF5 files, or so the HDF Group claims. https://www.hdfgroup.org/downloads/hdf5-odbc-connector/ The HDF product costs $1,000. So before I shell out the dough, does anyone have any direct experience in using this product to connect SAS with data stored in HDF5 format?

xtc283x · ‎07-12-2018

Quite large...millions of records. Thank you!

xtc283x · ‎07-11-2018

Proc NPAR1WAY in v9.4 contains an option for the two sample Hodges-Lehmann location estimate but, based on my survey of the SAS Support literature, there does not appear to be a procedure supporting a univariate HL estimate nor does there appear to be a computationally feasible approach to producing on with large n. Is this correct?

xtc283x · ‎03-09-2016

Right you are! Thank you.

xtc283x · ‎03-09-2016

As part of the output from a Proc Mixed model, I am requesting that the ODS table TESTS3 be ouput. This table contains summary statistics wrt the performance of the independent variables in the model. However, when I run this module on data, I get an error that TESTS3 could not be created without an explanation, here's what SAS says, "WARNING: Output 'tests3' was not created. Make sure that the output object name, label, or path is spelled correctly. Also, verify that the appropriate procedure options are used to produce the requested output object. For example, verify that the NOPRINT option is not used." Otherwise, the model converges and does everything else it's supposed to do. Here is my code: proc mixed data=resids covtest noclprint namelen=50 METHOD=MIVQUE0; class day month year networktype orig_acq genre net; model LOGLIVE7RATING2=trend programtenure totallynewprogram censored dayssincelastbroadcastrecp nads_sqrt trend*nads_sqrt day month*year networktype ORIG_ACQ genre /solution s ddfm=bw notest outp=residsx(keep=net pgm trend pred live7rating loglive7rating2 where=(loglive7rating2=.)); random intercept trend/sub=net type=un g; ods output tests3=tests;*summary stats; run; I have highlighted the offending line of code. Any ideas as to what's going on? Thank you!

Online Status	Offline
Date Last Visited	‎03-24-2024 11:28 PM

Re: Removing duplicate names from a text field

Re: Removing duplicate names from a text field

Removing duplicate names from a text field

Re: Maximum number of variables allowed using SAS

Re: Maximum number of variables allowed using SAS

Maximum number of variables allowed using SAS

Re: Specifying the number of canonical variates in a MANOVA

Specifying the number of canonical variates in a MANOVA

Re: Procedures Leveraging Gini Mean Difference Methods in SAS

Procedures Leveraging Gini Mean Difference Methods in SAS

Re: Removing duplicate names from a text field

Re: Removing duplicate names from a text field

Re: Removing duplicate names from a text field

Re: Removing duplicate names from a text field

Re: Removing duplicate names from a text field

Re: Removing duplicate names from a text field

Re: Removing duplicate names from a text field

Removing duplicate names from a text field

Re: Maximum number of variables allowed using SAS

Re: Maximum number of variables allowed using SAS

Maximum number of variables allowed using SAS

Re: Specifying the number of canonical variates in a MANOVA

Specifying the number of canonical variates in a MANOVA

Re: Procedures Leveraging Gini Mean Difference Methods in SAS

Procedures Leveraging Gini Mean Difference Methods in SAS

Does anyone have experience using HDF5 ODBC Connector with SAS?

Re: Univariate Hodges-Lehmann Estimator?

Univariate Hodges-Lehmann Estimator?

Re: Proc Mixed ODS TESTS3 Output

Proc Mixed ODS TESTS3 Output