BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
PamG
Quartz | Level 8

I have  a data where the slope changes at  a certain point.  How do you find that  point of inflection for the two linear lines?  I think PROC NLIN would work but I can't get it to run.

 

Sample code is given below as I cannot share my data.  The point of inflection is 50.

 

DATA data1;
do i=1 to 1000;
x=rand("Integer", 1, 50);
Y=3 + 6*x + rannor(345);
drop i;
OUTPUT;
end;
RUN;

DATA data2;
do i=1 to 1000;
x=rand("Integer", 50, 100);
Y=2 + 8*x + rannor(345);
drop i;
OUTPUT;
end;
RUN;

data full; set data1 data2;run;

proc nlin data=full;
   parms alpha1=.45 beta1=.05 alpha2=.45 beta2=.05;
x0=5*beta1/alpha1;
 model y = (x <x0)*(alpha1+beta1*x) +
             (x>=x0)*(alpha2+beta2*x);
FILE log;put  / x0=  ; 
output out=b predicted=yp;
run;
 
1 ACCEPTED SOLUTION

Accepted Solutions
sbxkoenk
SAS Super FREQ

Hello @PamG ,

 

If your only interest is to know about the inflection point ,

then you can use PROC UCM (SAS/ETS) for structural break detection in a univariate time series.

I use your variable X as a time indicator. This is possible because in your sample data the X-values are equally spaced (fixed width interval).
Not sure how close X is to a real time series in your actual data (?).

DATA data1;
do i=1 to 1000;
x=rand("Integer", 1, 50);
Y=3 + 6*x + rannor(345);
drop i;
OUTPUT;
end;
RUN;

DATA data2;
do i=1 to 1000;
x=rand("Integer", 50, 100);
Y=2 + 8*x + rannor(345);
drop i;
OUTPUT;
end;
RUN;

data full; set data1 data2; run;
proc sort data=full; by x; run;

proc sgplot data=full;
  scatter x=x y=y;
run;

/* Accumulating Transactional Data into Time Series Data   */
/* Goal is to get Only 1 observation per unique value of X */
proc timeseries data=full out=tsfull;
   id x         interval=day
                accumulate=median
                setmiss=0
                ;
   var y;
run;

proc sgplot data=tsfull;
  scatter x=x y=y;
run;

ods trace off;
/* Detection of Level Shift by unobserved components model (UCM).                               */
/* (The UCMs are also called structural models in the time series literature.)                  */
/* You can check for breaks in the level by using the CHECKBREAK option in the LEVEL statement. */ 
/* The following statements fit a simple locally constant level plus constant slope             */
/* plus error model to the series:                                                              */ 
ODS EXCLUDE EstimationSpan ForecastSpan InitialParameters FitSummary
            ParameterEstimates FitStatistics ComponentSignificance
            TrendInformation Forecasts ForecastsOnlyPlot ;
ODS OUTPUT OutlierSummary=work.OutlierSummary;
proc ucm data=tsfull;
   id x interval=day;
   model y;
   irregular;
/* The fitted trend is nearly linear,                                          */ 
/* So I fix fix the level and/or slope variance components to zero             */
/* (via the VARIANCE=0 and NOEST options on the LEVEL and/or SLOPE statements) */
   level variance=0 noest /* plot=smooth */ checkbreak;
   slope variance=0 noest;
  *estimate;
  *forecast plot=decomp;
run;

data _NULL_;
 set work.OutlierSummary;
 file print;
 put 'inflection point is at ' time= 12.;
run;
/* end of program */


BR,

Koen

View solution in original post

9 REPLIES 9
PaigeMiller
Diamond | Level 26

I think PROC NLIN would work but I can't get it to run.

 

What is wrong with your code?

--
Paige Miller
PamG
Quartz | Level 8

The Hessian matrix is singular so I do not get parameter estimates.  I am not sure the code I have written for linear model is correct.

PaigeMiller
Diamond | Level 26

@PamG wrote:

The Hessian matrix is singular so I do not get parameter estimates.  I am not sure the code I have written for linear model is correct.


Thank you. In the future, key information like this should be in your first message.

 

This discusses the error message that says the Hessian is singular: https://communities.sas.com/t5/Statistical-Procedures/Problem-in-Proc-NLIN-The-approximate-Hessian-i... There may be other similar discussions here in the SAS communities, and you should search for them.

--
Paige Miller
sbxkoenk
SAS Super FREQ

Hello @PamG ,

 

If your only interest is to know about the inflection point ,

then you can use PROC UCM (SAS/ETS) for structural break detection in a univariate time series.

I use your variable X as a time indicator. This is possible because in your sample data the X-values are equally spaced (fixed width interval).
Not sure how close X is to a real time series in your actual data (?).

DATA data1;
do i=1 to 1000;
x=rand("Integer", 1, 50);
Y=3 + 6*x + rannor(345);
drop i;
OUTPUT;
end;
RUN;

DATA data2;
do i=1 to 1000;
x=rand("Integer", 50, 100);
Y=2 + 8*x + rannor(345);
drop i;
OUTPUT;
end;
RUN;

data full; set data1 data2; run;
proc sort data=full; by x; run;

proc sgplot data=full;
  scatter x=x y=y;
run;

/* Accumulating Transactional Data into Time Series Data   */
/* Goal is to get Only 1 observation per unique value of X */
proc timeseries data=full out=tsfull;
   id x         interval=day
                accumulate=median
                setmiss=0
                ;
   var y;
run;

proc sgplot data=tsfull;
  scatter x=x y=y;
run;

ods trace off;
/* Detection of Level Shift by unobserved components model (UCM).                               */
/* (The UCMs are also called structural models in the time series literature.)                  */
/* You can check for breaks in the level by using the CHECKBREAK option in the LEVEL statement. */ 
/* The following statements fit a simple locally constant level plus constant slope             */
/* plus error model to the series:                                                              */ 
ODS EXCLUDE EstimationSpan ForecastSpan InitialParameters FitSummary
            ParameterEstimates FitStatistics ComponentSignificance
            TrendInformation Forecasts ForecastsOnlyPlot ;
ODS OUTPUT OutlierSummary=work.OutlierSummary;
proc ucm data=tsfull;
   id x interval=day;
   model y;
   irregular;
/* The fitted trend is nearly linear,                                          */ 
/* So I fix fix the level and/or slope variance components to zero             */
/* (via the VARIANCE=0 and NOEST options on the LEVEL and/or SLOPE statements) */
   level variance=0 noest /* plot=smooth */ checkbreak;
   slope variance=0 noest;
  *estimate;
  *forecast plot=decomp;
run;

data _NULL_;
 set work.OutlierSummary;
 file print;
 put 'inflection point is at ' time= 12.;
run;
/* end of program */


BR,

Koen

sbxkoenk
SAS Super FREQ

On top of PROC UCM post above ,

see also :

 

Segmented regression models in SAS
By Rick Wicklin on The DO Loop December 14, 2020
https://blogs.sas.com/content/iml/2020/12/14/segmented-regression-sas.html

 

BR,

Koen

PamG
Quartz | Level 8
Thanks Koen. I do not have a time series data and I do not have a jump in the line. I am so sorry but I do not know how to create sample datasets to show continuous lines but with different slopes. I have (y,x) points and I would like to know where the joint point is that has no breaks.
ballardw
Super User

@PamG wrote:

I have  a data where the slope changes at  a certain point.  How do you find that  point of inflection for the two linear lines?

 

 

<Pendantic mode=on>

"Linear Lines" have no inflection point where the slope changes. Curves may.

<Pendatic mode=off>

 

Are you assuming both of your curves have inflection points at the same value? That may be pretty broad assumption.

 

Are you using a different definition of inflection other than a point on the curve where the slope changes sign?

 

BTW, if your data is so sensitive that a group of x,y pairs with no other identification causes concerns then perhaps you shouldn't be asking any questions on an open forum.

FreelanceReinh
Jade | Level 19

Hello @PamG,

 

Another option: Try PROC FASTCLUS to get a first estimate of the point of inflection and then use PROC ROBUSTREG for fine-tuning (see the proportions of outliers in the "Diagnostic Summary" table of the output) and estimation of the regression coefficients.

proc fastclus data=full out=clust maxclusters=2 maxiter=100;
var x y;
run;

proc means data=clust;
class cluster;
var x;
run;

proc sort data=clust;
by cluster;
run;

proc robustreg data=clust method=lts seed=2718;
by cluster;
model y=x;
run;

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!
How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 9 replies
  • 1203 views
  • 4 likes
  • 6 in conversation