06-06-2017 05:35 AM
I want to draw a plot that helps me compare two variables. The issue is that I have around 1 million rows, so I am not able to use a line plot.
proc gplot data=Data_name;
plot (y1 y2)*xavr / overlay legend=legend1;
run; quit;
06-06-2017 05:40 AM
Is plotting 1m records the best way to compare some data? I wouldn't think so. Think of a piece of paper, then put 1m dots on it: would they all fit? No, not even close. I would think there are better methods, e.g. distinct lists of values, proc compare by variables, excluding the same values, etc.
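As a rough sketch of the proc compare idea, assuming y1 and y2 sit in the same data set (Data_name, as in the original post), the VAR and WITH statements let you compare the two columns row by row without a second data set. This is only one way to set it up, not necessarily the best for your data:

```sas
/* Sketch only: compares y1 against y2 within Data_name.        */
/* With no COMPARE= data set, VAR/WITH pair up the two columns. */
proc compare base=Data_name;
  var y1;    /* first variable to compare  */
  with y2;   /* variable it is compared to */
run;
```

The report summarizes how many rows differ and prints only a limited number of unequal values, so it stays readable even with 1m rows.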
06-06-2017 07:07 AM
If you are trying to identify where the variables are most different, a scatter plot of y1 vs y2 would do it, since they are evaluated at the same time points. Use semitransparent markers to reduce overplotting.
proc sgplot data=Data_name;
  scatter x=y1 y=y2 / transparency=0.8;
run;
If you need to visualize the XVAR information, you can use the COLORRESPONSE=XVAR option.
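A minimal sketch of that option, assuming the x variable is actually named xvar in your data (the original post spells it xavr, so adjust as needed) and that your SAS 9.4 maintenance release is recent enough to support COLORRESPONSE= on the SCATTER statement:

```sas
/* Sketch only: xvar is an assumed variable name, change it to match your data. */
proc sgplot data=Data_name;
  scatter x=y1 y=y2 / colorresponse=xvar   /* color each marker by the xvar value */
                      transparency=0.8;    /* keep markers semitransparent        */
run;
```

Markers are then colored on a continuous gradient by xvar, so you can see which part of the x range the largest y1/y2 differences come from.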