Don't Miss the Gorilla in the Data!
Had to do a quick scatter plot of the data given to students to analyze from Itai Yanai and Martin Lercher's A Hypothesis is a Liability, which is currently getting lots of Twitter love.
*==> Fun w/SAS ODS Graphics: Don't Miss the Gorilla in the Data!
Reference: A Hypothesis is a Liability, by Itai Yanai & Martin Lercher
https://genomebiology.biomedcentral.com/articles/10.1186/s13059-020-02133-w
Dropbox data files courtesy of the researchers;
filename w url 'https://www.dropbox.com/s/685pkte3n3879mn/data9b_w.txt/?dl=1'; * Women's data;
proc import datafile=w dbms=dlm out=w replace; delimiter='09'x;
filename m url 'https://www.dropbox.com/s/r3wyn2ex20glsoa/data9b_m.txt/?dl=1'; * Men's data;
proc import datafile=m dbms=dlm out=m replace; delimiter='09'x;
proc sql; * Merge data together, assign category;
create table StepsBMI as
select 'WOMEN' as Category, w.* from w union all select 'MEN', m.* from m;
proc sgplot aspect=1 noborder nowall; * Quick scatter plot to see what we've got!;
styleattrs datacontrastcolors=(blue red);
scatter x=steps y=bmi / group=category markerattrs=(symbol=circlefilled size=5pt);
keylegend / location=inside position=topright down=2 noborder title="";
xaxis display=(noline noticks) grid gridattrs=(thickness=1pt color=darkgrey) valuesformat=comma9.;
yaxis display=(noline noticks) grid gridattrs=(thickness=1pt color=darkgrey);
label bmi="BODY MASS INDEX" steps="STEPS";