About DarthPathos

DarthPathos · ‎11-15-2019

Hi @soumri - I haven't actually tested this, but I think the easiest approach would be to run a few PROC SQL queries that generate count tables and then update your dataset accordingly.

- Only IDs with at least 3 tests for a particular RANK are considered.

PROC SQL;
    /* Count tests per ID for one rank; keep IDs with 3 or more */
    create table id_count as
    select ID, count(test_variable) as Cnt
    from table_name
    where rank = "1"
    group by ID
    having count(test_variable) >= 3
    order by ID;
QUIT;

- Only GROUPS that contain at least 5 different IDs in a given YEAR are considered.

PROC SQL;
    /* One row per group / ID / year combination */
    create table temp_table as
    select distinct group, ID, year
    from table_name;

    /* Keep group / year pairs with 5 or more distinct IDs */
    create table grp_cnt as
    select group, year, count(*) as Cnt
    from temp_table
    group by group, year
    having count(*) >= 5
    order by group, year;
QUIT;

- For each class of GYS there are at least 4 observations.

PROC SQL;
    /* One row per gys / ID combination */
    create table temp_table2 as
    select distinct gys, ID
    from table_name
    order by gys, ID;

    /* Keep gys classes with 4 or more rows */
    create table gys_cnt as
    select gys, count(*) as Cnt
    from temp_table2
    group by gys
    having count(*) >= 4
    order by gys;
QUIT;

You should then be able to take the results from these tables and generate new variables etc. as needed (for example, select a.*, b.Cnt from main_table a left join gys_cnt b on a.gys = b.gys, or something similar).

Hope this helps!
Chris

DarthPathos · ‎07-05-2019

After doing some more poking around (digging around numerous notebooks and folders) I can confirm that the only way to do this is as @Cynthia_sas suggested - modify the actual PowerPoint master template. That way you'll only need to do it once, and it'll always have the same look and feel as other presentations you work on. Here's the Microsoft page showing how to do this - I've done this before but not related to a SAS Output (rather, I needed to modify and add a logo to my slides). I hope this helps 🙂 Chris

DarthPathos · ‎07-04-2019

Hi Hanne,

I hope you're doing well. It's been a while since I've used ODS output for PowerPoint, so apologies that I can't be much help. I've looked in the books I have on ODS and can't seem to find anything. Having said that, I found this link that may be of use. Specifically, the line right near the bottom:

goptions hsize=3in vsize=3in dev=png;

If I'm remembering my ODS correctly, hsize and vsize can be modified to make the image as big or as small as needed on the slide. If my memory is correct, this should force the footers down, so it's not so much changing the footer directly as changing the content of the slide.

If you are able to purchase SAS Books, I highly recommend Output Delivery System: The Basics and Beyond; although it doesn't specifically talk about PowerPoint, it does cover a lot of other good information you may find useful. If this isn't what you need (or you've already tried it), let me know and I can do some more digging.

Chris
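To show where that GOPTIONS line would sit, here is a minimal, untested sketch. The output path, title, footnote text, and the choice of sashelp.class with PROC GCHART are all assumptions for illustration and are not from the linked page. GOPTIONS only affects SAS/GRAPH procedures; for SG procedures such as PROC SGPLOT, the size would instead be set with ODS GRAPHICS / WIDTH= HEIGHT=.

```sas
/* Hypothetical output path; adjust for your environment. */
ods powerpoint file="/home/user/slides.pptx";

/* Graph size for SAS/GRAPH output (values taken from the post). */
goptions reset=all hsize=3in vsize=3in dev=png;

/* Hypothetical title and footer text. */
title "Age distribution";
footnote "Hypothetical footer text";

proc gchart data=sashelp.class;
    vbar age / discrete;
run;
quit;

ods powerpoint close;
title;
footnote;
```

Changing hsize and vsize and rerunning is the quickest way to see whether the footer position on the slide actually moves as hoped.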

DarthPathos · ‎07-04-2019

Hi Jillian,

Take a look at this paper - it may give you some ideas, and talks about a macro MMI_IMPUTE. Having said that, I've poked around here and found a thread asking a very similar question; the author of the paper, who as of 2017 worked for SAS, posted:

"...I no longer recommend using MMI_IMPUTE. MMI_IMPUTE uses an imputation algorithm called PAN, which was developed a while back by Joseph Schafer. Unfortunately, PAN is a bit outdated, and is less flexible than newer algorithms (e.g., the algorithm can't incorporate random effects between incomplete variables; all incomplete variables are required to be normally distributed). I recommend that you instead use a standalone software package called Blimp. Blimp was written by Brian Keller and Craig Enders at UCLA. Unlike MMI_IMPUTE, Blimp can handle random effects between incomplete variables and can also handle some non-normal incomplete variables (e.g., binary variables). The software and associated documentation are available at http://www.appliedmissingdata.com/multilevel-imputation.html. Blimp is free, and the website contains scripts for using it from SAS."

I don't know Blimp at all, but as Missing Data / Imputation is something I'm interested in, I will definitely be taking a look. Please post back if you have any further questions; I'm also happy to contact you via email if that's easier.

Chris

DarthPathos · ‎07-04-2019

I'm not familiar with Stata, but the easiest option may be to call Stata from within SAS - paper here. If you can save the DO file as something else (a DTA or a SAV file, according to what I've read), then you can use PROC IMPORT to get the file into the SAS environment. Hopefully you figure this out, and if you can post back your solution here, it'll help future Stata / SAS users 🙂 Chris
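As a rough sketch of the PROC IMPORT route, assuming the data has already been saved from Stata as a DTA file: the file path and the output dataset name below are hypothetical, and DBMS=DTA needs SAS/ACCESS Interface to PC Files to be licensed at your site.

```sas
/* Hypothetical path to a dataset saved from Stata. */
proc import datafile="/home/user/mydata.dta"
    out=work.mydata
    dbms=dta
    replace;
run;

/* Check the variable names and types that came across. */
proc contents data=work.mydata;
run;
```

For a SAV file the same step should work with dbms=sav instead.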

DarthPathos · ‎07-02-2019

Actually the Death Star Analytics Dept probably uses an Open Source stats software - that's how they missed the vulnerability in the thermal exhaust that Luke used to blow it up :')

DarthPathos · ‎06-28-2019

@BeverlyBrown haven't read the whole thing but the stories I have read are a lot of fun. I've always wanted to write a fanfic about a Database Admin that works on the Death Star - I wonder if they use SAS LOL.

DarthPathos · ‎06-28-2019

@BeverlyBrown I have a book at home called "From another Perspective" (C3P0 quote) and it's short stories from side-characters, including one about the creature living in the garbage compactor and how they came to live there. Always love stuff like that!

DarthPathos · ‎06-28-2019

@tomrvincent Surprisingly that's how it was in the scripts (ie. not modified by me). Also from a simple length perspective C3P0 is obviously shorter and easier to type, so no idea why that decision was made. (Appreciate the echuta by the way ;-)) Chris

DarthPathos · ‎06-28-2019

Thanks @MichelleHomes always appreciate your support 8-)

DarthPathos · ‎06-28-2019

Editor's note: SAS programming concepts in this and other Free Data Friday articles remain useful, but SAS OnDemand for Academics has replaced SAS University Edition as a free e-learning option.

As a diehard Star Wars geek, I'm always looking for ways to out-geek my friends. Whether it's with useless facts (did you know John Ratzenberger, who plays Cliff Clavin on Cheers, is in Empire Strikes Back? He's the guard who tells Han not to go out looking for Luke because it's too cold out!) or by having numerous posters in my office (current total is 6), I am always looking for creative ways to show my love for the franchise. I saw a blog about text analysis of the original Star Wars scripts (using another stats package) and decided to extend my previous article to see what I could find; I presented it to the Toronto Area SAS Society, and have attached my presentation at the end of this article.

Get the Data

The data is found on a GitHub repository (here) and needs to be copied and pasted into either a TXT file or Excel.

Get Started with SAS OnDemand for Academics

In this 9-minute tutorial, SAS instructor @DomWeatherspoon shows you how to get your data into SAS OnDemand for Academics and other key steps.

Get the data ready

I must admit, I still use Excel to do a lot of my data cleaning; it's what I've been using for over 25 years (gah!) and I can think and work a lot faster in it than in SAS. Having said that, some really interesting issues came up with the three files.

- Episode 4: A New Hope - There is a varying number of spaces between the character's name and the spoken line.
- Episode 5: Empire Strikes Back - A colon separates the character name from the text; descriptive text is in brackets.
- Episode 6: Return of the Jedi - Double quotes separate the columns; line numbers are included.
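For anyone who would rather do this cleaning in SAS than in Excel, the colon-separated Episode 5 layout could be parsed along these lines. This is an untested sketch and not the method used for the article: the file path, the variable lengths, and the rule of skipping lines without a colon are all assumptions, and it leaves the bracketed descriptive text in place.

```sas
/* Hypothetical path to the Episode 5 text file. */
filename ep5 "/home/user/episode5.txt";

data work.episode5;
    infile ep5 truncover lrecl=32767;
    length line $2000 Character $40 Text $2000 Episode $3;

    /* Read each record as one string. */
    input line $char2000.;

    /* Assumption: a usable line has a name, a colon, then the text. */
    colon = index(line, ':');
    if colon <= 1 then delete;

    Character = strip(substr(line, 1, colon - 1));
    Text      = strip(substr(line, colon + 1));

    /* The two extra variables described in the article. */
    Episode = 'V';
    ID + 1;   /* running counter over the lines that are kept */

    drop line colon;
run;
```

A colon inside the spoken text is harmless here, because only the first colon on the line is used as the separator.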
After cleaning the data, as well as adding in 2 dummy variables (EPISODE and ID), the data finally looked like this:

The results

One tip that I learnt from my last post on Star Wars, and have used repeatedly since, is how to take the text and split it so each word is on a separate row. This is critical for doing frequency analysis and other types of reporting.

data work.starwars2;
    set work.starwars;
    /* Write one row per space-delimited word in Text */
    do i = 1 to countw(Text, ' ');
        new = scan(Text, i, ' ');
        output;
    end;
    keep new;
run;

This takes my Text column and splits it on spaces, writing each word to its own row. Here's what I get when I run it on my data:

The first thing I am curious about is line count - who has the most lines overall?

proc sql;
    /* Number of spoken lines per character */
    create table work.line_count as
    select character, count(*) as Count
    from work.starwars
    group by character
    order by character;
quit;

/* Put the characters with the most lines first */
proc sort data=work.line_count;
    by descending Count;
run;

ods graphics / reset width=6.4in height=4.8in imagemap;

/* Plot only characters with more than 10 lines */
proc sgplot data=work.line_count (where=(count>10));
    scatter x=character y=Count;
    xaxis grid;
    yaxis grid;
run;

ods graphics / reset;

Here's the output. What I find interesting is that there is a significant difference between Han (2nd) and Threepio (3rd).

The other question I have is around the use of the word "Force"; for those unfamiliar with the franchise, the Force is the invisible energy that permeates and exists in, through and around everything. Those who are trained can control it to move objects and use other abilities.

proc sql;
    /* Lines mentioning "Force", per character and episode */
    select character,
           count(case when (episode='IV') then 1 end) as A_New_Hope,
           count(case when (episode='V') then 1 end) as Empire,
           count(case when (episode='VI') then 1 end) as Return_of_Jedi
    from work.starwars
    where text like '%Force%'
    group by character
    order by character;
quit;

So even though the Force is a key component of the franchise, it's not directly mentioned very often at all.

Now it's your turn!
Did you find something else interesting in this data? Share in the comments. I'm glad to answer any questions!
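As one possible starting point, the one-word-per-row table built above (work.starwars2, with the word in the variable new) can feed a simple word-frequency count. This is an untested sketch; the dataset names work.words and work.word_freq are hypothetical, and removing punctuation also strips apostrophes, so "I'm" becomes "IM".

```sas
/* Normalise case and drop punctuation so "Force," and "force" match. */
data work.words;
    set work.starwars2;
    length word $60;
    word = upcase(compress(new, , 'p'));
    if not missing(word);
    keep word;
run;

/* Count how often each word appears. */
proc freq data=work.words noprint;
    tables word / out=work.word_freq;
run;

/* Most frequent words first. */
proc sort data=work.word_freq;
    by descending count;
run;

proc print data=work.word_freq (obs=20);
run;
```

Common words such as "THE" and "YOU" will dominate the top of the list, so a stop-word filter would be a natural next step.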

DarthPathos · ‎04-23-2019

Hi @ChrisBrooks - This is amazing and I have been wanting to get back into the Free Data Friday blogs, so thanks for picking up this work! I'm thrilled you enjoyed the Blog, and I'll reach out to @BevBrown - if you're interested maybe we could do a Tag Team sort of set up? I'll email her now, and look forward to chatting with you soon! Chris

DarthPathos · ‎11-08-2018

Hi Vivapoe! Apologies for the delay. Have you seen this link: https://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/viewer.htm#statug_phreg_sect039.htm? It might get you started on working out the estimated probabilities. Post back if you have any other questions (and I'll try to be quicker in replying! :-)). Have a great day! Chris

DarthPathos · ‎11-10-2017

I'm currently at home with food poisoning (I think that's what it is at least) so won't have a chance to play with this till Monday, but holy cow this is awesome. Thanks for the feedback re: Constraints etc. I am really excited to use this to help my organization move forward, and excited to see what I can learn. All the best and have a great weekend Chris

DarthPathos · ‎11-08-2017

Hi Rob Apologies for the delay, been a rough week. The start times will be determined by the shift (12 hour is 630-1830, 8 hour is 8-4, 10 hour is 1000-1900). We are looking at adding shifts based on skill set but that'll be down the road. Thanks and have a good day Chris

Online Status	Offline
Date Last Visited	‎11-02-2020 01:41 PM