Hi SAS Friends, Need to divide up 100's of structured SOP documents (.TXT) into sections that can be edited, updated, replaced, modified, in a controlled manner. Sample .TXT file is attached. Each SOP document can be brought into SAS and a single .SAS file created. Each SOP has standardized headings, such as NUMBER, TITLE:, POLICY, PURPOSE, APPLICABILITY, RESPONSIBILITY, MATERIALS, PROCEDURE, REFERENCES:, CROSS-REFERENCES to other policies:, KEYWORDS: , ORIGINAL DATE ISSUED:, DATE(S) REVIEWED:, DATE(S) REVISED:, REVIEWED BY: Have looked thru many papers using SAS PRX functions , and discussions of using INDEXW(), and INDEX() functions, etc. but cannot find one that addresses this type of application. I can see how INDEXW() could be used to define the start and end of each section, and so create a single variable such as "Title", that would contain all text starting after TITLE, and finishing before "POLICY", can be created. However, how to manage variability in variable lengths (sentences and paragraphs) is not clear, and how to create these separate variables is not clear. Can you point me to a paper or book that covers this for BASE SAS ? or should a different software tool be used? Any suggestions greatly appreciated. Thank you
... View more