Thanks again for the perspectives, Jaap and Tom. We discussed some of the performance issues around profiling large data sets internally, and I can share details about general improvements within the Data Management Platform that help from a memory-management standpoint, plus information about a setting you can use when profiling large data sets (those with many columns).

Starting with DMP release 2.4, the performance of frequency distribution calculations on large data sets has improved significantly over previous releases. ("Large" here means the input data exceeds the memory allocated for frequency distribution several times over, resulting in multiple memory dump files.) In some test cases, performance improved by more than an order of magnitude.

In DMP release 2.3 and earlier, the prof/per_table_bytes option was introduced in app.cfg to configure the amount of memory used per profiled column. It typically needs to be set only when profiling hundreds of columns.

Starting with DMP release 2.4, the profile engine uses memory differently. The same profiling job may use more overall system memory than it did in the past; however, you still have controls over how much memory is used per profiled column. When using Profile, app.cfg still supports the "prof/per_table_bytes" option. When using the Frequency Distribution data node, the new HASH_BUCKETS property is now supported, and the old HASH_TABLE_SIZE property is still recognized when an old job is loaded and run. If the option is not set (in either form), the profile engine uses the default of 1024 x 1024 = 1048576 buckets (4 MB or 8 MB per table/column profiled).
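For a rough sense of what that default means in practice, here is a minimal back-of-the-envelope sketch. It assumes each hash bucket occupies 4 or 8 bytes, which is an inference from the 4 MB / 8 MB figures above rather than documented engine internals:

```python
# Default number of hash buckets the profile engine uses when neither
# prof/per_table_bytes nor HASH_BUCKETS is set (per the post above).
DEFAULT_BUCKETS = 1024 * 1024  # 1,048,576 buckets


def per_column_memory_mib(bucket_bytes: int) -> float:
    """Approximate memory per profiled table/column, in MiB,
    assuming each bucket occupies `bucket_bytes` bytes (assumed value)."""
    return DEFAULT_BUCKETS * bucket_bytes / (1024 * 1024)


# 4-byte buckets -> 4 MiB per column; 8-byte buckets -> 8 MiB per column,
# matching the 4 MB / 8 MB figures quoted above.
print(per_column_memory_mib(4))  # 4.0
print(per_column_memory_mib(8))  # 8.0
```

So when profiling hundreds of columns at the default setting, the frequency-distribution memory alone can reach into the gigabytes, which is why prof/per_table_bytes (or HASH_BUCKETS) matters for wide tables.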
You might also be interested to know that SAS is developing a new profiling engine that runs "in-database," meaning the work involved in generating a profile report will be able to run within the data source itself, rather than relying on extracting the data to a Data Management Server or Data Quality Server where the calculations are done. Leveraging the typically greater hardware resources of the data source can significantly improve profiling performance for very large tables or data sets. Any other suggestions or comments about profiling large data sets? Keep them coming!