BookmarkSubscribeRSS Feed

Discover More with Faceted Search in SAS Information Catalog

Started ‎07-06-2022 by
Modified ‎07-06-2022 by
Views 1,629

With faceted search you can look at your information assets, from multiple lenses, called facets. You can search assets by name, asset type, description, by date or by user. You can explore data assets by library, file type, table type, number of columns or rows, size. You can search by column information such as name, label, keyword, or completeness percentage. More surprisingly you can search by language detected, semantic type calculated or data privacy status. Watch the video to understand how faceted search can help you find what you need or discover something new about your information assets.

 

What Is Faceted Search?

 

The faceted search in SAS Information Catalog is like a photographer’s lens kit. The facets are the lenses. Each has its own characteristics. They can capture different angles, colors, levels of detail. You switch them, until you can get the perfect shot. The perfect shot can be a unique perspective over your information assets in SAS Viya.

 

bt_1_Lenses.png

 

These facets are calculated when assets are automatically indexed or discovered with an agent.

 

The automatic index keeps track of models, reports, decisions, SAS Studio flows, data plans and CASLIB in-memory or SASHDAT tables available in your SAS Viya.

 

The discovery agent analyzes libraries: CASLIBs or SAS Compute. It produces tens of metrics for each table or file included in the discovery. These calculated metrics are then associated with facets, through which you can search.  

 

Faceted Search Short Demonstration

 

Watch the following short video to understand what faceted search can help you discover in SAS Viya.

 


Asset Type Facets

 

Try a faceted search, AssetType, and it will list the asset types catalogued.

 

bt_2_110_Asset_Type_Faceted_Search_SAS_Information_Catalog-1024x576.png

Select any image to see a larger version.
Mobile users: To view the images, select the "Full" version at the bottom of the page.

 

The results are the union of assets catalogued via:

  • Automatic indexing, e.g. model will list all catalogued models.
  • Discovery agents, e.g. inmemorytable will list all the CAS in-memory tables.

 

Semantic Types Facets

 

SAS Information Catalog is using a Quality Knowledge Base to calculate semantic types for columns in a table or file, based on their name or content. It also labels the columns from a data privacy perspective. For example: Column.informationPrivacy: "private" lists all tables that contain private data.

 

bt_3_120_Information_Privacy_Faceted_Search_SAS_Information_Catalog-1024x352.png

 

Column.semanticType: "individual" shows tables or files where columns have been classified as belonging to a private person.

 

Detected Language Facets

 

More recently, language detection is also performed in long text columns:

 

bt_4_140_Language_Faceted_Search_SAS_Information_Catalog-1024x576.png

 

Languages: "en" will show you tables that have columns with English language content.

 

Languages: "zh" will show you which tables have Chinese language content.

 

Languages: "tl" will indicate assets where Tagalog was detected in a column.

 

To search all CAS tables that contain Korean language data try: +Languages: "ko" +AssetType: "cas". The + indicates a MUST type of search, which means every facet criterion must be satisfied in the search results.

 

bt_5_130_Language_Asset_Type_Faceted_Search_SAS_Information_Catalog-1024x317.png

 
Combine Faceted and Free Text

 

SAS Information Catalog supports two types of searches, free text and faceted, which you can mix and match in a single query, for even better precision or more surprising findings.

 

Software Version

 

The above examples were produced using the SAS Viya Long Term Support Release 2022.1 (May 2022), although facet search has been around since at least October 2021. You will require a SAS Information Governance license for language, semantic type and information privacy features.

 

Additional Resources

 

For more information, see SAS® Information Catalog | Faceted Search. Alternatively, see SAS® Information Catalog | Free Text Search.

 

Conclusion

 

The faceted search allows you to look at your information assets, from multiple lenses, called facets. You can search assets by name, asset type, library, file type, table type, language detected, semantic type calculated or information privacy status, just to name a few.

 

Thank you for your time reading this post. If you liked the post, give it a thumbs up! Please comment and tell us what you think about post content. If you wish to get more information, please write me an email.

 

Find more articles from SAS Global Enablement and Learning here.

Version history
Last update:
‎07-06-2022 11:42 PM
Updated by:
Contributors

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!

Free course: Data Literacy Essentials

Data Literacy is for all, even absolute beginners. Jump on board with this free e-learning  and boost your career prospects.

Get Started