BookmarkSubscribeRSS Feed

How to Gain Insight from Mock Employee Data Model Demo

Started ‎11-21-2023 by
Modified ‎11-21-2023 by
Views 297

In this article we will discuss using mock generated company data provided by Kaggle. We will discuss the effectiveness of gaining useful insight from the data and go through the process of analyzing the data to understand its usefulness. This demonstration will not only give you a practical understanding of working with employee data but also illustrate the potential of data analytics in HR and organizational management.

 

Our objective in this demo is to provide you with a glimpse of the kind of insights and decisions that can be derived from employee data. By the end of this post, you will have a better understanding of how data can be used to optimize human resources, improve organizational efficiency, and make informed decisions.

 

Loading CSV File

 

Begin by introducing you to the mock employee dataset, including the key variables and fields it contains. This will give you an idea of the type of information we are working with. We will need to load the CSV file into the SAS Explorer; once the data is loaded in the proper location, we can use macro variables to establish the path of the datafile. Once the path is established, we need to know what type of separator is be used, in this demo we know that the data is comma separated.

 

%let path=/cisviya-export/cisviya/homes/Dee.Mckoy@sas.com/Blgdata/data;
%let pathout=&path/output;
proc import datafile="&path/Employee_Dataset.csv"
out=work.Education
dbms=dlm
replace;
delimiter=',';
getnames=YES;
run;

 

01_DMcK_10obs_dataset.png

Select any image to see a larger version.
Mobile users: To view the images, select the "Full" version at the bottom of the page.

 

From the above image we can see the first 10 observations from the dataset. From the dataset we see our listed our attributes, Education, JoiningYear, PaymentTier, Age, EverBenched, ExperienceinCurrentDomain, and LeaveOrNot. The variable Everbenched indicates if whether employee has ever been temporarily without assigned work. The variable "Experience in Current Domain" stands for the number of years of experience employees have in their current field.

 

Descriptive  Statistics

 

proc means data=work.Education;
run;

 

02_DMcK_Descriptive_Stat.png

 

In this step, we run a proc means step to understand of the dataset. The descriptive statistics are measures we can use to learn more about the distribution of observations in variables for analysis, transforming variables, and reporting.

 

Data Visualization for Insight

 

In this first data visualization example, we will look at the age, education level, and sex of the employees to see if there is any insight to gain from the visualization.

 

proc sgplot data=work.Education;
title "PaymentTeir by Age and Sex";
vline Age / response=PaymentTier stat=mean markers
group=Gender lineattrs=(thickness=5px);
styleattrs datasymbols=(TriangleFilled CircleFilled)
datalinepatterns=(ShortDash LongDash);
run;

 

03_DMcK_PaymentTier.png

 

From the image above, we see line plot showing the different payment Tiers of employees based on Age and gender. This type of visualization could be useful in understanding the payment scale based around age which could require more investigation.

 

Now that we have seen the PaymentTier for the Ages of employees at the company. Let's take a look at the levels of Education versus Gender and get a count for number of employees with different levels of education.

 

proc sgplot data=work.Education;
	title Employees Joining By Year;
	vline JoiningYear / group=Education stat=mean markers datalabel;
run;

 

04_DMcK_BarGraph_Edu.png

 

proc sgplot data=work.Education;
	title Employees Joining By Year;
	vline JoiningYear / group=Education stat=mean markers datalabel;
run;

 

05_DMcK_Employees_Joined.png

 

In this step, let's take a look the education level of the employees from each year starting from 2012 until 2019. By looking at this visualization we will be able gain insight as to how many employees are being hired each year and what level of education the employee enters the company. From the data we are able to see that through each cycle year there was increase of employees who joined with only a bachelor's degree. This would be good insight for comparing recruiting numbers for recent graduates of universities.

 

Conclusion

 

From the demonstration, we can conclude that using some of these techniques will help with gaining further knowledge and insight from metrics used in HR and other analytical services. By using data similar to the demo data used could help making better decision in the future regarding recruiting, pay tier per departments, and skill level of employees.

 

For more information:

 

 

Find more articles from SAS Global Enablement and Learning here.

Version history
Last update:
‎11-21-2023 10:40 AM
Updated by:
Contributors

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

Free course: Data Literacy Essentials

Data Literacy is for all, even absolute beginners. Jump on board with this free e-learning  and boost your career prospects.

Get Started

Article Tags