BookmarkSubscribeRSS Feed
india2016
Pyrite | Level 9

Hello Team,

 

  We are facing issue while loading data into hadoop environment from Oracle database using DI job (Table Loader Node). 
But when table is loaded into hadoop , multiple missing observations are generated  which are not in Oracle data-base.

 

 

Anyone faced this issue then please suggest what should I do to overcome this issue?

Any suggestion will be appreciated.

 

 

 

2 REPLIES 2
JBailey
Barite | Level 11

Hi @india2016

 

Without detailed information it is hard to answer your question. 

If it were me, I would validate the data coming out of Oracle and see if there are any surprises. Once I am sure the Oracle data looks good I would load it into Hadoop. Pay attention to the data type conversions and missing values. If you can find the Oracle rows that display the problems it would help the community members understand what is happening. 

 

To really solve this we would need, at a minimum, the DDL for the table definitions in both Oracle and Hive and sample data (that causes the problem), and data (from Hive) showing the problem.

 

Best wishes,

Jeff

india2016
Pyrite | Level 9
1) When we run a DI job it fetches data from Oracle table and loads it into Hadoop table using Table loader transformation.
The data gets loaded into Hadoop Table and then when you open the table in SAS environment you are getting some blank rows.
2)  As you suggested, there is no such blank values in Oracle table but while data loading into Hadoop dataset,some extra missing rows were generated in   hadoop table.
 
 
DDL is created  in Hadoop using CREATE TABLE node. and then using DATA LOADER node data was loaded into Hadoop tables.
 
Should I share backend code for these transformations? 
I have attached backend coed for the same.
Please find attached log. 

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

Mastering the WHERE Clause in PROC SQL

SAS' Charu Shankar shares her PROC SQL expertise by showing you how to master the WHERE clause using real winter weather data.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 843 views
  • 0 likes
  • 2 in conversation