BookmarkSubscribeRSS Feed
🔒 This topic is locked. We are no longer accepting replies to this topic. Need further help? Please sign in and ask a new question.
Shannon23_11
Calcite | Level 5

Hi there,

 

I'm working on Data Curation: SAS and Hadoop Lesson 3 Activity 2.

 

I'm trying to execute a Pig script in HUE, but my job keeps failing.

 

I'm not sure if it's because I don't have the data in HUE that I want to load. 

 

Please see screenshots attached.

1 REPLY 1
saskyr
SAS Employee

When I do not complete an earlier step in the course to load data into HDFS I get the same error running that Pig program. When I run that earlier step to load the data in HDFS then the Pig program works. So that is likely the issue you are having. Try these steps:

 

  1. Open mRemoteNG from the desktop icon.
  2. In the connections panel in top left of the mRemoteNG application, double-click student@HadoopClient connection.
  3. On the linux command line for that connection submit this command:

hdfs dfs -put /workshop/dihdm/data  /user/student/dihdm

 

When you execute that command you may (or may not) see one or more messages that files or directories already exist but that is OK.

 

Here is a helpful screen capture:

 

saskyr_0-1616092726227.jpeg

 

Once above is complete you will hopefully be able to execute that Pig program (and others). This is something you should repeat whenever you start up a new Fresh image instead of starting from a saved image.