public class StreamingStep extends Object
See also: Hadoop Streaming
AWSCredentials credentials = new BasicAWSCredentials(accessKey, secretKey);
AmazonElasticMapReduce emr = new AmazonElasticMapReduceClient(credentials);
HadoopJarStepConfig config = new StreamingStep()
.withInputs("s3://elasticmapreduce/samples/wordcount/input")
.withOutput("s3://my-bucket/output/")
.withMapper("s3://elasticmapreduce/samples/wordcount/wordSplitter.py")
.withReducer("aggregate")
.toHadoopJarStepConfig();
StepConfig wordCount = new StepConfig()
.withName("Word Count")
.withActionOnFailure("TERMINATE_JOB_FLOW")
.withHadoopJarStep(config);
RunJobFlowRequest request = new RunJobFlowRequest()
.withName("Word Count")
.withSteps(wordCount)
.withLogUri("s3://log-bucket/")
.withInstances(new JobFlowInstancesConfig()
.withEc2KeyName("keypairt")
.withHadoopVersion("0.20")
.withInstanceCount(5)
.withKeepJobFlowAliveWhenNoSteps(true)
.withMasterInstanceType("m1.small")
.withSlaveInstanceType("m1.small"));
RunJobFlowResult result = emr.runJobFlow(request);
| Constructor and Description |
|---|
StreamingStep ()
Creates a new default StreamingStep.
|
| Modifier and Type | Method and Description |
|---|---|
Map<String,String> |
getHadoopConfig ()
Get the Hadoop config overrides (-D values).
|
List<String> |
getInputs ()
Get list of step input paths.
|
String |
getMapper ()
Get the mapper.
|
String |
getOutput ()
Get output path.
|
String |
getReducer ()
Get the reducer
|
void |
setHadoopConfig (Map<String,String> hadoopConfig)
Set the Hadoop config overrides (-D values).
|
void |
setInputs (Collection<String> inputs)
Set the list of step input paths.
|
void |
setMapper (String mapper)
Set the mapper.
|
void |
setOutput (String output)
Set the output path for this step.
|
void |
setReducer (String reducer)
Set the reducer
|
HadoopJarStepConfig |
toHadoopJarStepConfig ()
Creates the final HadoopJarStepConfig once you are done configuring the step.
|
StreamingStep |
withHadoopConfig (String key,
String value)
Add a Hadoop config override (-D value).
|
StreamingStep |
withInputs (String... inputs)
Add more input paths to this step.
|
StreamingStep |
withMapper (String mapper)
Set the mapper
|
StreamingStep |
withOutput (String output)
Set the output path for this step.
|
StreamingStep |
withReducer (String reducer)
Set the reducer
|
public StreamingStep()
public List<String> getInputs()
public void setInputs(Collection<String> inputs)
inputs - List of step inputs.public StreamingStep withInputs(String... inputs)
inputs - A list of inputs to this step.public String getOutput()
public void setOutput(String output)
output - Output path.public StreamingStep withOutput(String output)
output - Output pathpublic String getMapper()
public void setMapper(String mapper)
mapper - Mapperpublic StreamingStep withMapper(String mapper)
mapper - Mapperpublic String getReducer()
public void setReducer(String reducer)
reducer - Reducerpublic StreamingStep withReducer(String reducer)
reducer - Reducerpublic Map<String,String> getHadoopConfig()
public void setHadoopConfig(Map<String,String> hadoopConfig)
hadoopConfig - Hadoop config.public StreamingStep withHadoopConfig(String key, String value)
key - Hadoop configuration key.value - Configuration value.public HadoopJarStepConfig toHadoopJarStepConfig()