Search code examples
How does checkpointing work in HDFS? I would like to get clarity on fs.checkpoint.period and fs.chec...


hadoopmapreducehdfs

Read More
Get the number of completed steps in an Amazon Elastic MapReduce jobflow via boto...


amazon-ec2mapreduceamazon-emrboto

Read More
Hadoop mapreduce with input size ~ 2Mb slow...


javaamazon-web-serviceshadoopmapreducewritable

Read More
Amazon Elastic MapReduce: Failed to create a job flow with a large number of instances...


amazon-web-servicesmapreduce

Read More
STREAM keyword in pig script that runs in Amazon Mapreduce...


amazon-web-serviceshadoopmapreduceapache-pig

Read More
Hadoop on Amazon Cloud...


amazon-web-serviceshadoopamazon-ec2mapreduceamazon-ami

Read More
Unable to set Inventory Status for an Inventory Adjustment record using Suitescript 2.x...


mapreducenetsuitesuitescriptsuitescript2.0

Read More
I am practicing basic hadoop with IntelliJ...


javahadoopmapreducehdfs

Read More
HDFS staging dir permission issues with yarn mapred framework - /tmp/hadoop-yarn/staging...


hadoopmapreducehadoop-yarn

Read More
Groovy: map reduce list of maps...


groovymapreducefunctional-programming

Read More
NullPointerException caused by static field in Mapper class...


javahadoopmapreduce

Read More
Parquet file optional field does not exist...


javahadoopmapreduceparquet

Read More
Suitescript 2.0 MapReduce Script...


mapreducenetsuitesuitescript

Read More
Extracting a list of substrings from MongoDB using a Regular Expression...


regexmongodbmapreduceaggregation-framework

Read More
Collecting specific data from CSV file using Hadoop MapReduce...


javacsvhadoopmapreduce

Read More
What default reducers are available in Elastic MapReduce?...


hadoopmapreduceaggregatereduce

Read More
Flink Broadcast non-KeyedStream to a KeyedStream...


mapreducebigdataapache-flinkflink-streaming

Read More
Flink how are partitions of a stream associated with the parallelism?...


parallel-processingmapreducebigdataapache-flinkflink-streaming

Read More
What is the int needed for in map(int, icount) in Pydoop...


pythonhadoopmapreduce

Read More
In Flink is it possible to use state with a non keyed stream?...


mapreducebigdataapache-flinkflink-streamingflink-state

Read More
How to perform a map reduce in MongoDB with a colon in the field name?...


mongodbmapreducefield

Read More
Importing data from MySQL to HDFS: ClassNotFoundException...


javamysqlmapreducesqoophadoop2

Read More
hadoop get files from existing archived file in hdfs...


hadoopmapreducehdfsarchive

Read More
OptionConverter.convertLevel Error in Hadoop Mapreduce job...


javagradlehadoopmapreducelog4j

Read More
How to find out number of elements in MongoDB array?...


databasemongodbmapreducemongo-shell

Read More
How to sum fields of collection elements without mapping them first (like foldLeft/reduceLeft)?...


scalacollectionssummapreducehigher-order-functions

Read More
Unit Test MapReduce - Junit Mockito...


javahadoopjunitmockitomapreduce

Read More
in aws emr job flow, does each step receive the output from the previous step?...


javaamazon-web-serviceshadoopmapreduceamazon-emr

Read More
How to check if there is a key in collection that has more than one value?...


databasemongodbmongodb-querymapreduce

Read More
Running a job using hadoop streaming and mrjob: PipeMapRed.waitOutputThreads(): subprocess failed wi...


pythonhadoopmapreducehadoop-streamingmrjob

Read More
BackNext