Search code examples
Why do we need HDFS on EMR when we have S3...


amazon-web-servicesapache-sparkamazon-s3hdfsamazon-emr

Read More
Moving files from one directory to another directory in HDFS using Pyspark...


pythonapache-sparkpysparkhdfs

Read More
How to configure putHDFS processor in Apache NiFi such that I could transfer file from a local machi...


hdfsapache-nifi

Read More
hadoop get files from existing archived file in hdfs...


hadoopmapreducehdfsarchive

Read More
Hadoop copying directory with contents...


hadoophdfs

Read More
hdfs how to output size zero file in a specific directory path...


filehadoopawkhdfs

Read More
Hadoop java.io.IOException: Mkdirs failed to create /some/path...


hadoophdfsioexception

Read More
How to setup webhdfs hadoop...


hadoophdfswebhdfs

Read More
Hive data loading using complex json structure...


arraysjsonstructhivehdfs

Read More
Using AWS EMRFS in apache spark hosted on ec2...


apache-sparkkubernetesamazon-s3hdfsamazon-emr

Read More
NameNode Format error "failure to login for principal: X from keytab Y: Unable to obtain passwo...


hadoophdfskerberosubuntu-20.04namenode

Read More
Kafka Connect - From JSON records to Avro files in HDFS...


apache-kafkahdfsapache-kafka-connectjsonschemaconfluent-schema-registry

Read More
HDFS: How do you list files recursively?...


hadoophdfs

Read More
'hdfs' is not recognized as an internal or external command, operable program or batch file...


windowshadoopinstallationhdfssystem-variable

Read More
Sufficient way in Spark to predict number of records in given hdfs dataset...


scalaapache-sparkcounthdfs

Read More
merge multiple csv files present in hadoop into one csv files in local...


pythoncsvapache-sparkhadoophdfs

Read More
How to take count of no of files in hdfs directory...


scalaapache-sparkhdfs

Read More
Namenode not starting - Exception in namenode join...


hadoophdfs

Read More
How do I get schema / column names from parquet file?...


hadoopapache-pighdfsparquet

Read More
Writing to HDFS could only be replicated to 0 nodes instead of minReplication (=1)...


javahadoopmapreducehivehdfs

Read More
Scala: how to get max partition from hdfs dir...


scalaapache-sparkhdfs

Read More
What open source solutions exist to move data from Kafka to HDFS3 using Kafka Connect?...


hadoopapache-kafkahdfsapache-kafka-connect

Read More
How to Create HDFS file...


pythonhadoophdfssnakebite

Read More
Merging two files by making a HDFS application that merges them into one file located in HDFS...


javahadoophdfs

Read More
Hadoop Error - All data nodes are aborting...


hadoopmapreducehdfshadoop-yarnhadoop2

Read More
The way to check a HDFS directory's size?...


hadoopcommand-linedirectoryhdfs

Read More
How to find the size of a HDFS file...


hadoophdfs

Read More
How to read only n rows of large CSV file on HDFS using spark-csv package?...


apache-sparkpysparkhdfsapache-spark-sqlspark-csv

Read More
What causes Hadoop datanodes to be excluded from operations?...


hadoophdfs

Read More
Datanode decommisioning vs Write...


hadoophdfs

Read More
BackNext