Search code examples
Dataflow writing a pCollection of GenericRecords to Parquet files...

java apache-beam parquet dataflow

Is it possible to read a Parquet dataset partitioned by hand using Dask with the Fastparquet reader?...

python amazon-s3 dask parquet fastparquet

Corrupt Decimal value when querying a hive on parquet table from spark...

hive apache-spark-sql parquet cloudera-cdh

SQL Server -- CREATE EXTERNAL FILE FORMAT to query Parquet files via Polybase -- failing due to TCP ...

sql-server parquet sql-server-2019 polybase

How to read multiple .parquet files from multiple directories into single pandas dataframe?...

pandas parquet

Why GCP gets killed when reading a partitioned parquet file from Google Storage but not locally?...

pandas google-cloud-platform google-cloud-storage dask parquet

How can I mark an Azure Dataset as a time series dataset reading from a parquet folder with date par...

parquet azure-machine-learning-service

Athena returns wrong values for timestamp fields in parquet files...

python amazon-web-services amazon-s3 parquet amazon-athena

pyarrow data types for columns that have lists of dictionaries?...

pandas parquet pyarrow

Difference in time taken for importing parquet files between SparkR and sparklyr...

r parquet databricks sparkr sparklyr

Reading a parquet file in nodejs...

javascript node.js apache parquet apache-arrow

optimizing reading from partitioned parquet files in s3 bucket...

apache-spark amazon-s3 pyspark parquet hadoop-partitioning

Read parquet file data from azure data lake to Excel stored in SharePoint Online...

azure sharepoint-online parquet azure-data-lake-gen2 excel-online

about deprecated method ParquetFileReader.readFooter...

hadoop parquet

How to convert a float to a Parquet TIMESTAMP Logical Type?...

python parquet pyarrow

AWS Lambda with spark library gives OutOfMemoryError...

java apache-spark aws-lambda out-of-memory parquet

AWS DMS: How to handle TIMESTAMP_MICROS parquet fields in Presto/Athena...

amazon-web-services parquet presto amazon-athena aws-dms

Moving files from one parquet partition to another...

amazon-s3 pyspark parquet hadoop-partitioning

AWS Glue/Athena - S3 - Table partitioning...

amazon-web-services parquet aws-glue amazon-athena

How to add extra metadata when writing to parquet files using spark...

apache-spark apache-spark-sql parquet

check if a file is an ORC file...

scala apache-spark parquet orc

How to set the 'category' data type for a pyarrow Table column?...

python parquet pyarrow

Writing Parquet/Avro GenericRecord to JSON while maintaining LogicalTypes...

java json hadoop avro parquet

Spark read from parquet hive table having timestamp...

apache-spark apache-spark-sql parquet

Avro -> Parquet -> Spark SQL...

apache-spark apache-spark-sql avro parquet

Query Cassandra UDT via Spark SQL...

apache-spark cassandra parquet presto spark-cassandra-connector

How to convert a json file in to parquet using aws lambda...

python amazon-web-services amazon-s3 aws-lambda parquet

Parquet predicate pushdown filtering with Dask...

dask parquet

How to display all the fields in a complex data column (map type) in Impala?...

parquet impala

Convert multiple CSVs to single partitioned parquet dataset...

pandas parquet fastparquet
