Search code examples
Why this T-SQL query doesn't work in Synapse?...

sql-servert-sqlparquetazure-synapse

Read More
How to generate the pyarrow schema for the dynamic values...

google-cloud-dataflowapache-beamparquetpyarrowapache-beam-io

Read More
Reading parquet file is slower in c++ than in python...

pythonc++parquetpyarrowapache-arrow

Read More
Adjusting columns from txt to parquet...

scalaapache-sparkpysparkapache-spark-sqlparquet

Read More
dask loading multiple parquet files with different column selections...

pandasdaskparquetdask-distributed

Read More
Using s3a on linux machine fail for >100 columns parquet...

linuxscalaparquetspark-shellamazon-s3-access-points

Read More
Can not save pandas dataframe to parquet with lists of floats as cell value...

pythonpandasparquetpyarrow

Read More
Pyarrow: How to specify the dtype of partition keys in partitioned parquet datasets?...

pythonpandasparquetpyarrow

Read More
Data type mismatch while transforming data in spark dataset...

javaapache-sparkapache-spark-sqlparquetapache-spark-dataset

Read More
Spark: error reading DateType columns in partitioned parquet data...

pythonapache-sparkamazon-s3pysparkparquet

Read More
Spark parse and processing file parquet/json...

pythonscalaapache-sparkapache-spark-sqlparquet

Read More
What's the best way to write a big file to S3?...

scalaapache-sparkpysparkparquetapache-zeppelin

Read More
How can I create an Azure dataset in Azure ML studio (through the GUI) from a parquet file created w...

azureapache-sparkparquetazure-machine-learning-service

Read More
Can I filter a parquet table?...

pythonparquet

Read More
Can I use Athena / Presto to sort a table before writing?...

parquetamazon-athenapresto

Read More
In AWS Athena - how to show the timestamp column with required format?...

amazon-web-servicesamazon-s3parquetamazon-athenaaws-glue-data-catalog

Read More
Using dask.DataFrame.to_parquet() to write large file...

pythonpandasdaskparquet

Read More
How can I automate the process of running the same aggregation in 12 parquet files and then join the...

apache-sparkpysparkapache-spark-sqlparquetpartition

Read More
Hive table with only a subset of fields from parquet file...

hiveparquet

Read More
Kafka-connect without schema registry...

amazon-s3apache-kafkaparquetapache-kafka-connectconfluent-platform

Read More
What is a common use case for Apache arrow in a data pipeline built in Spark...

apache-sparkparquetpyarrowapache-arrow

Read More
How can I dump the .parquet data that is in Azure DataLakeStorage to a Microsoft SQL Server database...

sql-serverazureapache-nifiparquetazure-data-lake-gen2

Read More
Parquet compression degradation when upgrading spark...

apache-sparkapache-spark-sqlparquetsnappy

Read More
How to create ORC or Parquet files from PHP code?...

phphiveparquetorcpresto

Read More
Json object to Parquet format using Java without converting to AVRO(Without using Spark, Hive, Pig,I...

javajsonhadoopparquet

Read More
Add columns to AWS Athena paquet tables...

parquetamazon-athenaalter-table

Read More
Difference between 'parquet.compress' and 'parquet.compression' in hive table proper...

apache-sparkhivecompressionparquetcloudera

Read More
Failed to find data source: parquet, when building with sbt assembly...

apache-sparkparquetsbt-assembly

Read More
Error: Invalid: Unrecognized filesystem type in URI when loading parquet file from url using arrow p...

rparquetapache-arrow

Read More
How can I achieve predicate pushdown when using PyArrow + Parquet + Google Cloud Storage?...

google-cloud-storageparquetpyarrowapache-arrowgcsfuse

Read More
BackNext