Search code examples
How do you parse pdf files from a raw dataset in Foundry?...


palantir-foundryfoundry-code-repositoriesfoundry-code-workbooksfoundry-python-transformpalantir-foundry-api

Read More
How do I parse large compressed csv files in Foundry?...


pysparkpalantir-foundryfoundry-python-transform

Read More
How do I identify the value of a skewed task of my Foundry job?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transformfoundry-contour

Read More
How can I have nice file names & efficient storage usage in my Foundry Magritte dataset export?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transformfoundry-data-connection

Read More
Shuffle Stage Failing Due To Executor Loss...


apache-sparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
When would I prefer to run a job in static allocation vs. dynamic allocation?...


palantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
Why is my Code Repo warning me not to use union and instead use unionByName?...


palantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
How do I compute a range of statuses from a daily indicator?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
How do I control the file counts inside my Hive-partitioned dataset?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
Why does my build with executor cores specified OOM?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
Does a count() over a DataFrame materialize the data to the driver / increase a risk of OOM?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
Why don't I see smaller tasks for my requested repartitioning?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
Why is my Code Repo warning me about using withColumn in a for/while loop?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
How do I add a column indicating the row number from a file on disk?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
Why don't I see log lines in my PySpark code when I would expect them to appear?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
How do I parse xml documents in Palantir Foundry?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
How do I ensure consistent file sizes in datasets built in Foundry Python Transforms?...


palantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
How do I union many distinct schemas into a single output I can dynamically pivot later?...


palantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
Why is my build hanging / taking a long time to generate my query plan with many unions?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
How can I merge an incremental dataset and a snapshot dataset while retaining deleted rows?...


palantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
Palantir Foundry incremental testing is hard to iterate on, how do I find bugs faster?...


palantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
Why do I see repeated materializations of a DataFrame in my build?...


pysparkpalantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
Is there a way to populate column descriptions specific to data set?...


palantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
how to access the data frame without my_compute_function...


palantir-foundryfoundry-code-repositoriesfoundry-data-connectionfoundry-python-transform

Read More
Is adding a column description via write_dataframe aware of the branch it is run on?...


palantir-foundryfoundry-code-repositoriesfoundry-python-transform

Read More
BackNext