apache-spark ibm-cloud data-science-experience

Spark history server is not showing 'complete' applications

I am trying to performance tune a slow running DSX job.

I have navigated to the spark history server from the underlying spark service on Bluemix (as per this question).

I have executed a cell containing some basic spark code:

In [1]:
x = sc.parallelize(range(1, 1000000))
x.collect()

Out[1]:
[1,
 2,
 3,
 4,
 5,
 ...

I have then refreshed the Job History Server page in the browser, however, the spark history server is not showing any complete applications:

How can I find the 'complete' applications?

Update

The spark service I'm referring to is IBM's managed spark service on Bluemix so I don't have any control over the configuration.

Update 2

It looks as though the dates are getting corrupted which is why I'm not seeing completed jobs:

Solution

I have taken this up with the spark service engineering team - it is a known issue.

Create column using Spark pandas_udf, with dynamic number of input columns
How to find position of substring column in another column using PySpark?
How to correctly read a CSV file while escaping delimiter comma placed within square brackets using Apache Spark and Scala?
SPARK SQL Equivalent of Qualify + Row_number statements
How to drop a column from a Databricks Delta table?
Converting all columns in spark df from decimal to float for pandas conversion
How to create a copy of a dataframe in pyspark?
Read previous Spark APIs
Unexpected output from least (source data includes nulls)
How to use PySpark UDF in Java / Scala Spark project
How does spark load python package depends on the external library?
Disable PySpark to print info when running
PySpark: How To Deserialise A Proto Payload From A Kafka Message With Variable Message Type
Multiple Sinks Processing not persisting in Databricks Community Edition
How to find longest sequence of consecutive dates?
graph.triplets seems not work as expected
PySpark MongoDB :: java.lang.NoClassDefFoundError: com/mongodb/client/model/Collation
How do I access the fields within a VARIANT column while reading from Kafka using Spark?
pyspark: how to specify rebalance partitioning hint with columns
Is Python UDF still inefficient in Spark?
How to import AnalysisException in PySpark
Updated scalapb class fails to render old dataframe
Create a Column with Values Based on an Array of Column Names Provided in Another Column
How to join on multiple columns in Pyspark?
Databricks: Issue while creating spark data frame from pandas
How to use SparkSQLparse in a simple FROM analysis?
UnsatisfiedLinkError while writing to S3 using Staging S3A Committer on Windows
How to install postgresql in my docker image?
Why Spark won't store Broadcasted data in off heap memory? Why does it store one copy per executor?
Are Parquet files highly structured or semi structured?