python · json · pyspark

How best to handle badly concatenated JSON


I receive a single JSON file from a client, and it is not well formed.
The client concatenates multiple JSON responses into one file:

{
    object1
    {
        ...
    }
}   
{
    object2
    {
        ...
    }
}
...

When I parse it into a DataFrame in PySpark, I always get a count of only one root object. That count is technically correct, because the reader parses only the first object and ignores everything after it.
I need to handle this somehow, and I'm trying to figure out the best way to do it performance-wise.
Can the DataFrame reader handle bad JSON, or can I easily fix this with Python?


Solution

  • You can use the jq module to parse the data.

    >>> data = open("tmp.json").read()
    >>> data
    '{"foo": 1}\n{"bar": 2}\n'
    >>> import jq
    >>> jq.compile(".").input_text(data).all()
    [{'foo': 1}, {'bar': 2}]
    

    What you have isn't really invalid or incorrect; it's just a stream of individual JSON objects rather than a single JSON value like an array.
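
  • If you'd rather not add a dependency, you can split the same stream with the standard library alone: json.JSONDecoder.raw_decode parses one value from a string and returns the index where it stopped, so you can walk the text object by object. A minimal sketch of that approach (iter_json_stream is a hypothetical helper name, and the sample data matches the jq example above):

    import json

    def iter_json_stream(text):
        """Yield each top-level JSON value from a concatenated stream."""
        decoder = json.JSONDecoder()
        pos = 0
        while pos < len(text):
            # raw_decode raises on leading whitespace, so skip it first.
            while pos < len(text) and text[pos].isspace():
                pos += 1
            if pos >= len(text):
                break
            # raw_decode returns the parsed value and the index just past it.
            obj, pos = decoder.raw_decode(text, pos)
            yield obj

    print(list(iter_json_stream(open("tmp.json").read())))
    # [{'foo': 1}, {'bar': 2}]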
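
  • Once the stream is split, getting it into a PySpark DataFrame is straightforward: re-serialize the parsed objects one document per line (JSON Lines), which Spark's JSON reader understands natively. A minimal sketch, assuming the jq approach above, a local SparkSession, and the file name tmp.json from the example:

    import json
    import jq
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()

    # Parse the concatenated stream into a list of Python dicts with jq.
    records = jq.compile(".").input_text(open("tmp.json").read()).all()

    # One JSON document per line is the format Spark's JSON reader expects,
    # so it now sees one row per root object instead of stopping at the first.
    lines = spark.sparkContext.parallelize([json.dumps(r) for r in records])
    df = spark.read.json(lines)
    print(df.count())  # one row per root object, not 1

    Note that this parses the whole file on the driver before handing it to Spark, which should be fine for files of moderate size.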