I assume that the performance of option #2 will be equal to or better than that of option #1. My question is mainly whether it makes a difference at all, as uploading pages separately has its own advantages for us (our use case is a bit more complicated; I simplified it for the explanation).
In terms of performance, as long as each document has only a few pages, you are unlikely to see a noticeable difference between the two options. However, option 1 takes more total processing time because of the extra pre-processing step of splitting multi-page documents.
While option 2 offers a simpler workflow, it may also process irrelevant information beyond the first page, which could hurt accuracy and efficiency compared to option 1, where each page is analyzed individually.
Ultimately, the best choice depends on your specific needs. While option 2 is straightforward, option 1 might yield better accuracy for your custom extractor due to its focus on individual pages.
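If you want to check the difference on your own documents rather than rely on a general estimate, a rough timing harness along these lines may help. This is only a sketch: `compare_options`, `time_call`, `process_page`, and `process_document` are hypothetical names, not part of Document AI, and the two callables are placeholders for whatever code you use to send a single page or a whole document to your processor. It also assumes the pages are already split, so the splitting cost of option 1 is not measured.

```python
import time
from statistics import mean
from typing import Callable, Dict, List, Sequence

Page = bytes  # assumption: each page is available as raw bytes


def time_call(fn: Callable[[], object], repeats: int = 3) -> float:
    """Return the mean wall-clock time of fn() in seconds over `repeats` runs."""
    durations: List[float] = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        durations.append(time.perf_counter() - start)
    return mean(durations)


def compare_options(
    pages: Sequence[Page],
    process_page: Callable[[Page], object],
    process_document: Callable[[Sequence[Page]], object],
    repeats: int = 3,
) -> Dict[str, float]:
    """Time option 1 (one call per page) against option 2 (one call per document)."""
    # Option 1: one request per page. Pages are assumed to be split already,
    # so the pre-processing (splitting) cost is NOT included in this number.
    option_1 = time_call(lambda: [process_page(p) for p in pages], repeats)
    # Option 2: a single request carrying the whole multi-page document.
    option_2 = time_call(lambda: process_document(pages), repeats)
    return {"option_1_seconds": option_1, "option_2_seconds": option_2}


if __name__ == "__main__":
    # Stand-in callables; replace them with your real processor calls.
    fake_pages = [b"page-1", b"page-2", b"page-3"]
    timings = compare_options(
        fake_pages,
        process_page=lambda page: len(page),
        process_document=lambda doc: sum(len(p) for p in doc),
    )
    print(timings)
```

Running it a few times on a representative sample of your invoices should show whether the gap between the two options is large enough to matter for your workflow.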
You might find this discussion interesting as it touches on processing multi-page invoices in Document AI.