Fix empty PDF issue with Puppeteer ^23.0.0 #876

JeppeKnockaert · 2024-08-21T15:35:08Z

This pull request reintroduces the changes done for Puppeteer v23, while fixing the issue introduced in #870 resulting in a white PDF.

The issue was caused by not using base64 to communicate between puppeteer and Browsershot.

I also added a test that actually checks the contents of the PDF. To do this, I introduced the pdf-to-text package used in https://github.com/spatie/laravel-pdf and also added the same note in the README.

There is still one flaky test: 'can handle a permissions error with full output', which expects no errors, but sometimes gets a 404 on the favicon of example.com (because they have not favicon). It doesn't always fail, because the request for a favicon happens asynchronously.

This reverts commit 7174e3f.

bluesheep100 · 2024-08-22T06:53:27Z

I discovered this as well last night, but it was quite late, so I didn't have time to PR it :(
Good work though. Are you sure there's no way to write this new test without introducing another outside dependency? (pdftotext)

I'm not personally sure why exactly this issue manifested as it did, but as far as I could tell, attempting to return the raw buffer from the JS side would mess up the encoding or something, because the broken PDFs are encoded as UTF-8, whereas the working one (as decoded from base64 by PHP) is ANSI.

JeppeKnockaert · 2024-08-22T07:07:14Z

Good point, I looked at it again and tried to see if I could just assert the binary data to contain the text. This works, but it also works with the empty PDF, resulting in a false positive. The thing that causes it to be a white page seems to be very subtle, because all the content is actually there in the binary data.

bluesheep100 · 2024-08-22T08:56:55Z

I don't know much of anything about the PDF structure, but it appears they come out as blocks of stream data, prefixed with a length and "stream" keyword. I'd assume the difference in file encoding might affect that stream data and make it not display, while the binary remains identical.

freekmurze · 2024-08-22T09:14:05Z

Very nice, thank you!

JeppeKnockaert added 3 commits August 21, 2024 16:33

Revert "revert changes in 4.2"

401f2ca

This reverts commit 7174e3f.

Use base64 for communication between puppeteer and Browsershot

ba1fef2

Add test that checks PDF contents

f421dc8

JeppeKnockaert force-pushed the fix-pdf-export-2 branch from 12e0627 to 90b49f8 Compare August 21, 2024 15:44

Fix randomly failing test by blocking favicon

93e9e35

JeppeKnockaert force-pushed the fix-pdf-export-2 branch from c1a1240 to 93e9e35 Compare August 21, 2024 15:51

JeppeKnockaert mentioned this pull request Aug 21, 2024

[Bug]: Can't preview PDF in browser spatie/laravel-pdf#171

Closed

freekmurze merged commit 601f275 into spatie:main Aug 22, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix empty PDF issue with Puppeteer ^23.0.0 #876

Fix empty PDF issue with Puppeteer ^23.0.0 #876

Fix empty PDF issue with Puppeteer ^23.0.0 #876

Fix empty PDF issue with Puppeteer ^23.0.0 #876

Conversation