Understanding PDF Corruption
A PDF file starts with a header (e.g., `%PDF-1.7`) and ends with a trailer containing an EOF marker. In between, there is a complex web of numbered objects referencing each other via byte offsets listed in an XREF table at the end of the file.
Most "Corruption" happens when this XREF table is damaged or truncated (often due to an incomplete download). PDF viewers rely on this table to know where page 1 starts vs page 2. If the table is wrong, the viewer sees a blank screen or an error.
RiseConvert's repair tool ignores the broken table. Instead, it "brute-forces" the file, reading every byte to recognize object patterns (like `1 0 obj ... endobj`) and builds a new, valid table from scratch.
