Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
monsters [2021/07/23 17:32]
christian [Missing object]
monsters [2021/07/24 08:12] (current)
christian [Incorrect stream length]
Line 41: Line 41:
 === Handling === === Handling ===
  
-This will raise a ''MissingObjectError'' with a ''MissingObject'' containing the type information for the expected object as parameter.+A proceedable ''MissingObjectError'' is raised. The reference points to a ''MissingObject'' containing the expected type.
  
 When writing out the reference to a new PDF, a string ''(The original object is missing)'' is written as object instead (if type information is available, it is added to the message). This preserves the correct reference which may be used in several places. Subsequently, this will result in a type mismatch when reading that PDF (unless the original object was a string as well, which is unlikely). When writing out the reference to a new PDF, a string ''(The original object is missing)'' is written as object instead (if type information is available, it is added to the message). This preserves the correct reference which may be used in several places. Subsequently, this will result in a type mismatch when reading that PDF (unless the original object was a string as well, which is unlikely).
Line 51: Line 51:
 ===== Incorrect stream length ===== ===== Incorrect stream length =====
  
-The ''/Length'' of a stream is smaller than the number of bytes.+The ''/Length'' of a stream is different from the size of the content. Ths content are the bytes between token ''stream'', followed by one ''lf'', and token ''endstream'' with optional extra whitespace before the token. 
 + 
 +The following cases are possible: 
 +  * ''/Length'' is smaller than the content. The ''endstream'' token lies ahead. 
 +  * ''/Length'' is larger than the content.  The ''endstream'' token, or parts of it, has been read as part of the stream already. 
 + 
 +The particular monster where I encountered this, had always one byte too much in the content. Therefore, not the general problem was handled, but just the simple case where the content is exactly 1 larger than the number of bytes given by the ''/Length'' attribute.
  
 === Example === === Example ===
Line 57: Line 63:
 <code> <code>
 42 0 obj 42 0 obj
-<</Length 9>> + <</Length 9>> 
-stream+ stream
 abcdefghij abcdefghij
-endstream+ endstream
 endobj endobj
 </code> </code>
Line 71: Line 77:
  
 If there are more bytes extra, a ''ReadError'' is raised and no stream object is created. The error may be proceeded, but if the stream is used later, another error will occur. If there are more bytes extra, a ''ReadError'' is raised and no stream object is created. The error may be proceeded, but if the stream is used later, another error will occur.
 +
 +== Known problem ==
 +
 +The general problem has not been adressed. One idea is to find the end of the stream content of the current object. With this information it is possible to determine if the ''/Length'' entry is too small or too big and what to do about it.
 +
 +The end of the stream would be before the ''endobj'' and ''endstream'' tokens before the start of the next object. The cross reference table has the offsets of all objects in consecutive order. Thus, the next object after the current is given by the next reference in the table. If the current object is the last object in the PDF, the end of the object is before the ''xref'' token starting the cross reference table.
 +
 +Object streams need not be considered, because they cannot contain streams.
 +
 +This should be easy for the simple case of only one xref table. But handling several xrefs from different updates deemed too complex at the time (that's why I write this here as a reminder for the next time I need to deal with this problem).
  
 === Reference === === Reference ===
  
-Seen in ''/info/Producer'': ''Bluebeam PDF Library 18''+Seen in ''/Info/Producer'': ''Bluebeam PDF Library 18''
  • monsters.1627054351.txt.gz
  • Last modified: 2021/07/23 17:32
  • by christian