Commit Graph

30 Commits (5383eeae49d18bf59de427d16ea3f12d61d8b899)

Author SHA1 Message Date
Micah Snyder a8ca96687a Clean up of PDF object finding logic. Changes include recording object sizes as objects are found, identifying object streams in the object parsing section instead of the PDF parsing section, and limiting of stream and other object parsing to the size of the object instead of the size of the PDF. It is also easier to read and includes more inline documentation. 6 years ago
Micah Snyder 52cddcbcfd Updating and cleaning up copyright notices. 6 years ago
Micah Snyder 72fd33c8b2 clang-format'd using new .clang-format rules. 6 years ago
Micah Snyder (micasnyd) 89d5207b31 Added new pdf object stream parsing capability. 7 years ago
Micah Snyder 66f70f6a34 Removing macro used by TooManyFilters heuristic. 7 years ago
Micah Snyder 726918859a bb11896 -- increasing size of max filter for pdf filter heuristic. Still have work to be done to investigate whitelisting issue. This commit additionally fixes a compile error when building with json-c and fixes a couple of minor warnings. 8 years ago
Micah Snyder e09d884341 eliminated a large number of warnings, many of which had to do with mixing types. i switched some types to size_t and a couple to ptrdiff_t to make things more consistent, but there is a huge amount of work to be done to make types consistent. int, unsigned int, unsigned, off_t, and other types are ill-suited to storing buffer lengths or memory addresses. 8 years ago
Kevin Lin a865a9ace6 pdf: add filter count heuristic 9 years ago
Kevin Lin 1d0cdc67c1 pdf: open abi for pdfdecode usage 9 years ago
Kevin Lin a9584bfeed pdf: filter order tracking implementation (max:64) 9 years ago
Mickey Sola 46a35abe56 mass update of copyright headers 10 years ago
Kevin Lin 9d33052fe7 pdf: correctly handle encryption objects to decrypt 10 years ago
Kevin Lin 24db616f5b pdf: base64 encode strings that fail to finalize 10 years ago
Kevin Lin e2b1880fa6 pdf: string decryption and code clean-up 10 years ago
Kevin Lin e098bf4bd9 Revert "pdf strings are now base64 encoded if utf conversion fails" 10 years ago
Kevin Lin 6c3cc09415 pdf strings are now base64 encoded if utf conversion fails 10 years ago
Shawn Webb 09ff140939 Add more PDF properties 11 years ago
Shawn Webb 224d1c4de0 Use the right struct with the PDF stats callbacks 11 years ago
Shawn Webb 930b93953b Reorganize new PDF code to a new file 11 years ago
Shawn Webb dd101bee5b Add dictionary parsing support. 11 years ago
Shawn Webb 7a98488d66 Add to the array linked list in the right order, rather than reversed. 11 years ago
Shawn Webb 440f1fffc6 Add the ability to parse arrays in PDFs. Check if the /Kids count doesn't match /Count for /Pages objects in PDFs. 11 years ago
Shawn Webb 5091689d30 Handle hex strings. Fix a couple bugs when handling indirect objects. 11 years ago
Shawn Webb 1412b807df If an indirect object is specified, then try to use that. 11 years ago
Török Edvin dc200c6b19 Add bytecode API for pdf. 15 years ago
aCaB 2d5dbc3730 binhex, pdf, unzip_single to fmap 16 years ago
Tomasz Kojm 72ce4b70eb improve handling of PDF, CAB, RTF, OLE2 and HTML files (sync with branch/0.93) 17 years ago
Tomasz Kojm 2023340a41 update copyrights and stick more files to GPLv2; move and add more credits to the AUTHORS file; add COPYING.BSD 17 years ago
aCaB d070d47570 otf pdf scanning 18 years ago
Sven Strickroth a99111f050 remove old CVS-stuff and make the repository look more like SVN 19 years ago
Tomasz Kojm 48b7b4a747 update GPL headers with new address for FSF 19 years ago
Nigel Horne 2c313298c8 Send ctx to pdf 20 years ago
Tomasz Kojm 798308ded9 activate PDF code 20 years ago
Nigel Horne d056cc177b First draft 20 years ago
Nigel Horne 5f1ab8db98 First draft 20 years ago
Tomasz Kojm b2d28e76d6 update 21 years ago
Luca Gibelli e3aaff8e10 Initial revision 22 years ago