clamav

Commit Graph

Author	SHA1	Message	Date
Micah Snyder	e48dfad49a	Windows: Fix C/Rust FFI compat issue + Windows compile warnings Primarily this commit fixes an issue with the size of the parameters passed to cli_checklimits(). The parameters were "unsigned long", which varies in size depending on platform. I've switched them to uint64_t / u64. While working on this, I observed some concerning warnigns on Windows, and some less serious ones, primarily regarding inconsistencies with `const` parameters. Finally, in `scanmem.c`, there is a warning regarding use of `wchar_t *` with `GetModuleFileNameEx()` instead of `GetModuleFileNameExW()`. This made me realize this code assumes we're not defining `UNICODE`, which would have such macros use the 'A' variant. I have fixed it the best I can, although I'm still a little uncomfortable with some of this code that uses `char` or `wchar_t` instead of TCHAR. I also remove the `if (GetModuleFileNameEx) {` conditional, because this macro/function will always be defined. The original code was checking a function pointer, and so this was a bug when integrating into ClamAV. Regarding the changes to `rijndael.c`, I found that this module assumes `unsigned long` == 32bits. It does not. I have corrected it to use `uint32_t`.	1 year ago
Micah Snyder	1cb7ab4dc3	Improve LZH file type magic sigs, and C-Rust FFI memory leak	1 year ago
Micah Snyder	3ae9c1e434	Add LHA/LZH archive support File type magic signatures chosen based on the extensions supported by Rust delharc crate. See: https://docs.rs/delharc/latest/delharc/	1 year ago
Micah Snyder	1e5ddefcee	Clang-format touchup	1 year ago
Micah Snyder	0e5d3f97f1	Fix GitHub code scan issues	1 year ago
Micah Snyder	405829ee88	Refine max-allocation and safer-allocation function and macro names We add the _OR_GOTO_DONE suffix to the macros that go to done if the allocation fails. This makes it obvious what is different about the macro versus the equivalent function, and that error handling is built-in. Renamed the cli_strdup to safer_strdup to make it obvious that it exists because it is safer than regular strdup. Regular strdup doesn't have the NULL check before trying to dup, and so may result in a NULL-deref crash. Also remove unused STRDUP (_OR_GOTO_DONE) macro, since the one with the NULL-check is preferred.	1 year ago
Micah Snyder	39070d1c76	Remove additional memory allocation limits relating to signature load Variables like the number of signature parts are considered trusted user input and so allocations based on those values need not check the memory allocation limit. Specifically for the allocation of the normalized buffer in cli_scanscript, I determined that the size of SCANBUFF is fixed and so safe, and the maxpatlen comes from the signature load and is therefore also trusted, so we do not need to check the allocation limit.	1 year ago
Micah Snyder	609ace2e3c	Remove unnecessary max-allocation limit checks from bytecode runtime Allocations for bytecode signatures to work need not check against the memory allocation limit, as bytecode signatures are considered trusted user input. You may note that I did not remove allocation limits from the bytecode API functions that may be called by the signatures such as adding json objects, hashsets, lzma and bz2 decompressors, etc. This is because it is likely that a bytecode signature may call them more times based on the structure of the file being scanned - particularly for the json objects.	1 year ago
Micah Snyder	9dc80eb8e7	Add max-allocation limit to bytecode API's malloc function Bytecode signature's are able to allocate buffers, but should probably adhere to clamav's max allocation limit. This adds a check to make sure they don't accidentally alloc too much based on untrusted user input.	1 year ago
Micah Snyder	c9a725dc70	Remove duplicate copy of CLI_STRDUP macro A code merge resulted in a duplicate copy of the CLI_STRDUP macro. Also fixed formatting.	1 year ago
Micah Snyder	7033a18e67	Remove duplicate max-alloc checks for lzma and 7z alloc functions Some sort of code merge way-back-when resulted in two identical max-allocation checks. I removed the noisy ones.	1 year ago
Micah Snyder	902623972d	Remove max-allocation limits where not required The cli_max_malloc, cli_max_calloc, and cli_max_realloc functions provide a way to protect against allocating too much memory when the size of the allocation is derived from the untrusted input. Specifically, we worry about values in the file being scanned being manipulated to exhaust the RAM and crash the application. There is no need to check the limits if the size of the allocation is fixed, or if the size of the allocation is necessary for signature loading, or the general operation of the applications. E.g. checking the max-allocation limit for the size of a hash, or for the size of the scan recursion stack, is a complete waste of time. Although we significantly increased the max-allocation limit in a recent release, it is best not to check an allocation if the allocation will be safe. It would be a waste of time. I am also hopeful that if we can reduce the number allocations that require a limit-check to those that require it for the safe scan of a file, then eventually we can store the limit in the scan- context, and make it configurable.	1 year ago
Micah Snyder	8e04c25fec	Rename clamav memory allocation functions We have some special functions to wrap malloc, calloc, and realloc to make sure we don't allocate more than some limit, similar to the max-filesize and max-scansize limits. Our wrappers are really only needed when allocating memory for scans based on untrusted user input, where a scan file could have bytes that claim you need to allocate some ridiculous amount of memory. Right now they're named: - cli_malloc - cli_calloc - cli_realloc - cli_realloc2 ... and these names do not convey their purpose This commit renames them to: - cli_max_malloc - cli_max_calloc - cli_max_realloc - cli_max_realloc2 The realloc ones also have an additional feature in that they will not free your pointer if you try to realloc to 0 bytes. Freeing the memory is undefined by the C spec, and only done with some realloc implementations, so this stabilizes on the behavior of not doing that, which should prevent accidental double-free's. So for the case where you may want to realloc and do not need to have a maximum, this commit adds the following functions: - cli_safer_realloc - cli_safer_realloc2 These are used for the MPOOL_REALLOC and MPOOL_REALLOC2 macros when MPOOL is disabled (e.g. because mmap-support is not found), so as to match the behavior in the mpool_realloc/2 functions that do not make use of the allocation-limit.	1 year ago
Micah Snyder	6d6e04ddf8	Optimization: replace limited allocation calls There are a large number of allocations for fix sized buffers using the `cli_malloc` and `cli_calloc` calls that check if the requested size is larger than our allocation threshold for allocations based on untrusted input. These allocations will always be higher than the threshold, so the extra stack frame and check for these calls is a waste of CPU. This commit replaces needless calls with A -> B: - cli_malloc -> malloc - cli_calloc -> calloc - CLI_MALLOC -> MALLOC - CLI_CALLOC -> CALLOC I also noticed that our MPOOL_MALLOC / MPOOL_CALLOC are not limited by the max-allocation threshold, when MMAP is found/enabled. But the alternative was set to cli_malloc / cli_calloc when disabled. I changed those as well. I didn't change the cli_realloc/2 calls because our version of realloc not only implements a threshold but also stabilizes the undefined behavior in realloc to protect against accidental double-free's. It may be worth implementing a cli_realloc that doesn't have the threshold built-in, however, so as to allow reallocaitons for things like buffers for loading signatures, which aren't subject to the same concern as allocations for scanning possible malware. There was one case in mbox.c where I changed MALLOC -> CLI_MALLOC, because it appears to be allocating based on untrusted input.	1 year ago
Micah Snyder	2cc47c83ac	Make image fuzzy hashing optional Image fuzzy hashing is enabled by default. The following options have been added to allow users to disable it, if desired. New clamscan options: --scan-image[=yes()/no] --scan-image-fuzzy-hash[=yes()/no] New clamd config options: ScanImage yes()/no ScanImageFuzzyHash yes()/no New libclamav scan options: options.parse &= ~CL_SCAN_PARSE_IMAGE; options.parse &= ~CL_SCAN_PARSE_IMAGE_FUZZY_HASH; This commit also changes scan behavior to disable image fuzzy hashing for specific types when the DCONF (.cfg) signatures disable those types. That is, if DCONF disables the PNG parser, it should not only disable the CVE/format checker for PNG files, but also disable image fuzzy hashing for PNG files. Also adds a DCONF option to disable image fuzzy hashing: OTHER_CONF_IMAGE_FUZZY_HASH DCONF allows scanning features to be disabled using a configuration "signature".	1 year ago
Neil Wilson	5e63c42696	Update libclamav.map with missing symbols	1 year ago
rsundriyal	24226f765f	Bumped version from 1.3.0 -> 1.4.0-devel for new release changes	1 year ago
Micah Snyder	5f934c16b4	Update bytecode api functionality levels and add news from recent patch versions	1 year ago
Micah Snyder	2b55c15b8c	OLE2: Fix integer overflow that may result in over read An integer overflow when calculating remainingBytes may cause a large buffer over read. Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=62542	1 year ago
Micah Snyder	82491dabaa	PDF: Fix 1-byte overread An overread may occur if attempting to decrypt an empty string. Issue introduced during 1.3 development. Fixes: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=66281	1 year ago
Micah Snyder	ebe3c50555	PDF: Minor optimizations Store temp files with obj id and gen id so analysts know which is which. Don't dump decoded objects immediately. They'll get dumped later at the end of pdf_extract_obj(). At the end of PDF object extraction, we don't need to find out the "dumpid" (aka the object index in our list of pdf objects). It isn't actually used! So I removed the unused parameter.	1 year ago
Micah Snyder	35f277c8cb	PDF: Add support for checking empty owner password Specifically for algorithm 6 (/R 6). Use the O and OE strings to test if an empty owner password will decrypt the file.	1 year ago
Micah Snyder	d114e3fc66	PDF: Fix PDF metadata decryption issues The encrypted metadata may be stored in a <> block containing hex bytes. Strip off the <> and decode the hex to binary.	1 year ago
Micah Snyder	9cb28e51e6	Bump copyright dates for 2024	1 year ago
RainRat	1b17e20571	Fix typos (no functional changes)	1 year ago
Micah Snyder	fd11f1b468	Add CL_TYPE_PYTHON_COMPILED and associated file type magic signatures It may be necessary to differentiate between *.pyc and other binary types in case additional processing is needed. Outside of being able to differentiate the by file type, the scanner will treat CL_TYPE_PYTHON_COMPILED the same as CL_TYPE_BINARY_DATA. That is - we're not adding parser at this time to further break down .pyc files.	1 year ago
Micah Snyder	cb45aebe13	Fix several coverity issues in UnRAR interface code Fix Coverity issues 192935, 192932, 192928, and 192917. None of these are particularly serious. I thought I'd clean them up while trying to track down a strange crash that occurs in Windows debug builds with my specific setup when freeing the metadata filename pointer malloced by the UnRAR iface "peek" function. I wasn't able to figure out why freeing that causes a crash, so instead I converted it to an array that need not be freed, and my troubles melted away.	1 year ago
Micah Snyder	a51b3ca606	FMap: Windows & 32bit compatibility fix for Rust interface The fmap structure has some stuff that differs in size in memory between Linux and Windows, and between 32bit and 64bit architectures. Notably, `time_t` appears to be defined by the Rust bindgen module as `ulong` which may be either 8 bytes or 4 bytes, depending architecture (thanks, C). To resolve this, we'll store time as a uint64_t instead. The other problem in the fmap structure is the windows file and map handles should always be exist, and may only be used in Windows, for consistency in sizing of the structure.	1 year ago
Micah Snyder	00b78531e3	Silence PNG early end-of-file warning While interesting, it does not appear this warning is useful to anyone.	1 year ago
Micah Snyder	3b2f8c044a	Support for extracting attachments from OneNote section files Includes rudimentary support for getting slices from FMap's and for interacting with libclamav's context structure. For now will use a Cisco-Talos org fork of the onenote_parser until the feature to read open a onenote section from a slice (instead of from a filepath) is added to the upstream.	1 year ago
RainRat	dbc6ffeb5f	Remove redundant memset Fixes https://github.com/Cisco-Talos/clamav/issues/1087	1 year ago
RainRat	d649c15dcf	Fix minor struct initialization copy-paste bug Incorrect `memset` use in `cli_regex2suffix` does not fully clear the `root_node` structure.	1 year ago
Albert Chin-A-Young	a8cc82df57	Fix SIGBUS on HP-UX/IA when built as a 64-bit binary Use memcpy() to fix alignment issues.	2 years ago
RainRat	caf324e544	Fix typos (no functional changes)	2 years ago
Micah Snyder	ebd30d7dbe	Fix PDF decryption issue for some empty password files Some PDF's with an empty password can't be decrypted. Investigation found that the problem is a strlen check to prevent an overflow rather than passing down the actual length of the allocated field. Specifically, the UE buffer may have NULL values in it, so a strlen check will claim the field is shorter than it is and then later checks fail because the length is the wrong size. While at it, I improved code comments on the function reading dictionary key-value strings and switched a flag use a bool rather than an int.	2 years ago
Micah Snyder	8430de5f8b	Fix alert-exceeds-max feature for files > 2GB and < max-filesize The --alert-exceeds-max feature should alert for all files larger than 2GB because 2GB is the internal limit for individual files. This isn't working correctly because the `goto done;` exit condition after recording the exceeds-max heuristic skips over the logic that reports the alert. This fix moves the ">2GB" check up to the location where the max-filesize engine option is set by clamd or clamscan. If max-filesize > 2GB - 1 is requested, then max-filesize is set to 2GB - 1. Additionally, a warning is printed if max-filesize > 2GB is requested (with an exception for when it's maxed out by setting --max-filesize=0). Resolves: https://github.com/Cisco-Talos/clamav/issues/1030	2 years ago
Yasuhiro Kimura	14d7d215b1	Fix link error with LLD 17 Developers of FreeBSD base system are currently working to upgrade its LLVM/Clang/LLDB/LLD to 17. As a part of it they tried building all ports in FreeBSD ports collections to check if build of them succeeds with LLVM/Clang/LLD 17. As a result there are some ports that fail to be built with it and unfortunately `security/clamav` is one of them. The build of it fails with link error as following. ``` ld: error: version script assignment of 'CLAMAV_PRIVATE' to symbol 'cli_cvdunpack' failed: symbol not defined ld: error: version script assignment of 'CLAMAV_PRIVATE' to symbol 'cli_dbgmsg_internal' failed: symbol not defined ld: error: version script assignment of 'CLAMAV_PRIVATE' to symbol 'init_domainlist' failed: symbol not defined ld: error: version script assignment of 'CLAMAV_PRIVATE' to symbol 'init_whitelist' failed: symbol not defined ld: error: version script assignment of 'CLAMAV_PRIVATE' to symbol 'cli_parse_add' failed: symbol not defined ld: error: version script assignment of 'CLAMAV_PRIVATE' to symbol 'cli_bytecode_context_clear' failed: symbol not defined cc: error: linker command failed with exit code 1 (use -v to see invocation) ``` According to the investigation of ClamAV's source code, `cli_cvdunpack` is a static function so it isn't visible to external consumers. And other mentioned symbols aren't found anywhere. So fix link error by removing all of them from linker version script.	2 years ago
rasundri	1ba3f035d3	Bumped version from 1.2.0 -> 1.3.0-devel FLEVEL 190 -> 200 Updated the News to have section for the new version and added previous patch versions.	2 years ago
Micah Snyder	e27a450bf8	ZIP: Always parse file names Having the filename is useful for certain callbacks, and will likely be more useful in the future if we can start comparing detected filetypes with file extensions. E.g. if filetype is just "binary" or "text" we may be able to do better by trusting a ".js" extension to determine the type. Or else if detected file type is "pe" but the extension is ".png" we may want to say it's suspicious. Also adjusted the example callback program to disable metadata option. The CL_SCAN_GENERAL_COLLECT_METADATA is no longer required for the Zip parser to record filenames for embedded files, and described in the previous commit. This program can be used to demonstrate that it is behaving as desired.	2 years ago
Micah Snyder	0c03b8b6bb	Error reporting touchup for max-filesize	2 years ago
Andy Ragusa	80bb7c8d26	ClamD: Fix error reporting when exceeding max filesize Previously clamd would report "Cannot allocate memory" when a file exceeded max file size. This commit corrects it to report "Heuristics.Limits.Exceeded.MaxFileSize." Fixes: https://github.com/Cisco-Talos/clamav/issues/670.	2 years ago
Micah Snyder	8e231707d2	Fix infinite loop when scanning some DMG archives When decompressing a zlib stream, it's possible to reach end of stream before running out of available bytes. In the DMG parser, this may cause an infinite loop. This commit adds a check for the condition where stream has ended before running out of input. Fixes: https://github.com/Cisco-Talos/clamav/issues/925	2 years ago
Micah Snyder	178b7706b0	Coverity-415952: Remove logically dead code In aspack decrypt function, there's a check to make sure that backbytes doesn't exceed 57, because it is used as an index in init_array. However, it is mathematically impossible. So this commit removes the check.	2 years ago
Micah Snyder	7d621c9506	Coverity-415954: Remove duplicated/dead code	2 years ago
Micah Snyder	3193a6be87	Coverity-415955: Fix Y2K38 issue with clambc --info datetime	2 years ago
Micah Snyder	0d3dc86f90	Coverity-514958: Error handling check with getpagesize call `cli_getpagesize()` may return -1 in an error condition. If it does, let's just treat it as 4096. I believe the actual coverity complaint is a false positive, but it's fair to account for the error case and this should shut it up.	2 years ago
Micah Snyder	ba49cbfafa	Fix bounds check issue in PDF parser The bytes_remaining variable may be set negative by mistake, when really we just want to decrement it. This issue may result in a 1-byte over read but does not cause any crash. We determined that this issue is not a vulnerability. Fixes: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=58475	2 years ago
Micah Snyder	4c34301a4f	Coverity 416446: Fix possible NULL dereference Missing a NULL-check before calling `cli_unlink()` which has no NULL check of its own before calling `unlink`.	2 years ago
Micah Snyder	ac22076a57	Coverity-416449, 416447: error handling issue in UDF parser In `cli_scanudf()` it's possible to `goto done;` before memset'inng the fileIdentifierList and fileEntryList. This would likely cause uninitialized pointer reads. Instead of memset, just initialize the structs with `= {0};`	2 years ago
Micah Snyder	9a871f7c4e	Export symbols for calculating image fuzzy hash These symbols are used by an internal python tool for generating signatures: - fuzzy_hash_calculate_image - ffierror_fmt `ffierror_fmt` is required to free the error structure passed back in case of an error. Since version 1.1.0 started using libclamav.map again, we need to explicitly export these symbols.	2 years ago

1 2 3 4 5 ...

5061 Commits (00886ee90d07a3b382041b1a9ef0c01e093f571e)