clamav

Commit Graph

Author	SHA1	Message	Date
Micah Snyder	14320ec243	Rust: Update pinned dependency versions in Cargo.lock file	1 year ago
Micah Snyder	6e1afbbb62	Reduce C-Rust FFI complexity for HTML CSS image extraction logic The C-Rust FFI code is needlessly complex. Now that we are calling into magic_scan from Rust, we can simply hand off the <style> block contents to Rust code to handle extraction and scanning.	1 year ago
Micah Snyder	b6ebfbdf11	clam-format touchup	1 year ago
Andy Ragusa	79f2a5f2f6	Add parser for ALZ archives	1 year ago
RainRat	143d23c326	Fix typos and remove duplicate #include	1 year ago
Micah Snyder	ed93242fef	Review fix: typo	1 year ago
Micah Snyder	e48dfad49a	Windows: Fix C/Rust FFI compat issue + Windows compile warnings Primarily this commit fixes an issue with the size of the parameters passed to cli_checklimits(). The parameters were "unsigned long", which varies in size depending on platform. I've switched them to uint64_t / u64. While working on this, I observed some concerning warnigns on Windows, and some less serious ones, primarily regarding inconsistencies with `const` parameters. Finally, in `scanmem.c`, there is a warning regarding use of `wchar_t *` with `GetModuleFileNameEx()` instead of `GetModuleFileNameExW()`. This made me realize this code assumes we're not defining `UNICODE`, which would have such macros use the 'A' variant. I have fixed it the best I can, although I'm still a little uncomfortable with some of this code that uses `char` or `wchar_t` instead of TCHAR. I also remove the `if (GetModuleFileNameEx) {` conditional, because this macro/function will always be defined. The original code was checking a function pointer, and so this was a bug when integrating into ClamAV. Regarding the changes to `rijndael.c`, I found that this module assumes `unsigned long` == 32bits. It does not. I have corrected it to use `uint32_t`.	1 year ago
Micah Snyder	1cb7ab4dc3	Improve LZH file type magic sigs, and C-Rust FFI memory leak	1 year ago
Micah Snyder	3ae9c1e434	Add LHA/LZH archive support File type magic signatures chosen based on the extensions supported by Rust delharc crate. See: https://docs.rs/delharc/latest/delharc/	1 year ago
Micah Snyder	9cb28e51e6	Bump copyright dates for 2024	1 year ago
RainRat	1b17e20571	Fix typos (no functional changes)	1 year ago
Micah Snyder	1132209ef5	Fix C/Rust FFI pointer type to resolve arm64 compatibility issue and update generated sys.rs	1 year ago
Micah Snyder	a51b3ca606	FMap: Windows & 32bit compatibility fix for Rust interface The fmap structure has some stuff that differs in size in memory between Linux and Windows, and between 32bit and 64bit architectures. Notably, `time_t` appears to be defined by the Rust bindgen module as `ulong` which may be either 8 bytes or 4 bytes, depending architecture (thanks, C). To resolve this, we'll store time as a uint64_t instead. The other problem in the fmap structure is the windows file and map handles should always be exist, and may only be used in Windows, for consistency in sizing of the structure.	1 year ago
Micah Snyder	9adde39b80	CMake: Rust bindgen w/ C libs outside std include path In order to generate Rust bindings for C code, Rust's bindgen module needs to know where to find all headers included by the API. If they're all inside the project or inside the standard include path (e.g. /usr/include and /usr/local/include) that's fine. But for third- party C library headers from outside the standard include path, that's a problem. We didn't really notice this problem when generating on Unix systems until we switched to use OpenSSL 3.1 and tested on systems that have the OpenSSL 1.1.1 dev package installed. The ability to find headers outside the project path is also needed to generate bindings on Windows, if desired. This commit solves the problem by passing include directories for the ClamAV::libclamav CMake build target to the Rust build via the CARGO_INCLUDE_DIRECTORIES environment variable. Then, in the `libclamav_rust/build.rs` script, where we run bindgen, we split that `;` separated string into invididual paths and add each to the bindgen builder.	1 year ago
Micah Snyder	3b2f8c044a	Support for extracting attachments from OneNote section files Includes rudimentary support for getting slices from FMap's and for interacting with libclamav's context structure. For now will use a Cisco-Talos org fork of the onenote_parser until the feature to read open a onenote section from a slice (instead of from a filepath) is added to the upstream.	1 year ago
driverxdw	63fa9b4553	CMake: add static library installation of libclamav submodules For static builds, also install libclamav_rust, libclammspack, libclamunrar_iface, and libclamunrar static libraries.	1 year ago
Micah Snyder	e3a0d9a538	Minor libclamav_rust doc and cbindgen typo fix	1 year ago
RainRat	caf324e544	Fix typos (no functional changes)	2 years ago
Micah Snyder	86ba9bc8ce	Fix warning when scanning some HTML files HTML files with <style> blocks containing non-utf8 sequences are causing warnings when processing them to extract base64 encoded images. To resolve this, we can use the to_string_lossy() method that may allocate and sanitize a copy of the content if the non-utf8 characters are encountered. Resolves: https://github.com/Cisco-Talos/clamav/issues/1082	2 years ago
Micah Snyder	3f7671928d	Cargo: Eliminate security warning about unused atty dependency atty is unmaintained but is still used by clap. Disabling the default features for cbindgen removes the clap dependency and thus removes atty. Resolves: https://github.com/Cisco-Talos/clamav/security/dependabot/2	2 years ago
Micah Snyder	6cdce8e4a9	Build system: Bump bindgen to latest version I'm unsure why, but building with cmke -D MAINTAINER_MODE=ON is failing right now. Updating to a newer version of bindgen appears to resolve the issue. I was able to update it by changing the version specified in libclamav_rust/Cargo.toml, and then running `cargo update -p bindgen` Not that I expect anyone else to be running maintainer-mode, but I did also confirm using `cargo-msrv` that the minimum supported version of rust did not change as a result of this commit.	2 years ago
Micah Snyder	2a21451e1f	Fix possible crash in HTML CSS image extraction When processing UTF-8 HTML code, the image extraction logic may panic if the string contains a multi-byte grapheme that includes a '(', ')', whitespace, or one of the other characters used to split the text when searching for the base64 image content. The panic is because the `split_at()` method will panic if you try to split in the middle of a unicode grapheme. This commit fixes the issue by processing the HTML string one grapheme at a time instead of one character (byte) at a time. The `grapheme_indices()` method is used to get the correct position of the start of each grapheme for splitting the string.	2 years ago
Orion Poplawski	f66861755a	Update cbindgen 0.24	2 years ago
Micah Snyder	5e59393994	Freshclam, Sigtool: Fix bug creating new files in CLD with CDIFF The CLOSE command is failing to create a file when appending changes if the file does not already exist. This prevents adding new files to a database with a CDIFF and caused failures applying the test-3.cdiff file in the freshclam feature tests. Also improved the error message to show which command, specifically, is failing (not just the line number).	2 years ago
Micah Snyder	ccd68ffb64	Fix Rust Clippy linter complaints	2 years ago
Micah Snyder	60f3413bf3	Freshclam/Sigtool: Fix CDIFF Unlink operation Any cdiff or script using the UNLINK operation will fail to delete the file claiming "No DB open for action UNLINK". The UNLINK operation appears to be trying to delete a currently open database, when in fact it should ensure no database is open before deleting the local file given by the single "db_name" parameter.	2 years ago
Micah Snyder	1789c10eae	Update generated sys.rs file	2 years ago
Micah Snyder	1dd159bd95	CMake: Fix issue generating Rust bindings with -D MAINTAINER_MODE=ON	2 years ago
Micah Snyder	6eebecc303	Bump copyright for 2023	2 years ago
Micah Snyder	dcaaf86a4b	HTML <style> image extraction improvement I found that the `url(data:` type does not matter to a browser. In addition, whitespace may be placed in a few locations and the browser will ignore it. This commit accounts for this, and updates the test accordingly.	2 years ago
Micah Snyder	33eeb46b58	Test: verify clamscan detecting 2 images from same HTML style block	2 years ago
Micah Snyder	6f54fe2d66	Find and scan base64'd images found in HTML <style> url() args This commit adds a feature to find, decode, and scan each image found within HTML <style> tags where the image data is embedded in `url()` function parameters a base64 blob In C in the html normalization process we extract style tag contents to new buffer for processing. We call into a new feature in Rust code to find and decode each image (if there are multiple). Once extracted, the images are scanned as contained files of unknown type, and file type identifcation will determine the actual type.	2 years ago
Micah Snyder	53baf95b34	Mitigate crashes when image decoder fails If the image decoders (i.e. jpeg, tiff, png, etc.) fail to load an image due to a panic, the application will crash. This commit attempts to catch those panics and handle the error.	3 years ago
Micah Snyder	542baf69c6	Update generated sys.rs file	3 years ago
Micah Snyder	6fffae9843	Update generated sys.rs file	3 years ago
Micah Snyder	c052dbeed9	Update generated sys.rs file	3 years ago
Micah Snyder	01fddccd8e	Fix issue reporting all Heuristic/PUA matches in allmatch mode When potentially unwanted indicators are found, they are not reported right away, except in heuristic-precedence mode. At the end of the scan, we then report all of the heuristic/PUA alerts. In my initial overhaul of the allmatch mode, I introduced an issue where it is never reporting more than 1 heuristic/PUA alert. This commit fixes that by reporting all potentially-unwanted indicators at the end. Note that instead of using `cli_virus_found_cb()` which would report the top layer file descriptor as well in the callback, I'm reporting `-1` for the file descriptor instead, because we don't know when the alert was added. If it was at a deeper layer in the file, we would not longer have access to it at the end of the scan. Also removed excessive debug statements '... infected with ...' that use `cli_get_last_virus()` because it doesn't really reflect all of the matches in allmatch mode, because the match would've been reported previously (so it is excessive), and because I don't like the word 'infected'. :)	3 years ago
Micah Snyder	621381e0cd	Allmatch-mode overhaul, part 1: append_virus Rework the append_virus mechanism to store evidence (strong indicators, pua indicators, and eventually weak indicators) in vectors. When appending a "virus", we will return CLEAN when in allmatch-mode, and simply add the indicator to the appropriate vector. Later we can check if there were any alerts to return a vector by summing the lengths of the strong and pua indicator vectors. This does away with storing the latest "virname" in the scan context. Instead, we can query for the last indicator in the evidence, giving priority to strong indicators. When heuristic-precendence is enabled, add PUA as Strong instead of as PotentiallyUnwanted. This way, they will be treated equally and reported in order in allmatch mode. Also document reason for disabling cache with metadata JSON enabled	3 years ago
Micah Snyder	d938bd9ff9	Tests: break out clamscan tests into separate files The `clamscan_test.py` file is getting way too long. Created a new `unit_tests/clamscan` directory and separated all tests into separate test files. I also fixed an issue with the clamscan `ign2` test: The `ign2` test wasn't written correctly and was actually testing detection despite using the `-d` parameter to try to ignore a signature. There is a minor bug where `ign2` files may be loaded after other files when using the `-d` option. It is only guaranteed to be loaded first if you load all the sigs from the same directory. I fixed the test. In the future, we should make it so all database files are sorted in a list before load time regardless of where they're sourced from.	3 years ago
Micah Snyder	cf812993b6	Update generated sys.rs file	3 years ago
Micah Snyder	cd3134568a	Code quality: Refactor layer attributes as scan parameter The current implementation sets a "next layer attributes" flag field in the scan context. This may introduce bugs if accidentally not cleared during error handling, causing that attribute to be applied to a different layer than intended. This commit resolves that by adding an attribute flag to the major internal scan functions and removing the "next layer attributes" from the scan context. This attributes flag shares the same flag fields as the attributes flag in the new file inspection callback and the flags are defined in `clamav.h`.	3 years ago
Micah Snyder	9d6ebd6d50	Adds file inspection callback and example code libclamav callbacks can be used to access embedded file content at each layer of extraction during the course of a scan. The existing callbacks only provide access to the file descriptor and a guess at the file type. This patch adds a new callback for the purposes of file/archive inspection that provides additional insight into the embedded file. This includes: - ancestors: an array of parent file names - parent file size: the size of the direct parent layer - file name: current layer's filename, if any. - file buffer (pointer) - file size: size of file buffer - file type: just a guess at the current file's type - file descriptor: may be -1 if the layer is in-memory only. - layer attributes: a flag field. see LAYER_ATTRIBUTE_* defines in clamav.h Two new example apps are added that are automatically built when compiling under CMake: - ex2 demonstrates the prescan callback. - ex3 demonstrates the new file inspection callback. The examples are now installed if enabled, so you can test them in the Docker image, and so that they'll be colocated with the DLLs so you can test them on Windows. The installed examples should also be able to find the UnRAR library at run time, without having to set LD_LIBRARY_PATH. This commit also sets the fmap->name in an fmap-scan using the basname of the provided filename if the caller provided the filename and the provided fmap does not have the name set.	3 years ago
Micah Snyder	6814f71448	CMake+Rust: Don't rebuild Rust dependencies Right now, each Rust-based target added in CMake is being built in its own directory under the build path. This causes Rust to build each module from scratch, meaning any dependencies they have in common are built twice. The solution in this commit is to specify the top level build directory as the target directory for every Rust build and test. Note this also changes where the `clamav_rust.h` file is generated. It is now also placed in the top-level build directory, instead of under the `build/libclamav_rust` directory. That's a bit of a side-effect, and could be rectified if needed, but it appears to have no ill-effects. It's the same location that we drop the clamav-types.h file, so I think it's fine, for now. Note that `clamav_rust.h` is not a public header, it's just so libclamav functions can call into libclamav_rust functions.	3 years ago
Micah Snyder	64d6861d93	CMake, Rust: precompile test executable Precompile the test executable during the main build, after libclamav has built. This will make it so the compile time does not count against the test time. It also, unfortunately, make the main build take way longer. Removed 3 duplicate variables from the test ENVIRONMENT variable.	3 years ago
Micah Snyder	8b6e53a08a	Update generated sys.rs file	3 years ago
Micah Snyder	d9c8cab5be	Windows: Fix utf8 filepath issues * Windows: Fix utf8 filename support issue The function used to verify the real path for the file being scanned fails on some utf8 filenames, like: file_with_ümlaut.eicar.com The scan continues despite the failure, but the error reported from clamd and warnings from clamscan are confusing because it looks like the scan failed even if a verdict shows up later. The fix here is to convert the path from utf8 to utf16 and then use the CreateFileW() API to grab the file handle for the real path check. Resolves: https://github.com/Cisco-Talos/clamav/issues/418 Resolves: CLAM-1782 * Windows: Fix utf8 libclamav logging issues Print to the log using rust eprint() instead of fputs() Rust's eprint() function supports full utf8, while fputs() gets confused and prints stuff like: file_with_├╝mlaut.eicar.com instead of: file_with_ümlaut.eicar.com	3 years ago
Micah Snyder	0d96061e2f	Update generated sys.rs internal Rust bindings	3 years ago
Micah Snyder	ed57b85074	CMake, LLVM, Win32: Fix link issue when LLVM lib list are full paths	3 years ago
Micah Snyder	8bf70207d5	CMake: Fix LLVM linking issues: libclamav_rust, -ltinfo We must pass the LLVM library dependencies to the libclamav_rust build.rs script so it links the libclamav_rust unit test executable with LLVM. Also: - We can remove the libtinfo dependency that was hardcoded for the LLVM 3.6 support, and must remove it for the build to work on Alpine, macOS. - Also, increased the libcheck default timeout from 60s to 300s after experiencing a failure while testing this. - Also made one of the valgrind suppressions more generic to account for inline optimization differences observed in testing this.	3 years ago
Micah Snyder	5756f0eab8	Update generated sys.rs file	3 years ago

1 2

77 Commits (clamav-1.4.0-rc)