clamav

Commit Graph

Author	SHA1	Message	Date
Micah Snyder (micasnyd)	b9ca6ea103	Update copyright dates for 2021 Also fixes up clang-format.	4 years ago
Micah Snyder	e4e3149368	Fix fmap-duplicate performance issue The fmap_duplicate function is used create a new fmap with a view into an existing fmap. When the new view is a different size than the old fmap, a new hash must be calculated for the duplicate fmap. However, when the duplicated fmap is the same size as the original fmap, the hash will be the same and there's no point recalculating. The issue is apparent when scanning large EXE files because the hash was being calculated at the beginning and end of the scan. Digging into this issue revealed that hash calculations for fmaps were also being performed at the wrong place. For scans of maps we use fmap_duplicate() early in the process to apply the name API argument to the duplicate fmap. Fixing the logic so we doing recalculate the hash revealed that we never calculated hashes for fmap's created from buffers in the first place, so that also had to be fixed be relocating where the hash is calculated. I also found that fmap_duplicate()'s offset argument used an off_t, though it and all caller offsets are not allowed to be negative. This was a bit of tangent to fix a bunch of off_t variables and paramters that should've been size_t. Added a couple unit tests to verify that making duplicate fmaps, and duplicate-duplicate fmaps works as expected after the change. Changed CLI_ISCONTAINED() and CLI_ISCONTAINED2() macros to cast to size_t, because pointers and buffer sizes may not be negative, and these two macros do not rely on substraction.	4 years ago
Micah Snyder	9b9999d778	Rename core scanning functions Many of the core scanning functions' names no longer represent their specific purpose or arguments. This commit aims to make the names more intuitive. Names are now prefixed with "magic" if they involve file-typing and file-type parsing. In addition, each function now includes the type of input being scanned whether its "desc", "fmap", or "buff". Some of the APIs also now specify "type" to indicate that a type other than "ANY" may be passed in to select the type rather than use file type magic for type recognition. \| current name \| new name \| \| ------------------------- \| --------------------------------- \| \| magic_scandesc() \| cli_magic_scan() \| \| cli_magic_scandesc_type() \| <delete> \| \| cli_magic_scandesc() \| cli_magic_scan_desc() \| \| cli_base_scandesc() \| cli_magic_scan_desc_type() \| \| cli_partition_scandesc() \| <delete> \| \| cli_map_scandesc() \| magic_scan_nested_fmap_type() \| \| cli_map_scan() \| cli_magic_scan_nested_fmap_type() \| \| cli_mem_scandesc() \| cli_magic_scan_buff() \| \| cli_scanbuff() \| cli_scan_buff() \| \| cli_scandesc() \| cli_scan_desc() \| \| cli_fmap_scandesc() \| cli_scan_fmap() \| \| cli_scanfile() \| cli_magic_scan_file() \| \| cli_scandir() \| cli_magic_scan_dir() \| \| cli_filetype2() \| cli_determine_fmap_type() \| \| cli_filetype() \| cli_compare_ftm_file() \| \| cli_partitiontype() \| cli_compare_ftm_partition() \| \| cli_scanraw() \| scanraw() \|	5 years ago
Micah Snyder	005cbf5a37	Record names of extracted files A way is needed to record scanned file names for two purposes: 1. File names (and extensions) must be stored in the json metadata properties recorded when using the --gen-json clamscan option. Future work may use this to compare file extensions with detected file types. 2. File names are useful when interpretting tmp directory output when using the --leave-temps option. This commit enables file name retention for later use by storing file names in the fmap header structure, if a file name exists. To store the names in fmaps, an optional name argument has been added to any internal scan API's that create fmaps and every call to these APIs has been modified to pass a file name or NULL if a file name is not required. The zip and gpt parsers required some modification to record file names. The NSIS and XAR parsers fail to collect file names at all and will require future work to support file name extraction. Also: - Added recursive extraction to the tmp directory when the --leave-temps option is enabled. When not enabled, the tmp directory structure remains flat so as to prevent the likelihood of exceeding MAX_PATH. The current tmp directory is stored in the scan context. - Made the cli_scanfile() internal API non-static and added it to scanners.h so it would be accessible outside of scanners.c in order to remove code duplication within libmspack.c. - Added function comments to scanners.h and matcher.h - Converted a TDB-type macros and LSIG-type macros to enums for improved type safey. - Converted more return status variables from `int` to `cl_error_t` for improved type safety, and corrected ooxml file typing functions so they use `cli_file_t` exclusively rather than mixing types with `cl_error_t`. - Restructured the magic_scandesc() function to use goto's for error handling and removed the early_ret_from_magicscan() macro and magic_scandesc_cleanup() function. This makes the code easier to read and made it easier to add the recursive tmp directory cleanup to magic_scandesc(). - Corrected zip, egg, rar filename extraction issues. - Removed use of extra sub-directory layer for zip, egg, and rar file extraction. For Zip, this also involved changing the extracted filenames to be randomly generated rather than using the "zip.###" file name scheme.	5 years ago
Micah Snyder	206dbaefe8	Update copyright dates for 2020	5 years ago
Micah Snyder	52cddcbcfd	Updating and cleaning up copyright notices.	6 years ago
Micah Snyder	72fd33c8b2	clang-format'd using new .clang-format rules.	6 years ago
Micah Snyder	d39cb6581f	Updating libclamunrar from legacy C implementation to modern unrar 5.6.5. API changes and supporting changes included to pass the filepath of the scanned file into libclamav through the cli_ctx structure, required by the unrar library to open archives. The filename argument may be optional for the scandesc scanning variant, but libclamav will make a best effort to identify the filename from the file descriptor if it was not provided. In addition, included the ability to prefix temp file and directory names with file basenames.	7 years ago
Mickey Sola	46a35abe56	mass update of copyright headers	10 years ago
Shawn Webb	221825fd59	Update copyright information.	11 years ago
Kevin Lin	328a33258a	modified cli_map_scan and cli_map_scandesc to take a cli_file_t modified all respective calls to the above change	12 years ago
David Raynor	3cab931d78	Add ForceToDisk option for clamd and force-to-disk arg for clamscan	12 years ago
David Raynor	1d1c4b154f	bb #1570 : partition typing and HFS+	12 years ago
Török Edvin	b3a8f9980d	cli_map_scandesc convenience API	14 years ago
Török Edvin	87f763991b	Introduce cli_map_scandesc to scan a portion of the existing file And switch CPIO, MACHO, and SWF to use it. Now they no longer need to dump a tempfile and remap. To investigate if it is possible to do this with TAR.	14 years ago
Török Edvin	b7ae31f1c7	fmapify matcher/magic_scan partially	14 years ago
Török Edvin	769f37a6f6	Default off, you can turn on via 'DevLiblog'. This also replaces the cli__stats variants with a callback for stats, so that clamd can call the cl__callback variants instead, and pass the filename as context.	15 years ago
Tomasz Kojm	7770d314ff	libclamav: allow logical sigs to be used as file type sigs (bb#2228)	15 years ago
Tomasz Kojm	edbba730b3	clamd: add ExtendedDetectionInfo (bb#1228, #1626 )	15 years ago
Török Edvin	7f0d1148d6	clamd, clamscan, libclamav: new option HeuristicScanPrecedence (bb #649 ) docs/: update docs for HeuristicScanPrecedence and ScanPartialMessages unit_tests/: add test for HeuristicScanPrecedence git-svn: trunk@4037	17 years ago
Tomasz Kojm	2023340a41	update copyrights and stick more files to GPLv2; move and add more credits to the AUTHORS file; add COPYING.BSD git-svn: trunk@3749	17 years ago
Tomasz Kojm	c754386654	mail: scan text attachments and decoded base64 bodies also with type 4 sigs (bb#378) git-svn: trunk@3615	18 years ago
Tomasz Kojm	bb34cb31fe	update some copyrights and stick to GPL v2 git-svn: trunk@3003	18 years ago
Sven Strickroth	a99111f050	remove old CVS-stuff and make the repository look more like SVN git-svn: trunk@2755	19 years ago
Tomasz Kojm	48b7b4a747	update GPL headers with new address for FSF git-svn: trunk@1901	19 years ago
Tomasz Kojm	3c91998bb4	simplify internal function declarations by passing a context structure git-svn: trunk@1845	20 years ago
Tomasz Kojm	20384c878e	extract and scan SIS packages git-svn: trunk@1805	20 years ago
Tomasz Kojm	0d0fc120bb	update git-svn: trunk@1804	20 years ago
Tomasz Kojm	5612732cad	add support for cl_engine and cli_matcher git-svn: trunk@1726	20 years ago
Tomasz Kojm	33f89aa5ab	fix compiler warnings git-svn: trunk@1407	20 years ago
Tomasz Kojm	3805ebcbe2	minor cleanup git-svn: trunk@855	21 years ago
Tomasz Kojm	85dd846059	libclamav: pe: integrate Petite unpacker from aCaB (not yet activated) git-svn: trunk@715	21 years ago
Tomasz Kojm	8000d0786b	Use new patter matching algorithm. Cleanup. git-svn: trunk@674	21 years ago
Tomasz Kojm	4048c4f6dc	initial support for MD5 signatures git-svn: trunk@667	21 years ago
Tomasz Kojm	084ee140cf	extend engine to support character alternatives and distance limits in multipattern signatures git-svn: trunk@661	21 years ago
Tomasz Kojm	888f5794bf	new method of file type detection; HTML normalisation git-svn: trunk@648	21 years ago
Tomasz Kojm	ed026d3677	minor fixes git-svn: trunk@475	21 years ago
Tomasz Kojm	b5b62ca7f1	Don't limit '*' to a single 128KB buffer. git-svn: trunk@457	21 years ago
Tomasz Kojm	e4ae772644	Big update git-svn: trunk@88	22 years ago
Luca Gibelli	e3aaff8e10	Initial revision git-svn: trunk@7	22 years ago

24 Commits (9f407d83b3dd2f18b2ffb764da71ccd992f16872)