clamav

Commit Graph

Author	SHA1	Message	Date
Val Snyder	8a77214c82	Add CL_TYPE_AI_MODEL and associated file type magic signatures This is just preliminary support for identifying an assortment of different AI model files. So far, this detects the following types: - GGML GGUF (.gguf) - ONNX AI (.onnx) - TensorFlow Lite (.tflite) Additional types to consider: - SafeTensors (.safetensors) - TensorFlow (.pb, .ckpt, .tfrecords) - Keras (.keras) - pickle (.pkl) - numpy (.npy, .npz) - coreml (.coreml) - PyTorch (.pt, .pth, .bin, .mar, .pte, .pt2, .ptl) Outside of being able to differentiate by file type, the scanner will treat CL_TYPE_AI_MODEL the same as CL_TYPE_BINARY_DATA. We're not adding parsers to further process these files, for now.	2 months ago
Val Snyder	7ff29b8c37	Bump copyright dates for 2025	3 months ago
Micah Snyder	b6ebfbdf11	clam-format touchup	1 year ago
Andy Ragusa	79f2a5f2f6	Add parser for ALZ archives	1 year ago
Micah Snyder	1cb7ab4dc3	Improve LZH file type magic sigs, and C-Rust FFI memory leak	1 year ago
Micah Snyder	3ae9c1e434	Add LHA/LZH archive support File type magic signatures chosen based on the extensions supported by Rust delharc crate. See: https://docs.rs/delharc/latest/delharc/	1 year ago
Micah Snyder	9cb28e51e6	Bump copyright dates for 2024	1 year ago
Micah Snyder	fd11f1b468	Add CL_TYPE_PYTHON_COMPILED and associated file type magic signatures It may be necessary to differentiate between *.pyc and other binary types in case additional processing is needed. Outside of being able to differentiate the by file type, the scanner will treat CL_TYPE_PYTHON_COMPILED the same as CL_TYPE_BINARY_DATA. That is - we're not adding parser at this time to further break down .pyc files.	1 year ago
Micah Snyder	3b2f8c044a	Support for extracting attachments from OneNote section files Includes rudimentary support for getting slices from FMap's and for interacting with libclamav's context structure. For now will use a Cisco-Talos org fork of the onenote_parser until the feature to read open a onenote section from a slice (instead of from a filepath) is added to the upstream.	1 year ago
Andy Ragusa	b4f0836236	Add support for UDF files Add support for specifically for Beginning Extended Area Descriptor (BEA01) type of UDF files.	2 years ago
Micah Snyder	6eebecc303	Bump copyright for 2023	2 years ago
micasnyd	140c88aa4e	Bump copyright for 2022 Includes minor format corrections.	3 years ago
Andrew	b88f0abecf	libclamav: Update the ordering of internal FTM sigs This commit updates the ordering of the internal FTM sigs to match what's in daily.ftm today. No FTM signature changes are included as part of this commit (only re-ordering).	4 years ago
Andrew	ebb9989c19	libclamav: sync built-in FTM rules with daily.ftm Note that some rules added to the built-in list hadn't been propagated into daily.ftm, so I'll merge those in alongside this commit.	4 years ago
Micah Snyder (micasnyd)	b9ca6ea103	Update copyright dates for 2021 Also fixes up clang-format.	4 years ago
Micah Snyder	4d2b100ade	Code review fixes The new GIF, PNG, TIFF, and JPEG types should be enabled for 0.103.1+ (aka FLEVEL 122+)	4 years ago
Micah Snyder	1ae678c945	JPEG format validator improvements Adds debug output to the JPEG format validator to help resolve issues with unusually formatted JPEGs and to validate that the JPEG parser is working correctly. Relaxes the rules around duplicate application markers or application markers that appear later than expected, due to prior XMP metadata, etc. Removed the requirement for an application marker to exist, as some older JPEGs don't appear to use JFIF, Exif, or SPIFF application extensions. I tested against a relatively large data set of JPEGs from Mac & Windows stock photos, personal photos, and assorted downloaded photos. FP rates when alerting on broken media should be very low.	4 years ago
Micah Snyder	4cce1fcd20	GIF, PNG bugfixes; Add AlertBrokenMedia option Added a new scan option to alert on broken media (graphics) file formats. This feature mitigates the risk of malformed media files intended to exploit vulnerabilities in other software. At present media validation exists for JPEG, TIFF, PNG, and GIF files. To enable this feature, set `AlertBrokenMedia yes` in clamd.conf, or use the `--alert-broken-media` option when using `clamscan`. These options are disabled by default for now. Application developers may enable this scan option by enabling `CL_SCAN_HEURISTIC_BROKEN_MEDIA` for the `heuristic` scan option bit field. Fixed PNG parser logic bugs that caused an excess of parsing errors and fixed a stack exhaustion issue affecting some systems when scanning PNG files. PNG file type detection was disabled via signature database update for 0.103.0 to mitigate effects from these bugs. Fixed an issue where PNG and GIF files no longer work with Target:5 (graphics) signatures if detected as CL_TYPE_PNG/GIF rather than as CL_TYPE_GRAPHICS. Target types now support up to 10 possible file types to make way for additional graphics types in future releases. Scanning JPEG, TIFF, PNG, and GIF files will no longer return "parse" errors when file format validation fails. Instead, the scan will alert with the "Heuristics.Broken.Media" signature prefix and a descriptive suffix to indicate the issue, provided that the "alert broken media" feature is enabled. GIF format validation will no longer fail if the GIF image is missing the trailer byte, as this appears to be a relatively common issue in otherwise functional GIF files. Added a TIFF dynamic configuration (DCONF) option, which was missing. This will allow us to disable TIFF format validation via signature database update in the event that it proves to be problematic. This feature already exists for many other file types. Added CL_TYPE_JPEG and CL_TYPE_TIFF types.	4 years ago
Micah Snyder	9f2de39e04	New tmp sub-dir per scan; JSON meta improvements This commit improves the layout of the tmp file output and the JSON metadata output when using the --leave-temps and --gen-json options. For all scans, each scan target will get a unique tmp sub-directory. If using --leave-temps, that subdir will include the basename of the original file to make it easier to identify. Additionally, when using --leave-temps option, all extracted objects will have their subdirectories extracted in recursive subdirectories including filename prefixes where available. When not using the --leave-temps option, the layout of the tmp sub-directory will remain flat, so as to alleviate the possibility of exceeding PATH_MAX. The JSON metadata generated by the --gen-json option is now generated for all file types, not just a select few. The format is also pretty-printed for readability and now includes filenames and file paths when available. Also: - Added missing ALLMATCH check when determining if bytecode hooks should be run. - Added cl_engine_get_str API to windows libclamav symbol export file.	5 years ago
Mickey Sola	19894948b7	ppr/73 - add credit to news; fix formatting to be compliant with clamav standards	5 years ago
Aldo Mazzeo	f366b7c703	Transforming the PNG checker into a PNG exploit seeker	5 years ago
Mickey Sola	a25d48d7fa	gif - clang formatted; copyright dates fixed	5 years ago
Aldo Mazzeo	153a87a74b	Making the GIF parser more tolerant and supporting GIF overlays	5 years ago
Mickey Sola	0018365456	amp - add new signature for file typing which matches against word xml documents which lack the <w:wordDocument tag	5 years ago
Micah Snyder	206dbaefe8	Update copyright dates for 2020	5 years ago
Micah Snyder	0450e68551	Added new EGG archive extraction feature, written from scratch based on ESTsoft's EGG archive specification. EGG extraction support includes deflate, bzip2, and lzma decompression. AZO (LZO?) decompression not yet supported. Solid archives not yet supported. Split archives may have some limited success. This commit also includes updates to autoconf iconv.m4 file enable detection of libiconv in alternative install locations.	6 years ago
Micah Snyder	52cddcbcfd	Updating and cleaning up copyright notices.	6 years ago
Micah Snyder	72fd33c8b2	clang-format'd using new .clang-format rules.	6 years ago
Micah Snyder (micasnyd)	56bb195e07	bb12102: adding CL_TYPE_LNK for Windows Shortcut Files.	7 years ago
Steven Morgan	aedd18ac32	bb11586 - change CL_TYPE_EPS to CL_TYPE_PS.	9 years ago
Steven Morgan	e98acd72db	bb11586 - add file type CL_TYPE_EPS for raw scan matching of PostScript files.	9 years ago
Kevin Lin	ef48d7cbeb	MHTML: added filetype and switch case	9 years ago
Kevin Lin	bd026b0d9b	filetype consistency	9 years ago
Steven Morgan	99ce69a171	Change RTF file magic from '{\rtf' to '{\rt'	9 years ago
Kevin Lin	6cd5a9dc4e	hwpole2: new filetype and handler for hwp embedded ole2 files	10 years ago
Kevin Lin	904fe15510	add HMPML filetype, tab fixes in filetype.c	10 years ago
Kevin Lin	146fbb29ad	add HWP 3.x internal filetypes	10 years ago
Mickey Sola	46a35abe56	mass update of copyright headers	10 years ago
Steven Morgan	71d3778a63	bb11361 - add file magics for TIFF files.	10 years ago
Kevin Lin	517ce6c007	updated internal msxml 2003 file magics	10 years ago
Kevin Lin	842914d78c	added default filetype magic for LZMA compressed SWF	10 years ago
Kevin Lin	e66b3f9e48	constain default file magics for msxml documents (decrease fps)	10 years ago
Kevin Lin	76074a5755	fixed filetype int magic entry for CL_TYPE_XML_XL	10 years ago
Kevin Lin	740d013e1f	added file magic signatures for MSDOC 2003 XML files	10 years ago
Shawn Webb	005c986166	Adjust the XDP filetyping.	11 years ago
Shawn Webb	30a7509744	Add proof-of-concept XDP support. This feature requires libxml2 support. This commit bumps FLEVEL and introduces a new filetype based on the expected XML namespace for XDP files.	11 years ago
Steven Morgan	a03db40e78	add version to property file magic.	11 years ago
Steven Morgan	63ca7667bf	change internal magic to { "Magic": "CLAMJSON"	11 years ago
Steven Morgan	5b7892278c	Stronger magic for internal json file typing.	11 years ago
Steven Morgan	f76e3ad7d7	up flevel for internal json file typing.	11 years ago

1 2

90 Commits (8a77214c8292f5f7ce94dee7b4f363b5c14d91af)