Török Edvin
048725d0d6
wrap screnc output in script tags, so that the HTML normalizer will normalize
...
it as javascript, and match some more samples with existing signatures
git-svn: trunk@4214
17 years ago
Török Edvin
7d4b5f164a
use clistrdup/free instead of blobs (bb #828 )
...
git-svn: trunk@4203
17 years ago
Török Edvin
6a53bbdf9a
JS normalizer fixes (bb #1155 )
...
git-svn: trunk@4202
17 years ago
Török Edvin
7959343dc7
allow make distcheck to succeed (bb #1142 )
...
fix some more warnings
filter -Werror in clamav-milter.c
git-svn: trunk@4141
17 years ago
Török Edvin
5cd3f734a9
add lots of warnings flags to 'make distcheck',
...
and turn some warnings into errors (bb #1092 )
git-svn: trunk@4129
17 years ago
Török Edvin
e357da7b90
more leak fixes (bb #1141 )
...
git-svn: trunk@4127
17 years ago
Török Edvin
e2354bdb46
fix infloop in screnc detection code (introduced around SVN r3945).
...
generic safeguard against infinite loops due to state == next_state
add testcase
git-svn: trunk@4062
17 years ago
Török Edvin
fb6208fd12
fixes for bb #177 :
...
if all we have is a script in a html file still normalize it
keep global scope, we need it to parse variable names in the correct scope
git-svn: trunk@4058
17 years ago
Török Edvin
5be3029f8f
fix crash in js parser
...
fix configure test for check: if it is not available, and we didn't pass
--enable-check, do not complain
add unit test for jsnorm bug
git-svn: trunk@3949
17 years ago
Török Edvin
745d4b38b3
add paranthesis
...
fix compiler warning
git-svn: trunk@3946
17 years ago
Török Edvin
a5f196457f
improve normalization of screnc encoded files(bb #1022 )
...
extract duplicated code into its own function
git-svn: trunk@3945
17 years ago
Török Edvin
8be1d5a4bb
enable JS normalizer
...
git-svn: trunk@3919
17 years ago
Tomasz Kojm
72ce4b70eb
improve handling of PDF, CAB, RTF, OLE2 and HTML files (sync with branch/0.93)
...
git-svn: trunk@3862
17 years ago
Tomasz Kojm
2023340a41
update copyrights and stick more files to GPLv2; move and add more credits to the AUTHORS file; add COPYING.BSD
...
git-svn: trunk@3749
17 years ago
Tomasz Kojm
5fc380f1cc
always check return value of cli_gentemp()
...
git-svn: trunk@3696
17 years ago
Török Edvin
94ec89550b
handle ' inside comments
...
git-svn: trunk@3686
17 years ago
Török Edvin
eb0757aaa8
more improvements to script end tag detection
...
git-svn: trunk@3684
17 years ago
Török Edvin
0f2477751d
don't add newline between script tags.
...
git-svn: trunk@3682
17 years ago
Török Edvin
6f7c0a1548
better script end detection
...
git-svn: trunk@3679
17 years ago
Török Edvin
2354901f92
change ' to " inside scripts
...
git-svn: trunk@3678
17 years ago
Török Edvin
22b961c205
strip spaces from scripts, normalize screnc
...
git-svn: trunk@3675
17 years ago
Török Edvin
d5a5fef965
* libclamav/htmlnorm.c: generate only nocomment.html (always contains script too) and notags.html (bb #851 )
...
* libclamav/hashtab.h: len and data were reversed, invalidating entitylist.h
* libclamav/filetypes_int.h: improve HTML filetype detection (bb #853 )
git-svn: trunk@3660
18 years ago
Török Edvin
ec774193d3
SVN r3619 broke phishing detection, fixed it
...
git-svn: trunk@3625
18 years ago
Török Edvin
0664128a61
tagless version of HTML file (bb #162 )
...
fix compiler warning
git-svn: trunk@3619
18 years ago
aCaB
2455584143
sync to trunk
...
git-svn-id: file:///var/lib/svn/clamav-devel/branches/newlimits@3599 77e5149b-7576-45b1-b177-96237e5ba77b
18 years ago
Török Edvin
b3fc7f9747
use entconv to detect UTF-16BE, and UCS-4 variants
...
use only cli_readline() we don't need exact conversion
drop unused functions,
simplify encoding_norm_readline(), and rename to encoding_normalize_toascii()
git-svn: trunk@3571
18 years ago
Török Edvin
a6de01aa14
handle NULL characters in HTML files. (bb #539 ).
...
git-svn: trunk@3543
18 years ago
Török Edvin
8b22c9b52a
optimize char reference handling
...
git-svn: trunk@3532
18 years ago
Török Edvin
b0b8398b48
* contrib/entitynorm:
...
* use fewer entities, browsers don't support all either.
* update to generate code for new entconv.
* no need for configure, use just a simple Makefile
(it is an internal tool)
libclamav/entconv.c, hashtab.c, htmlnorm.c:
* don't allocate memory for each entity_norm call.
* don't touch length of mmaped area (bb #785 )
* update htmlnorm to use new entity_norm
git-svn: trunk@3515
18 years ago
Török Edvin
4e1127c594
AC_TRY_LINK already adds a main(), remove duplicate main()
...
entconv improvements to improve security and performance
Part I for (bb #686 , #386 )
TODO:
* optimize entity_norm
* create testfiles for unicode encoding variants
* create a regression test
* check for memory leaks
git-svn: trunk@3511
18 years ago
Tomasz Kojm
0808081e13
properly truncate long URLs (Edwin, bb#645)
...
git-svn: trunk@3372
18 years ago
Tomasz Kojm
45d6cbd9a8
fix possible NULL dereference (bb#582)
...
git-svn: trunk@3185
18 years ago
Tomasz Kojm
1c6fa20917
fix possible NULL dereference (bb#582)
...
git-svn: trunk@3184
18 years ago
Török Edvin
736112931b
handle & in URLs, even with
...
entity-converter off; don't leave & in URLs (bb
#535 )
git-svn: trunk@3100
18 years ago
Tomasz Kojm
84fd5a614c
fix some possible error path leaks by changing cli_realloc() to cli_realloc2()
...
git-svn: trunk@3064
18 years ago
Török Edvin
5e2a487ca8
fix uninitialized value warning
...
git-svn: trunk@3047
18 years ago
Török Edvin
462e8e5eb3
apply next set of patches for enabling phishing code
...
git-svn: trunk@3043
18 years ago
Török Edvin
f74bc8271b
Update code to use new AC matcher.
...
Fix URL truncation.
git-svn: trunk@3039
18 years ago
Török Edvin
4215d6f35b
compilation fix
...
git-svn-id: file:///var/lib/svn/clamav-devel/branches/temp_dconf_phishenable@2986 77e5149b-7576-45b1-b177-96237e5ba77b
18 years ago
Török Edvin
d6dfde3fd3
replace CL_EXPERIMENTAL with dconf in htmlnorm.
...
Pass dconf structure to cli_html_normalise
git-svn-id: file:///var/lib/svn/clamav-devel/branches/temp_dconf_phishenable@2983 77e5149b-7576-45b1-b177-96237e5ba77b
18 years ago
Török Edvin
e4ba6d85cc
leave <0x20 characters untouched in cl_experimental (don't normalize them to &xx;)
...
git-svn: trunk@2942
18 years ago
Török Edvin
66f7a69148
ampersands were missed in URLs.(bb #377 ).
...
git-svn: trunk@2905
18 years ago
Török Edvin
6b53b2341d
Better handling of empty charset in meta tag.
...
git-svn: trunk@2901
18 years ago
Sven Strickroth
a99111f050
remove old CVS-stuff and make the repository look more like SVN
...
git-svn: trunk@2755
19 years ago
Tomasz Kojm
fc83da8244
remove some gcc warnings
...
git-svn: trunk@2701
19 years ago
Tomasz Kojm
e2a621c621
better fix
...
git-svn: trunk@2699
19 years ago
Tomasz Kojm
6529ffb8f1
fix possible crash
...
git-svn: trunk@2698
19 years ago
Tomasz Kojm
093789b9ef
fix whitespace normalization (bb#283)
...
git-svn: trunk@2687
19 years ago
Tomasz Kojm
c8184020c4
apply entconv patch from Edwin
...
git-svn: trunk@2675
19 years ago
Tomasz Kojm
6f38c93985
s/strdup/cli_strdup/g
...
git-svn: trunk@2662
19 years ago