Commit Graph

48 Commits

Author SHA1 Message Date
Török Edvin
f2b71eb961 extract URLs from mail body (bb #1482).
git-svn: trunk@5014
2009-04-02 20:36:22 +00:00
Török Edvin
4b2400fd7c fix memory leak (bb #1291)
git-svn: trunk@4395
2008-11-13 10:03:59 +00:00
Török Edvin
08402afa4c fix more compiler warnings (bb #273)
git-svn: trunk@4353
2008-11-07 18:49:47 +00:00
Török Edvin
b9b477842d fix compiler warnings, more to come (bb #273)
git-svn: trunk@4346
2008-11-06 14:27:43 +00:00
Tomasz Kojm
c702906433 cosmetics
git-svn: trunk@4255
2008-10-13 11:17:35 +00:00
Török Edvin
d9282b9765 properly null-terminate buffer (bb #1223)
git-svn: trunk@4252
2008-10-11 13:56:29 +00:00
aCaB
1405207a2b bb#1219
git-svn: trunk@4229
2008-10-07 13:35:21 +00:00
Török Edvin
048725d0d6 wrap screnc output in script tags, so that the HTML normalizer will normalize
it as javascript, and match some more samples with existing signatures

git-svn: trunk@4214
2008-10-01 09:56:48 +00:00
Török Edvin
7d4b5f164a use clistrdup/free instead of blobs (bb #828)
git-svn: trunk@4203
2008-09-23 20:52:33 +00:00
Török Edvin
6a53bbdf9a JS normalizer fixes (bb #1155)
git-svn: trunk@4202
2008-09-23 20:52:27 +00:00
Török Edvin
7959343dc7 allow make distcheck to succeed (bb #1142)
fix some more warnings
  filter -Werror in clamav-milter.c

git-svn: trunk@4141
2008-08-25 14:18:48 +00:00
Török Edvin
5cd3f734a9 add lots of warnings flags to 'make distcheck',
and turn some warnings into errors (bb #1092)

git-svn: trunk@4129
2008-08-21 20:21:43 +00:00
Török Edvin
e357da7b90 more leak fixes (bb #1141)
git-svn: trunk@4127
2008-08-21 15:58:02 +00:00
Török Edvin
e2354bdb46 fix infloop in screnc detection code (introduced around SVN r3945).
generic safeguard against infinite loops due to state == next_state
add testcase

git-svn: trunk@4062
2008-08-01 17:37:06 +00:00
Török Edvin
fb6208fd12 fixes for bb #177:
if all we have is a script in a html file still normalize it
keep global scope, we need it to parse variable names in the correct scope

git-svn: trunk@4058
2008-08-01 14:50:50 +00:00
Török Edvin
5be3029f8f fix crash in js parser
fix configure test for check: if it is not available, and we didn't pass
--enable-check, do not complain
add unit test for jsnorm bug


git-svn: trunk@3949
2008-07-14 19:31:56 +00:00
Török Edvin
745d4b38b3 add paranthesis
fix compiler warning


git-svn: trunk@3946
2008-07-14 16:24:43 +00:00
Török Edvin
a5f196457f improve normalization of screnc encoded files(bb #1022)
extract duplicated code into its own function


git-svn: trunk@3945
2008-07-14 14:50:52 +00:00
Török Edvin
8be1d5a4bb enable JS normalizer
git-svn: trunk@3919
2008-07-08 11:33:32 +00:00
Tomasz Kojm
72ce4b70eb improve handling of PDF, CAB, RTF, OLE2 and HTML files (sync with branch/0.93)
git-svn: trunk@3862
2008-05-27 16:30:47 +00:00
Tomasz Kojm
2023340a41 update copyrights and stick more files to GPLv2; move and add more credits to the AUTHORS file; add COPYING.BSD
git-svn: trunk@3749
2008-04-02 15:24:51 +00:00
Tomasz Kojm
5fc380f1cc always check return value of cli_gentemp()
git-svn: trunk@3696
2008-03-06 20:19:22 +00:00
Török Edvin
94ec89550b handle ' inside comments
git-svn: trunk@3686
2008-03-01 09:24:37 +00:00
Török Edvin
eb0757aaa8 more improvements to script end tag detection
git-svn: trunk@3684
2008-02-27 18:51:00 +00:00
Török Edvin
0f2477751d don't add newline between script tags.
git-svn: trunk@3682
2008-02-27 15:57:30 +00:00
Török Edvin
6f7c0a1548 better script end detection
git-svn: trunk@3679
2008-02-26 22:20:13 +00:00
Török Edvin
2354901f92 change ' to " inside scripts
git-svn: trunk@3678
2008-02-26 21:34:44 +00:00
Török Edvin
22b961c205 strip spaces from scripts, normalize screnc
git-svn: trunk@3675
2008-02-26 11:05:11 +00:00
Török Edvin
d5a5fef965 * libclamav/htmlnorm.c: generate only nocomment.html (always contains script too) and notags.html (bb #851)
* libclamav/hashtab.h:  len and data were reversed, invalidating entitylist.h
  * libclamav/filetypes_int.h: improve HTML filetype detection (bb #853)


git-svn: trunk@3660
2008-02-20 15:49:43 +00:00
Török Edvin
ec774193d3 SVN r3619 broke phishing detection, fixed it
git-svn: trunk@3625
2008-02-13 10:24:51 +00:00
Török Edvin
0664128a61 tagless version of HTML file (bb #162)
fix compiler warning


git-svn: trunk@3619
2008-02-11 21:41:58 +00:00
Török Edvin
b3fc7f9747 use entconv to detect UTF-16BE, and UCS-4 variants
use only cli_readline() we don't need exact conversion
drop unused functions,
simplify encoding_norm_readline(), and rename to encoding_normalize_toascii()


git-svn: trunk@3571
2008-02-01 19:38:52 +00:00
Török Edvin
a6de01aa14 handle NULL characters in HTML files. (bb #539).
git-svn: trunk@3543
2008-01-25 16:39:40 +00:00
Török Edvin
8b22c9b52a optimize char reference handling
git-svn: trunk@3532
2008-01-23 15:43:32 +00:00
Török Edvin
b0b8398b48 * contrib/entitynorm:
* use fewer entities, browsers don't support all either.
		       	* update to generate code for new entconv.
		       	* no need for configure, use just a simple Makefile
			 (it is an internal tool)
  libclamav/entconv.c, hashtab.c, htmlnorm.c:
			* don't allocate memory for each entity_norm call.
			* don't touch length of mmaped area (bb #785)
			* update htmlnorm to use new entity_norm


git-svn: trunk@3515
2008-01-21 15:52:21 +00:00
Török Edvin
4e1127c594 AC_TRY_LINK already adds a main(), remove duplicate main()
entconv improvements to improve security and performance
	Part I for  (bb #686, #386)
	TODO:
	* optimize entity_norm
	* create testfiles for unicode encoding variants
	* create a regression test
	* check for memory leaks


git-svn: trunk@3511
2008-01-20 22:18:14 +00:00
Tomasz Kojm
0808081e13 properly truncate long URLs (Edwin, bb#645)
git-svn: trunk@3372
2007-12-06 14:53:22 +00:00
Tomasz Kojm
45d6cbd9a8 fix possible NULL dereference (bb#582)
git-svn: trunk@3185
2007-08-21 20:30:15 +00:00
Tomasz Kojm
1c6fa20917 fix possible NULL dereference (bb#582)
git-svn: trunk@3184
2007-08-21 20:27:40 +00:00
Török Edvin
736112931b handle & in URLs, even with
entity-converter off; don't leave & in URLs (bb
  #535)


git-svn: trunk@3100
2007-06-16 17:03:42 +00:00
Tomasz Kojm
84fd5a614c fix some possible error path leaks by changing cli_realloc() to cli_realloc2()
git-svn: trunk@3064
2007-05-25 23:10:58 +00:00
Török Edvin
5e2a487ca8 fix uninitialized value warning
git-svn: trunk@3047
2007-05-01 20:13:27 +00:00
Török Edvin
462e8e5eb3 apply next set of patches for enabling phishing code
git-svn: trunk@3043
2007-05-01 16:46:52 +00:00
Török Edvin
f74bc8271b Update code to use new AC matcher.
Fix URL truncation.


git-svn: trunk@3039
2007-04-28 20:15:22 +00:00
Török Edvin
e4ba6d85cc leave <0x20 characters untouched in cl_experimental (don't normalize them to &xx;)
git-svn: trunk@2942
2007-03-14 19:21:52 +00:00
Török Edvin
66f7a69148 ampersands were missed in URLs.(bb #377).
git-svn: trunk@2905
2007-03-05 19:31:17 +00:00
Török Edvin
6b53b2341d Better handling of empty charset in meta tag.
git-svn: trunk@2901
2007-03-03 23:00:14 +00:00
Sven Strickroth
a99111f050 remove old CVS-stuff and make the repository look more like SVN
git-svn: trunk@2755
2007-02-17 19:02:20 +00:00