<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">

<!--Converted with LaTeX2HTML 2K.1beta (1.48)
original version by:  Nikos Drakos, CBLU, University of Leeds
* revised and updated by:  Marcus Hennecke, Ross Moore, Herb Swan
* with significant contributions from:
  Jens Lippmann, Marek Rouchal, Martin Wilck and others -->
<HTML>
<HEAD>
<TITLE>Scan engine</TITLE>
<META NAME="description" CONTENT="Scan engine">
<META NAME="keywords" CONTENT="clamdoc">
<META NAME="resource-type" CONTENT="document">
<META NAME="distribution" CONTENT="global">

<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
<META NAME="Generator" CONTENT="LaTeX2HTML v2K.1beta">
<META HTTP-EQUIV="Content-Style-Type" CONTENT="text/css">

<LINK REL="STYLESHEET" HREF="clamdoc.css">

<LINK REL="next" HREF="node76.html">
<LINK REL="previous" HREF="node74.html">
<LINK REL="up" HREF="node72.html">
<LINK REL="next" HREF="node76.html">
</HEAD>

<BODY >
<!--Navigation Panel-->
<A NAME="tex2html1289"
  HREF="node76.html">
<IMG WIDTH="37" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="next"
 SRC="/usr/share/latex2html/icons/next.png"></A> 
<A NAME="tex2html1285"
  HREF="node72.html">
<IMG WIDTH="26" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="up"
 SRC="/usr/share/latex2html/icons/up.png"></A> 
<A NAME="tex2html1279"
  HREF="node74.html">
<IMG WIDTH="63" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="previous"
 SRC="/usr/share/latex2html/icons/prev.png"></A> 
<A NAME="tex2html1287"
  HREF="node1.html">
<IMG WIDTH="65" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="contents"
 SRC="/usr/share/latex2html/icons/contents.png"></A>  
<BR>
<B> Next:</B> <A NAME="tex2html1290"
  HREF="node76.html">CVD format</A>
<B> Up:</B> <A NAME="tex2html1286"
  HREF="node72.html">LibClamAV</A>
<B> Previous:</B> <A NAME="tex2html1280"
  HREF="node74.html">Database reloading</A>
 &nbsp <B>  <A NAME="tex2html1288"
  HREF="node1.html">Contents</A></B> 
<BR>
<BR>
<!--End of Navigation Panel-->

<H2><A NAME="SECTION00073000000000000000"></A><A NAME="engine"></A>
<BR>
Scan engine
</H2>
    New versions of Clam AntiVirus use a mutation of the Aho-Corasick
    pattern matching algorithm. The algorithm is based a finite state pattern
    matching automaton [<A
 HREF="node85.html#clr">1</A>] and it's a generalization of the famous
    Knuth-Morris-Pratt algorithm. Please take a look at the <code>matcher.h</code>
    for data type definitions. The automaton is represented by a trie. It
    is a rooted tree with some specific properties [<A
 HREF="node85.html#acwww">2</A>]. Every node
    of the trie represents some state of the automaton. In our implementation,
    the node is defined as follows:
    <PRE>
	struct cl_node {
	    short int islast;
	    struct cli_patt *list;
	    int maxpatlen;
	    struct node *next[NUM_CHILDS], *trans[NUM_CHILDS], *fail;
	};
</PRE>
    [To be continued...]

<P>
<BR><HR>
<ADDRESS>
Tomasz Kojm
2004-06-14
</ADDRESS>
</BODY>
</HTML>