An apparatus and method for determining whether a query document matches one or more of a plurality of documents in a database. In a coarse matching stage, a compressed file or other query document is scanned to produce a bit profile. Global statistics such as line spacing and text height are calculated from...

http://www.google.fr/patents/US6363381?utm_source=gb-gplus-share
Patent US6363381 - Compressed document matching
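
As a rough illustration of the coarse matching stage described in the abstract, the sketch below derives a per-row profile from a page and estimates text height and line spacing from it. The abstract is truncated and does not define the bit profile or say what the statistics are calculated from, so everything here is an assumption: the profile is taken to be the count of set bits in each row of an uncompressed binary raster, and the names `bit_profile`, `text_line_runs`, `global_statistics`, and the `threshold` parameter are hypothetical. It does not read or decode a compressed file and is not the patented method.

```python
from statistics import median


def bit_profile(rows):
    """Assumed definition: number of set bits in each scan row of a binary page."""
    return [sum(row) for row in rows]


def text_line_runs(profile, threshold=0):
    """Return (start, end) row ranges, end exclusive, where profile > threshold."""
    runs = []
    start = None
    for i, value in enumerate(profile):
        if value > threshold and start is None:
            start = i  # a text line begins
        elif value <= threshold and start is not None:
            runs.append((start, i))  # the text line ended on the previous row
            start = None
    if start is not None:
        runs.append((start, len(profile)))  # text line runs to the last row
    return runs


def global_statistics(profile, threshold=0):
    """Estimate text height and line spacing (both in rows) from a profile.

    Text height is the median run length; line spacing is the median
    distance between the starting rows of consecutive runs.
    """
    runs = text_line_runs(profile, threshold)
    if not runs:
        return {"text_height": None, "line_spacing": None}
    heights = [end - start for start, end in runs]
    pitches = [nxt[0] - cur[0] for cur, nxt in zip(runs, runs[1:])]
    return {
        "text_height": median(heights),
        "line_spacing": median(pitches) if pitches else None,
    }
```

For a toy page whose text lines are two rows tall and start every five rows, `global_statistics(bit_profile(page))` reports a text height of 2 and a line spacing of 5. Medians are used so that a single unusually tall or short line does not dominate the estimate; that choice is a design assumption of this sketch, not something stated in the abstract.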