A content-based system and method for text processing and retrieval is provided wherein a plurality of pieces of text are processed based on content to generate an index for each piece of text, the index comprising a list of phrases that represent the content of the piece of text. The phrases are grouped...http://www.google.fr/patents/US5963965?utm_source=gb-gplus-shareBrevet US5963965 - Text processing and retrieval system and method