com.theloutons.search.utils
Class DocReadOrgainzer

java.lang.Object
  extended bycom.theloutons.search.utils.DocReadOrgainzer

public class DocReadOrgainzer
extends java.lang.Object

Author:
Tom sg00756 This can be instantiated, but at the moment this is not meaningful.

Method Summary
static org.apache.lucene.document.Document getDocument(java.io.File f, java.util.Properties def, java.io.PrintWriter out)
          At the moment
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Method Detail

getDocument

public static org.apache.lucene.document.Document getDocument(java.io.File f,
                                                              java.util.Properties def,
                                                              java.io.PrintWriter out)
At the moment

Parameters:
f - is the name of the file to be read. At the moment we look at the suffix to decide, html, doc (winword), pdf, txt, csv, rtf and ppt Before deciding what to do, add general properties to the doc. If we cannot handle the document, then the doc is set to document from this file to be returned to be indexed. If the file is not of a type which can be handled, null is returned.
Returns:
the filled and indexed lucene document