Class OldExcelExtractor

  • All Implemented Interfaces:
    Closeable, AutoCloseable, POITextExtractor

    public class OldExcelExtractor
    extends Object
    implements POITextExtractor
    A text extractor for old Excel files, which are too old for HSSFWorkbook to handle. This includes Excel 95, and very old (pre-OLE2) Excel files, such as Excel 4 files.

    Returns much (but not all) of the textual content of the file, suitable for indexing by something like Apache Lucene, or used by Apache Tika, but not really intended for display to the user.